Tennis Weather Dataset

Weather Data
This is the complete dataset used to train and test our prediction model.
Day   Outlook    Temp   Humidity   Wind     Play Tennis?
d1    Sunny      hot    high       weak     no
d2    Sunny      hot    high       strong   no
d3    Overcast   hot    high       weak     yes
d4    Rainy      mild   high       weak     yes
d5    Rainy      cool   normal     weak     yes
d6    Rainy      cool   normal     strong   no
d7    Overcast   cool   normal     strong   yes
d8    Sunny      mild   high       weak     no
d9    Sunny      cool   normal     weak     yes
d10   Rainy      mild   normal     weak     yes
d11   Sunny      mild   normal     strong   yes
d12   Overcast   mild   high       strong   yes
d13   Overcast   hot    normal     weak     yes
d14   Rainy      mild   high       strong   no

How the Model is Trained
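The model learns from the data through a process called "supervised learning": each record pairs a day's weather conditions (Outlook, Temp, Humidity, Wind) with a known answer (Play Tennis?), and the model learns to predict the answer from the conditions.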
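
As a rough illustration of what "learning from labeled data" means here, the sketch below separates a few records from the table into inputs (features) and known answers (labels). The variable names are illustrative assumptions, not taken from the original model code.

```python
# Three records copied from the table above:
# (outlook, temp, humidity, wind, play_tennis)
records = [
    ("Sunny", "hot", "high", "weak", "no"),         # d1
    ("Overcast", "hot", "high", "weak", "yes"),     # d3
    ("Rainy", "cool", "normal", "strong", "no"),    # d6
]

# Supervised learning needs inputs and the known answer for each input.
features = [row[:-1] for row in records]  # the four weather columns
labels = [row[-1] for row in records]     # the "Play Tennis?" column

for x, y in zip(features, labels):
    print(x, "->", y)
```

A model is then fitted so that, given a new tuple of weather conditions, it outputs a "yes" or "no" label.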

Step 1: Train-Test Split

The dataset is split into two parts: a larger Training Set to teach the model, and a smaller Testing Set to evaluate its accuracy. The blue rows in the table above represent the testing data.

Full Dataset: 14 records
Training Set: 10 records
Testing Set: 4 records
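
A minimal sketch of a 10/4 split, assuming the records are shuffled at random before the last four are held out. The page does not say how its testing rows were chosen, so the shuffle, the seed, and the helper name train_test_split are assumptions made for illustration.

```python
import random

def train_test_split(records, test_size, seed=0):
    """Shuffle a copy of the records and hold out test_size of them."""
    rng = random.Random(seed)      # fixed seed so the split is repeatable
    shuffled = list(records)
    rng.shuffle(shuffled)
    return shuffled[test_size:], shuffled[:test_size]

# Day identifiers d1..d14 stand in for the full records.
days = [f"d{i}" for i in range(1, 15)]
train, test = train_test_split(days, test_size=4)
print(len(train), len(test))  # 10 4
```

The model only ever sees the training records while it is being fitted; the held-out records are used afterwards to check how well it predicts days it has not seen.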

Step 2: Training the Random Forest

The model is a Random Forest: a collection of many individual Decision Trees. Each tree is trained on a random subset of the training data and features. To make a prediction, every tree "votes", and the majority outcome becomes the final prediction. Combining many trees in this way generally makes predictions more accurate, and less sensitive to quirks in the training data, than relying on any single tree.

Tree 1, Tree 2, Tree 3, ..., Many Trees
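
The voting idea can be sketched in plain Python. This is a deliberately simplified stand-in, not the page's actual model: each "tree" is only one level deep (a decision stump), it looks at a single randomly chosen feature, and it is fitted on a random sample drawn with replacement. Because the page's real train/test split cannot be recovered from the text, the sketch trains on all 14 records. All function names and the choice of 25 trees are assumptions.

```python
import random
from collections import Counter

# The 14 records from the table: (outlook, temp, humidity, wind, play_tennis)
DATA = [
    ("Sunny", "hot", "high", "weak", "no"),
    ("Sunny", "hot", "high", "strong", "no"),
    ("Overcast", "hot", "high", "weak", "yes"),
    ("Rainy", "mild", "high", "weak", "yes"),
    ("Rainy", "cool", "normal", "weak", "yes"),
    ("Rainy", "cool", "normal", "strong", "no"),
    ("Overcast", "cool", "normal", "strong", "yes"),
    ("Sunny", "mild", "high", "weak", "no"),
    ("Sunny", "cool", "normal", "weak", "yes"),
    ("Rainy", "mild", "normal", "weak", "yes"),
    ("Sunny", "mild", "normal", "strong", "yes"),
    ("Overcast", "mild", "high", "strong", "yes"),
    ("Overcast", "hot", "normal", "weak", "yes"),
    ("Rainy", "mild", "high", "strong", "no"),
]
N_FEATURES = 4

def majority(labels):
    """Return the most common label in a non-empty list."""
    return Counter(labels).most_common(1)[0][0]

def train_stump(rows, rng):
    """Fit a one-level tree on a random sample and one random feature."""
    sample = [rng.choice(rows) for _ in rows]   # sample with replacement
    feature = rng.randrange(N_FEATURES)         # random feature index
    labels_by_value = {}
    for row in sample:
        labels_by_value.setdefault(row[feature], []).append(row[-1])
    table = {v: majority(ls) for v, ls in labels_by_value.items()}
    default = majority([row[-1] for row in sample])  # for unseen values
    return feature, table, default

def predict_stump(stump, x):
    feature, table, default = stump
    return table.get(x[feature], default)

def train_forest(rows, n_trees=25, seed=0):
    rng = random.Random(seed)
    return [train_stump(rows, rng) for _ in range(n_trees)]

def predict_forest(forest, x):
    """Every tree votes; the majority label is the final prediction."""
    return majority([predict_stump(stump, x) for stump in forest])

forest = train_forest(DATA)
print(predict_forest(forest, ("Sunny", "hot", "high", "weak")))
```

A full Random Forest would grow deeper trees and typically consider a random subset of features at each split, but the two ingredients described above, randomised training and majority voting, are the same.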