View statlib-20050214 wseries (public)

- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,String
- Download
- # Instances: 90 / # Attributes: 9
- HDF5 (45.9 KB), XML, CSV, ARFF, LibSVM, Matlab, Octave. Files are converted on demand and the process can take up to a minute.
- Original Data Format
- arff
- Name
- wseries
- Version mldata
- 0
- Comment
These data tell whether or not the home team won for each game played in all World Series prior to 1994. The data appear as the STATS Challenge for Issue 11.
DATA:
Submitted by Jeff Witmer, Oberlin College, Oberlin, Ohio. E-mail: fwitmer@ocvaxa.cc.oberlin.edu
Below are data on wins and losses for all World Series games, starting in 1903 and ending in 1993. (There was no World Series in 1904.) For each year the data are presented from the point of view of the eventual winner. A capital L means that the eventual Series winner lost at home; a lower case l means they lost on the road. Likewise, a capital W means that the eventual Series winner won at home and a lower case w means that they won on the road.
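The letter coding described above can be unpacked mechanically. The following is a minimal sketch, assuming a series is held as a plain string of the four letters; the names `Game`, `decode_series`, and `home_team_won` are hypothetical and not part of the dataset.

```python
from typing import List, NamedTuple


class Game(NamedTuple):
    winner_won: bool      # True if the eventual Series winner won this game
    winner_at_home: bool  # True if the eventual Series winner was the home team


def decode_series(code: str) -> List[Game]:
    """Decode a string of L/l/W/w letters into one Game per character."""
    games = []
    for ch in code:
        if ch not in ("L", "l", "W", "w"):
            raise ValueError(f"unexpected outcome code: {ch!r}")
        # Upper case means the eventual winner played at home;
        # W or w means the eventual winner won the game.
        games.append(Game(winner_won=ch in ("W", "w"), winner_at_home=ch.isupper()))
    return games


def home_team_won(game: Game) -> bool:
    # The home team wins when the eventual winner wins at home (W)
    # or loses on the road (l).
    return game.winner_won == game.winner_at_home
```

For instance, `decode_series("Wl")` yields two games in which the home team won, once as the eventual winner and once as the eventual loser.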
Note that in 1903, 1919, 1920, and 1921 the contests were best-five-out-of-nine series; in all other years they were best-four-out-of-seven series.
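The series format fixes how many wins end a series, which gives a simple consistency check on each record. This is a sketch under stated assumptions: the function names are hypothetical, the set of best-of-nine years is supplied by the caller (the example uses 1903, 1919, 1920, and 1921, taken from the historical record), and the code strings are made up for illustration rather than read from the dataset.

```python
def wins_needed(year, best_of_nine_years):
    """Wins required to take the series: 5 in a best-of-nine year, otherwise 4."""
    return 5 if year in best_of_nine_years else 4


def is_complete(year, code, best_of_nine_years):
    """True if the series ends exactly when the eventual winner reaches the target."""
    target = wins_needed(year, best_of_nine_years)
    wins = losses = 0
    for i, ch in enumerate(code):
        if ch in ("W", "w"):
            wins += 1
        elif ch in ("L", "l"):
            losses += 1
        else:
            raise ValueError(f"unexpected outcome code: {ch!r}")
        if losses == target:
            # The opponent would have clinched first, so the record is inconsistent.
            return False
        if wins == target:
            # The clinching win must be the last recorded game.
            return i == len(code) - 1
    return False


# Assumption: best-of-nine years taken from the historical record.
BEST_OF_NINE = {1903, 1919, 1920, 1921}

print(is_complete(1927, "WWww", BEST_OF_NINE))  # True: four wins suffice
print(is_complete(1903, "WWww", BEST_OF_NINE))  # False: five wins are needed
```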
Here are a few questions you might consider as you analyze these data:
1) Is there a constant probability of winning within a given series, or is there evidence of, e.g., a home field advantage?
2) Is there independence in the outcome from game to game? If not, is there a simple kind of conditional independence from game to game?
Some authors have found a propensity for the team that has just lost to win the next game. What do you think?
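One possible starting point for both questions is to tabulate two simple summaries: the fraction of games won by the home team, and counts of consecutive result pairs within each series. The sketch below assumes each series is a string of L/l/W/w letters; the function names and the two sample strings are hypothetical and are not rows from the dataset.

```python
def home_win_fraction(series_codes):
    """Fraction of all games won by the home team."""
    home_wins = total = 0
    for code in series_codes:
        for ch in code:
            total += 1
            # The home team wins on 'W' (eventual winner won at home)
            # or 'l' (eventual winner lost on the road).
            if ch in ("W", "l"):
                home_wins += 1
    return home_wins / total if total else float("nan")


def transition_counts(series_codes):
    """Count (previous, next) results for the eventual winner within each series."""
    counts = {(a, b): 0 for a in ("win", "loss") for b in ("win", "loss")}
    for code in series_codes:
        results = ["win" if ch in ("W", "w") else "loss" for ch in code]
        # Pairs never span two different series.
        for prev, nxt in zip(results, results[1:]):
            counts[(prev, nxt)] += 1
    return counts


# Made-up strings, for illustration only.
sample = ["WWww", "lWwLWw"]
print(home_win_fraction(sample))
print(transition_counts(sample))
```

Because every record is written from the eventual winner's point of view and always ends with a win, raw pair counts are conditioned on the final outcome; any comparison of "win after loss" against "win after win" would need to account for that before it is read as evidence of dependence.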
L = loss at home; l = loss on road; W = win at home; w = win on road
The data description and the data may be freely used for non-commercial purposes and may be freely distributed. Copyright remains with the author and with STATS magazine.
Information about the dataset: CLASSTYPE: nominal; CLASSINDEX: none specific
- Names
- year,outcome_1,outcome_2,outcome_3,outcome_4,outcome_5,outcome_6,outcome_7,outcome_8
- Types
- numeric
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,loss_on_road,win_at_home,win_on_road
- nominal:loss_at_home,win_at_home,win_on_road
- nominal:win_at_home,win_on_road
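The attribute types listed above can be turned into a small row check. This is a minimal sketch, assuming each row is a dictionary keyed by the attribute names and that a missing game is represented by `None`; `ALLOWED` and `validate_row` are hypothetical names, and the sample row is made up for illustration.

```python
ALL_OUTCOMES = ("loss_at_home", "loss_on_road", "win_at_home", "win_on_road")

# outcome_1 .. outcome_6 accept all four values.
ALLOWED = {f"outcome_{i}": set(ALL_OUTCOMES) for i in range(1, 7)}
# outcome_7 and outcome_8 declare narrower value sets in the listing above.
ALLOWED["outcome_7"] = {"loss_at_home", "win_at_home", "win_on_road"}
ALLOWED["outcome_8"] = {"win_at_home", "win_on_road"}


def validate_row(row):
    """Return a list of problems found in one row; an empty list means it passes."""
    problems = []
    if not isinstance(row.get("year"), int):
        problems.append("year must be an integer")
    for name, allowed in ALLOWED.items():
        value = row.get(name)
        # None stands for a game that was not played.
        if value is not None and value not in allowed:
            problems.append(f"{name}: unexpected value {value!r}")
    return problems


# Made-up row, for illustration only.
row = {"year": 1950, "outcome_1": "win_on_road", "outcome_2": "win_on_road",
       "outcome_3": "win_at_home", "outcome_4": "win_at_home"}
print(validate_row(row))  # []
```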
- Data (first 10 data points)
year  outc...  outc...  outc...  outc...  outc...  outc...  outc...  outc...
1903  loss...  win_...  loss...  loss...  win_...  win_...  win_...  win_...
1927  win_...  win_...  win_...  win_...  nan      nan      nan      nan
1950  win_...  win_...  win_...  win_...  nan      nan      nan      nan
1973  win_...  loss...  win_...  loss...  loss...  win_...  win_...  nan
1905  win_...  loss...  win_...  win_...  win_...  nan      nan      nan
1928  win_...  win_...  win_...  win_...  nan      nan      nan      nan
1951  loss...  win_...  loss...  win_...  win_...  win_...  nan      nan
1974  win_...  loss...  win_...  win_...  win_...  nan      nan      nan
1906  win_...  loss...  win_...  loss...  win_...  win_...  nan      nan
1929  win_...  win_...  loss...  win_...  win_...  nan      nan      nan
...   ...      ...      ...      ...      ...      ...      ...      ...
- Description
A gzip'ed tar containing StatLib datasets (statlib-20050214.tar.gz, 12,785,582 Bytes)
- URLs
- (No information yet)
- Publications
- Data Source
- http://lib.stat.cmu.edu/datasets/
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2011-09-14 15:05
This item was downloaded 2755 times and viewed 2067 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.