statlib-20050214 fl2000 (public)
- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,String
- Download
# Instances: 67 / # Attributes: 17
HDF5 (22.1 KB) XML CSV ARFF LibSVM Matlab Octave
- Original Data Format
- arff
- Name
- fl2000
- Version mldata
- 0
- Comment
County data from the 2000 Presidential Election in Florida.
Compiled by Brett Presnell, Department of Statistics, University of Florida.
These data are derived from three sources, described below. As far as I am aware, you are free to use these data in any way that you see fit, though some acknowledgement is always nice.
The candidate vote counts are the final certified counts reported by the Florida Division of Elections. These were obtained from the NORC web site in the file Cert_results.csv. Note that these do NOT include the federal absentee votes (so that Gore's total vote is actually higher here than Bush's).
The undervote and overvote counts were extracted from the NORC ballot level data in the file aligned.txt. Since aligned.txt is too large to work with in R (or almost any other program) I used cut (a standard UNIX program) to extract just the columns I needed:
cut -f 2,9,10 -d"|" aligned.txt > tmp
Then I read the results into R and processed them there.
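For readers without cut, the same extraction can be sketched in Python. This is only an illustration, not the procedure used to build the dataset: the function names extract_fields and cut_file are hypothetical, and the sketch assumes aligned.txt is plain text with one record per line and "|" as the only delimiter. It reads one line at a time, so the file never has to fit in memory.

```python
from typing import Iterable, Iterator, Sequence


def extract_fields(lines: Iterable[str],
                   fields: Sequence[int] = (2, 9, 10),
                   delimiter: str = "|") -> Iterator[str]:
    """Yield the selected 1-based fields of each line, joined by the delimiter.

    Follows the behaviour of `cut -f 2,9,10 -d"|"`: fields come out in file
    order, missing fields are skipped, and lines without the delimiter are
    passed through unchanged.
    """
    wanted = sorted(set(fields))
    for line in lines:
        line = line.rstrip("\n")
        if delimiter not in line:
            yield line
            continue
        parts = line.split(delimiter)
        yield delimiter.join(parts[i - 1] for i in wanted if i <= len(parts))


def cut_file(src_path: str, dst_path: str) -> None:
    """Stream src_path through extract_fields and write the result to dst_path."""
    with open(src_path) as src, open(dst_path, "w") as dst:
        for row in extract_fields(src):
            dst.write(row + "\n")
```

Calling cut_file("aligned.txt", "tmp") would then play the role of the command above, with the file names taken from that command.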
The technology and columns data were extracted from the Media Group data from the NORC web site. "Technology" is simply the type of voting machine used, and "columns" is 1 if the ballot listed the presidential candidates in a single column on a single page, and 2 if the presidential candidates were spread over two columns or two pages of the ballot.
These agree with some earlier data that I had obtained from the NY Times web site, except that in the Media Group data the PalmBeach County ballot (the famous butterfly ballot) was listed as having one column. I would definitely call this a two-column ballot, so that is the designation recorded here. At one time I thought that MiamiDade County also used a two-column ballot, but I was wrong (the ballot listed the candidates and parties in English and Spanish in opposing columns). Images of most of the ballots can be found on the New York Times web site: www.nytimes.com/images/2001/11/12/politics/recount/index_BALLOT.html
Information about the dataset
CLASSTYPE: nominal
CLASSINDEX: 2
- Names
- county,technology,columns,under,over,Bush,Gore,Browne,Nader,Harris,
- Types
- nominal:Alachua,Baker,Bay,Bradford,Brevard,Broward,Calhoun,Charlotte,Citrus,Clay,Collier,Columbia,DeSoto,Dixie,Duval,Escambia,Flagler,Franklin,Gadsden,Gilchrist,Glades,Gulf,Hamilton,Hardee,Hendry,Hernando,Highlands,Hillsborough,Holmes,IndianRiver,Jackson,Jefferson,Lafayette,Lake,Lee,Leon,Levy,Liberty,Madison,Manatee,Marion,Martin,MiamiDade,Monroe,Nassau,Okaloosa,Okeechobee,Orange,Osceola,PalmBeach,Pasco,Pinellas,Polk,Putnam,SantaRosa,Sarasota,Seminole,StJohns,StLucie,Sumter,Suwannee,Taylor,Union,Volusia,Wakulla,Walton,Washington
- nominal:Datavote,Hand,Lever,Optical,Votomatic
- nominal:1,2
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
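The Names and Types lists above pair up by position, so a small schema can be assembled from them. The sketch below is one possible way to do that in Python, not part of the dataset's tooling: parse_type is a hypothetical helper, only the ten names shown on this page are used (the trailing comma on the Names line is dropped), and the county level list is shortened to its first three entries for brevity.

```python
from typing import Dict, List, Optional, Tuple

# The ten attribute names listed on this page.
names = "county,technology,columns,under,over,Bush,Gore,Browne,Nader,Harris".split(",")

# Type declarations in the same order as the names.
types = [
    "nominal:Alachua,Baker,Bay",  # shortened here; the page lists 67 counties
    "nominal:Datavote,Hand,Lever,Optical,Votomatic",
    "nominal:1,2",
    "numeric", "numeric", "numeric", "numeric",
    "numeric", "numeric", "numeric",
]


def parse_type(decl: str) -> Tuple[str, Optional[List[str]]]:
    """Split 'nominal:a,b,c' into ('nominal', ['a', 'b', 'c']).

    Declarations without a colon, such as 'numeric', carry no level list.
    """
    kind, sep, levels = decl.partition(":")
    return kind, (levels.split(",") if sep else None)


schema: Dict[str, Tuple[str, Optional[List[str]]]] = {
    name: parse_type(decl) for name, decl in zip(names, types)
}

for name, (kind, levels) in schema.items():
    print(name, kind, levels if levels is not None else "")
```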
- Data (first 10 data points)
county   tech...  colu...  under  over  Bush    Gore    Browne  Nader  Harris  ...
Alac...  Opti...  1        217    105   34124   47365   658     3226   6       ...
Baker    Opti...  1        79     46    5610    2392    17      53     0       ...
Bay      Opti...  1        541    141   38637   18850   171     828    5       ...
Brad...  Opti...  2        41     695   5414    3075    28      84     0       ...
Brev...  Opti...  1        277    136   115185  97318   643     4470   11      ...
Brow...  Voto...  1        4946   7826  177902  387703  1217    7104   54      ...
Calh...  Opti...  1        78     0     2873    2155    10      39     0       ...
Char...  Opti...  2        170    2985  35426   29645   127     1462   6       ...
Citrus   Opti...  1        154    54    29767   25525   194     1379   5       ...
Clay     Opti...  1        223    157   41736   14632   204     562    1       ...
...      ...      ...      ...    ...   ...     ...     ...     ...    ...     ...
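As a rough illustration of how the under, over, and columns variables might be used together, the sketch below groups the ten preview rows above by ballot layout and prints an overvote rate for each county. Several choices here are assumptions made for the example: the rate is overvotes per 1,000 Bush plus Gore votes, a stand-in denominator chosen because total ballots cast are not shown on this page, and the truncated county and technology labels are copied as they appear in the preview. Ten rows are far too few to support any conclusion.

```python
from collections import defaultdict

# (county, technology, columns, under, over, Bush, Gore), copied from the
# ten-row preview; truncated labels such as "Alac..." are kept as shown.
rows = [
    ("Alac...", "Opti...", 1, 217, 105, 34124, 47365),
    ("Baker", "Opti...", 1, 79, 46, 5610, 2392),
    ("Bay", "Opti...", 1, 541, 141, 38637, 18850),
    ("Brad...", "Opti...", 2, 41, 695, 5414, 3075),
    ("Brev...", "Opti...", 1, 277, 136, 115185, 97318),
    ("Brow...", "Voto...", 1, 4946, 7826, 177902, 387703),
    ("Calh...", "Opti...", 1, 78, 0, 2873, 2155),
    ("Char...", "Opti...", 2, 170, 2985, 35426, 29645),
    ("Citrus", "Opti...", 1, 154, 54, 29767, 25525),
    ("Clay", "Opti...", 1, 223, 157, 41736, 14632),
]


def over_per_thousand(over: int, bush: int, gore: int) -> float:
    """Overvotes per 1,000 Bush+Gore votes (an illustrative rate only)."""
    return 1000.0 * over / (bush + gore)


# Group the per-county rates by the ballot's column layout (1 or 2).
rates = defaultdict(list)
for county, technology, columns, under, over, bush, gore in rows:
    rates[columns].append((county, over_per_thousand(over, bush, gore)))

for columns in sorted(rates):
    for county, rate in rates[columns]:
        print(f"columns={columns}  {county:<8} {rate:6.1f}")
```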
- Description
A gzipped tar archive containing StatLib datasets (statlib-20050214.tar.gz, 12,785,582 bytes)
- URLs
- (No information yet)
- Publications
- Data Source
- http://lib.stat.cmu.edu/datasets/
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2011-09-14 14:45
This item was downloaded 2646 times and viewed 2009 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody. If you are a copyright holder and would like us to remove a dataset, please inform us and we will do so as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.