View statlib-20050214 collins (public)
























- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point,String
- Download
-
# Instances: 500 / # Attributes: 24
HDF5 (117.8 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
You can edit this item to add more meta information and make use of the site's premium features.
- Original Data Format
- arff
- Name
- collins
- Version mldata
- 0
- Comment
The following are data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled
Variations in Written English: Characterizing Authors' Rhetorical Language Choices Across Corpora of Published Texts" (Completed at Carnegie Mellon Univ, 2003). The source of the corpora was the ICAME CD-ROM (get info at http://www.hit.uib.no/icame/cd).
The data were generated from the texts using tagging and visualization software, Docuscope.
The first row is the variable names. The genre of each text (assigned by the Brown corpus compilers) is in 'Genre' column and the corpus is listed in the 'corpus' column with 1=Brown and 2=Frown corpus.
The dataset may be freely used and distributed for non-commercial purposes.
Jeff Collins jeff.collins@acm.org 11 July 2003
Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
- Names
- Text,FirstPerson,InnerThinking,ThinkPositive,ThinkNegative,ThinkAhead,ThinkBack,Reasoning,Share_SocTies,Direct_Activity,
- Types
- nominal:A01.TXT,A02.TXT,A03.TXT,A04.TXT,A05.TXT,A06.TXT,A07.TXT,A08.TXT,A09.TXT,A10.TXT,A11.TXT,A12.TXT,A13.TXT,A14.TXT,A15.TXT,A16.TXT,A17.TXT,A18.TXT,A19.TXT,A20.TXT,A21.TXT,A22.TXT,A23.TXT,A24.TXT,A25.TXT,A26.TXT,A27.TXT,A28.TXT,A29.TXT,A30.TXT,A31.TXT,A32.TXT,A33.TXT,A34.TXT,A35.TXT,A36.TXT,A37.TXT,A38.TXT,A39.TXT,A40.TXT,A41.TXT,A42.TXT,A43.TXT,A44.TXT,B01.TXT,B02.TXT,B03.TXT,B04.TXT,B05.TXT,B06.TXT,B07.TXT,B08.TXT,B09.TXT,B10.TXT,B11.TXT,B12.TXT,B13.TXT,B14.TXT,B15.TXT,B16.TXT,B17.TXT,B18.TXT,B19.TXT,B20.TXT,B21.TXT,B22.TXT,B23.TXT,B24.TXT,B25.TXT,B26.TXT,B27.TXT,C01.TXT,C02.TXT,C03.TXT,C04.TXT,C05.TXT,C06.TXT,C07.TXT,C08.TXT,C09.TXT,C10.TXT,C11.TXT,C12.TXT,C13.TXT,C14.TXT,C15.TXT,C16.TXT,C17.TXT,D01.TXT,D02.TXT,D03.TXT,D04.TXT,D05.TXT,D06.TXT,D07.TXT,D08.TXT,D09.TXT,D10.TXT,D11.TXT,D12.TXT,D13.TXT,D14.TXT,D15.TXT,D16.TXT,D17.TXT,E01.TXT,E02.TXT,E03.TXT,E04.TXT,E05.TXT,E06.TXT,E07.TXT,E08.TXT,E09.TXT,E10.TXT,E11.TXT,E12.TXT,E13.TXT,E14.TXT,E15.TXT,E16.TXT,E17.TXT,E18.TXT,E19.TXT,E20.TXT,E21.TXT,E22.TXT,E23.TXT,E24.TXT,E25.TXT,E26.TXT,E27.TXT,E28.TXT,E29.TXT,E30.TXT,E31.TXT,E32.TXT,E33.TXT,E34.TXT,E35.TXT,E36.TXT,F01.TXT,F02.TXT,F03.TXT,F04.TXT,F05.TXT,F06.TXT,F07.TXT,F08.TXT,F09.TXT,F10.TXT,F11.TXT,F12.TXT,F13.TXT,F14.TXT,F15.TXT,F16.TXT,F17.TXT,F18.TXT,F19.TXT,F20.TXT,F21.TXT,F22.TXT,F23.TXT,F24.TXT,F25.TXT,F26.TXT,F27.TXT,F28.TXT,F29.TXT,F30.TXT,F31.TXT,F32.TXT,F33.TXT,F34.TXT,F35.TXT,F36.TXT,F37.TXT,F38.TXT,F39.TXT,F40.TXT,F41.TXT,F42.TXT,F43.TXT,F44.TXT,F45.TXT,F46.TXT,F47.TXT,F48.TXT,G01.TXT,G02.TXT,G03.TXT,G04.TXT,G05.TXT,G06.TXT,G07.TXT,G08.TXT,G09.TXT,G10.TXT,G11.TXT,G12.TXT,G13.TXT,G14.TXT,G15.TXT,G16.TXT,G17.TXT,G18.TXT,G19.TXT,G20.TXT,G21.TXT,G22.TXT,G23.TXT,G24.TXT,G25.TXT,G26.TXT,G27.TXT,G28.TXT,G29.TXT,G30.TXT,G31.TXT,G32.TXT,G33.TXT,G34.TXT,G35.TXT,G36.TXT,G37.TXT,G38.TXT,G39.TXT,G40.TXT,G41.TXT,G42.TXT,G43.TXT,G44.TXT,G45.TXT,G46.TXT,G47.TXT,G48.TXT,G49.TXT,G50.TXT,G51.TXT,G52.TXT,G53.TXT,G54.TXT,G55.TXT,G56.TXT,G57.TXT,G58.TXT,G59.TXT,G60.TXT,G61.TXT,G62.TXT,G63.TXT,G64.TXT,G65.TXT,G66.TXT,G67.TXT,G68.TXT,G69.TXT,G70.TXT,G71.TXT,G72.TXT,G73.TXT,G74.TXT,G75.TXT,H01.TXT,H02.TXT,H03.TXT,H04.TXT,H05.TXT,H06.TXT,H07.TXT,H08.TXT,H09.TXT,H10.TXT,H11.TXT,H12.TXT,H13.TXT,H14.TXT,H15.TXT,H16.TXT,H17.TXT,H18.TXT,H19.TXT,H20.TXT,H21.TXT,H22.TXT,H23.TXT,H24.TXT,H25.TXT,H26.TXT,H27.TXT,H28.TXT,H29.TXT,H30.TXT,J01.TXT,J02.TXT,J03.TXT,J04.TXT,J05.TXT,J06.TXT,J07.TXT,J08.TXT,J09.TXT,J10.TXT,J11.TXT,J12.TXT,J13.TXT,J14.TXT,J15.TXT,J16.TXT,J17.TXT,J18.TXT,J19.TXT,J20.TXT,J21.TXT,J22.TXT,J23.TXT,J24.TXT,J25.TXT,J26.TXT,J27.TXT,J28.TXT,J29.TXT,J30.TXT,J31.TXT,J32.TXT,J33.TXT,J34.TXT,J35.TXT,J36.TXT,J37.TXT,J38.TXT,J39.TXT,J40.TXT,J41.TXT,J42.TXT,J43.TXT,J44.TXT,J45.TXT,J46.TXT,J47.TXT,J48.TXT,J49.TXT,J50.TXT,J51.TXT,J52.TXT,J53.TXT,J54.TXT,J55.TXT,J56.TXT,J57.TXT,J58.TXT,J59.TXT,J60.TXT,J61.TXT,J62.TXT,J63.TXT,J64.TXT,J65.TXT,J66.TXT,J67.TXT,J68.TXT,J69.TXT,J70.TXT,J71.TXT,J72.TXT,J73.TXT,J74.TXT,J75.TXT,J76.TXT,J77.TXT,J78.TXT,J79.TXT,J80.TXT,K01.TXT,K02.TXT,K03.TXT,K04.TXT,K05.TXT,K06.TXT,K07.TXT,K08.TXT,K09.TXT,K10.TXT,K11.TXT,K12.TXT,K13.TXT,K14.TXT,K15.TXT,K16.TXT,K17.TXT,K18.TXT,K19.TXT,K20.TXT,K21.TXT,K22.TXT,K23.TXT,K24.TXT,K25.TXT,K26.TXT,K27.TXT,K28.TXT,K29.TXT,L01.TXT,L02.TXT,L03.TXT,L04.TXT,L05.TXT,L06.TXT,L07.TXT,L08.TXT,L09.TXT,L10.TXT,L11.TXT,L12.TXT,L13.TXT,L14.TXT,L15.TXT,L16.TXT,L17.TXT,L18.TXT,L19.TXT,L20.TXT,L21.TXT,L22.TXT,L23.TXT,L24.TXT,M01.TXT,M02.TXT,M03.TXT,M04.TXT,M05.TXT,M06.TXT,N01.TXT,N02.TXT,N03.TXT,N04.TXT,N05.TXT,N06.TXT,N07.TXT,N08.TXT,N09.TXT,N10.TXT,N11.TXT,N12.TXT,N13.TXT,N14.TXT,N15.TXT,N16.TXT,N17.TXT,N18.TXT,N19.TXT,N20.TXT,N21.TXT,N22.TXT,N23.TXT,N24.TXT,N25.TXT,N26.TXT,N27.TXT,N28.TXT,N29.TXT,P01.TXT,P02.TXT,P03.TXT,P04.TXT,P05.TXT,P06.TXT,P07.TXT,P08.TXT,P09.TXT,P10.TXT,P11.TXT,P12.TXT,P13.TXT,P14.TXT,P15.TXT,P16.TXT,P17.TXT,P18.TXT,P19.TXT,P20.TXT,P21.TXT,P22.TXT,P23.TXT,P24.TXT,P25.TXT,P26.TXT,P27.TXT,P28.TXT,P29.TXT,R01.TXT,R02.TXT,R03.TXT,R04.TXT,R05.TXT,R06.TXT,R07.TXT,R08.TXT,R09.TXT
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
Text Firs... Inne... Thin... Thin... Thin... Thin... Reas... Shar... Dire... ... A01.... 0.09 1.72 0.62 0.7 2.15 0.66 1.23 2.24 0.48 ... A02.... 0.13 1.48 0.38 0.63 1.85 0.93 1.48 2.28 0.21 ... A03.... 0.04 1.72 0.34 1.12 1.55 0.9 1.68 2.84 0.17 ... A04.... 0.0 2.74 0.9 2.43 1.21 1.03 3.46 3.06 0.22 ... A05.... 0.26 2.15 0.39 0.61 2.28 0.75 1.93 1.84 0.44 ... A06.... 0.26 1.26 0.7 0.91 1.83 0.52 1.74 2.13 0.3 ... A07.... 0.17 1.31 0.59 0.97 1.74 0.72 1.61 2.67 0.21 ... A08.... 0.0 3.4 0.59 1.4 2.94 0.63 2.17 1.99 0.14 ... A09.... 0.13 1.1 0.3 0.93 1.19 0.89 0.98 1.57 0.08 ... A10.... 0.04 1.98 0.25 0.93 2.19 0.8 1.86 2.32 0.51 ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A gzip'ed tar containing StatLib datasets (statlib-20050214.tar.gz, 12,785,582 Bytes)
- URLs
- (No information yet)
- Publications
- Data Source
- http://lib.stat.cmu.edu/datasets/
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:59
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
To post a comment, please sign in.This item was downloaded 2444 times and viewed 2309 times.
No Tasks yet on dataset statlib-20050214 collins
Submit a new Task for this Data itemDisclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.