Public Archive Data
-
DMOZ Web Directory Topics - submitted by jeanbaptiste 6484 views, 11490 downloads, 0 comments
last edited by jeanbaptiste - Mar 29, 2012, 16:47 CET
- Summary:
Contains parsed webpages along with their topics extracted from DMOZ web directory
- Data Shape: 10630 attributes, 2658 instances
- License: unknown
- Tags: bag-of-words Classification DMOZ libsvm multi-class text web-pages
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (4.1 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
-
arcene nips - submitted by t3kcit 23 views, 2558 downloads, 0 comments
last edited by t3kcit - Mar 27, 2012, 23:38 CET
- Summary:
NIPS 2003 Feature selection challenge
- License: unknown
- Tags: Classification
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: zip (8.4 MB)
-
Chars74K English img - submitted by 30 views, 24503 downloads, 0 comments
last edited by teo - Mar 27, 2012, 16:13 CET
- Summary:
Images of characters and numerals cropped from street photos
- License: CC0
- Tags: character-recognition computer-vision
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: tgz (127.8 MB)
-
Yahoo! Web Directory Topics - submitted by jeanbaptiste 2332 views, 13261 downloads, 0 comments
last edited by jeanbaptiste - Mar 13, 2012, 15:16 CET
- Summary:
Contains parsed webpages along with their topics extracted from Yahoo! web directory
- Data Shape: 10630 attributes, 2212 instances
- License: unknown
- Tags: bag-of-words Classification multi-class text web-pages Yahoo!
- Tasks / Methods / Challenges: 1 task, 0 methods, 1 challenge
- Download: HDF5 (3.6 MB) XML CSV ARFF LibSVM Matlab Octave
-
Labeled IHC images of RCC - submitted by jetic 3331 views, 2421 downloads, 0 comments
last edited by jetic - Mar 9, 2012, 11:27 CET
- Summary:
Labeled immunohistochemically stained tissue micro array images for nucleus detection and classification
- License: CC0
- Tags: Carcinoma Cell Classification Clear computational detection immunohistochemistry pathology Renal
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: zip (20.5 MB)
-
GERMANA - submitted by nserrano 2095 views, 2032 downloads, 0 comments
last edited by nserrano - Mar 7, 2012, 13:22 CET
- Summary:
GERMANA is the result of digitising and annotating a 764-page Spanish manuscript
- License: unknown
- Tags: Handwriting-Recognition Historical-Documents
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: tar.gz (1.1 GB)
-
RODRIGO - submitted by nserrano 4781 views, 1990 downloads, 0 comments
last edited by nserrano - Mar 7, 2012, 13:08 CET
- Summary:
RODRIGO is the result of digitising and annotating an 853-page bound volume entitled "Historia de España del arçobispo Don Rodrigo", written entirely in old Castilian (Spanish) by a single author
- License: unknown
- Tags: Handwriting-Recognition Historical-Documents
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: tar.gz (1.2 GB)
-
Record of Heart Sound - submitted by Yiqi 1360 views, 14893 downloads, 0 comments
last edited by Yiqi - Feb 3, 2012, 15:16 CET
- Summary:
Recordings of heart sounds from people with different heart conditions, labeled Normal, Murmur, Extra sound, Extrasystole, or Artifact, for heartbeat classification.
- License: CC0
- Tags: Classification Signal-processing Supervised-Machine-Learning
- Tasks / Methods / Challenges: 2 tasks, 0 methods, 0 challenges
- Download: zip (47.7 MB)
-
Translation Initiation Site Pred - submitted by kidzik 6019 views, 17993 downloads, 0 comments
last edited by kidzik - Sep 15, 2011, 18:46 CET
- Summary:
Used to find the Translation Initiation Site (TIS), at which the translation from mRNA to proteins initiates
- Data Shape: 928 attributes, 3312 instances (Integer,String)
- License: unknown (from UCI repository)
- Tags: biomedical Initiation Prediction Regression Site Translation
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (49.3 MB) XML CSV ARFF LibSVM Matlab Octave
-
DLBCL Tumor from Harvard - submitted by kidzik 3842 views, 20985 downloads, 0 comments
last edited by kidzik - Sep 15, 2011, 12:51 CET
- Summary:
The publication addresses two classification problems concerning diffuse large B-cell lymphoma (DLBCL).
- Data Shape: 7130 attributes, 77 instances (Integer,String)
- License: unknown (from UCI repository)
- Tasks / Methods / Challenges: 1 task, 0 methods, 0 challenges
- Download: HDF5 (2.7 MB) XML CSV ARFF LibSVM Matlab Octave
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody. If you are a copyright holder and would like us to remove a dataset, please inform us and we will remove it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.