DMOZ Web Directory Topics
Submitted by jeanbaptiste; last edited by jeanbaptiste on Mar 29, 2012, 16:47 CET
6483 views, 11484 downloads, 0 comments
- Summary:
Contains parsed web pages along with their topics, extracted from the DMOZ web directory.
- Data Shape: 10630 attributes, 2658 instances
- License: unknown
- Tags: bag-of-words Classification DMOZ libsvm multi-class text web-pages
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (4.1 MB) XML CSV ARFF LibSVM Matlab Octave
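Since the dataset is tagged bag-of-words and offered as a LibSVM download, the following is a minimal sketch of reading that kind of file in plain Python. It assumes the common sparse layout of one document per line, `<label> <index>:<value> ...`; the actual layout of this dataset's export has not been checked here. The function names and the two-line sample are hypothetical and do not come from the dataset.

```python
from typing import Dict, List, Tuple

Row = Tuple[str, Dict[int, float]]


def parse_libsvm_line(line: str) -> Row:
    """Parse one sparse line of the assumed form '<label> <index>:<value> ...'."""
    # Drop an optional trailing '#' comment, then surrounding whitespace.
    body = line.split("#", 1)[0].strip()
    if not body:
        raise ValueError("line has no content")
    parts = body.split()
    # The label is kept as a string so any topic encoding survives unchanged.
    label = parts[0]
    features: Dict[int, float] = {}
    for token in parts[1:]:
        index, value = token.split(":", 1)
        features[int(index)] = float(value)
    return label, features


def parse_libsvm(text: str) -> List[Row]:
    """Parse a whole file's text, skipping blank and comment-only lines."""
    rows: List[Row] = []
    for line in text.splitlines():
        if line.split("#", 1)[0].strip():
            rows.append(parse_libsvm_line(line))
    return rows


# Hypothetical sample: labels, indices, and counts are made up for illustration.
sample = "3 1:2 17:1 10630:4\n\n# comment-only line\n7 5:1 17:3\n"
rows = parse_libsvm(sample)
```

Each row pairs a topic label with a dictionary of the nonzero term counts, which keeps memory use low when most of the 10630 attributes are absent from a given page.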
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody. If you are a copyright holder and would like us to remove a dataset, please inform us and we will do so as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning), http://www.pascal-network.org/