View uci-20070111 house_16H (public)

2010-11-06 09:59 by mldata | Version 1 | Rating Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star
Rating
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Overall (based on 0 votes)
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Interesting
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Documentation
Summary

(No information yet)

License
unknown (from Weka repository)
Dependencies
Tags
arff slurped Weka
Attribute Types
Floating Point
Download
# Instances: 22784 / # Attributes: 17
HDF5 (3.0 MB) XML CSV ARFF LibSVM Matlab Octave

Files are converted on demand and the process can take up to a minute. Please wait until download begins.

Completeness of this item currently: 55%.
You can edit this item to add more meta information and make use of the site's premium features.
Original Data Format
arff
Name
house_16H
Version mldata
0
Comment

This database was designed on the basis of data provided by US Census Bureau http://www.census.gov. The data were collected as part of the 1990 US census. These are mostly counts cumulated at different survey levels. For the purpose of this data set a level State-Place was used. Data from all states was obtained. Most of the counts were changed into appropriate proportions. There are 4 different data sets obtained from this database: House(8H) House(8L) House(16H) House(16L) These are all concerned with predicting the median price of the house in the region based on demographic composition and a state of housing market in the region. A number in the name signifies the number of attributes of the data set. A following letter denotes a very rough approximation to the difficulty of the task. For Low task difficulty, more correlated attributes were chosen as signified by univariate smooth fit of that input on the target. Tasks with High difficulty have had their attributes chosen to make the modelling more difficult due to higher variance or lower correlation of the inputs to the target.

Original source: DELVE repository of data. Source: collection of regression datasets by Luis Torgo (ltorgo@ncc.up.pt) at http://www.ncc.up.pt/~ltorgo/Regression/DataSets.html Characteristics: 22784 cases, 17 continuous attributes.

Names
P1,P5p1,P6p2,P11p4,P14p9,P15p1,P15p3,P16p2,P18p2,P27p4,
Types
  1. numeric
  2. numeric
  3. numeric
  4. numeric
  5. numeric
  6. numeric
  7. numeric
  8. numeric
  9. numeric
  10. numeric
Data (first 10 data points)
    P1 P5p1 P6p2 P11p4 P14p9 P15p1 P15p3 P16p2 P18p2 P27p4 ...
    1551... 0.46... 0.04... 0.22... 0.14... 0.75... 0.01... 0.57... 0.00... 0.07... ...
    1550.0 0.47... 0.00... 0.13... 0.09... 0.86... 0.0 0.69... 0.00... 0.04... ...
    4741.0 0.48... 0.00... 0.18... 0.13... 0.85... 0.0 0.68... 0.00... 0.02... ...
    467.0 0.49... 0.0 0.10... 0.08... 0.90... 0.0 0.78... 0.00... 0.01... ...
    310.0 0.47... 0.68... 0.22... 0.12... 0.89... 0.0 0.75... 0.00... 0.01... ...
    461.0 0.47... 0.08... 0.13... 0.09... 0.92... 0.0 0.79... 0.0 0.00... ...
    723.0 0.50... 0.0 0.12... 0.10... 0.86... 0.0 0.72... 0.01... 0.07... ...
    7572... 0.46... 0.29... 0.16... 0.12... 0.79... 0.00... 0.61... 0.00... 0.06... ...
    2949.0 0.46... 0.00... 0.18... 0.13... 0.82... 0.01... 0.67... 0.00... 0.04... ...
    4490.0 0.50... 0.00... 0.05... 0.03... 0.95... 0.00... 0.88... 0.00... 0.03... ...
    ... ... ... ... ... ... ... ... ... ... ...
Description

A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)

URLs
(No information yet)
Publications
    Data Source
    http://www.ics.uci.edu/~mlearn/MLRepository.html http://kdd.ics.uci.edu/
    Measurement Details
    Usage Scenario
    revision 1
    by mldata on 2010-11-06 09:59

    No one has posted any comments yet. Perhaps you would like to be the first?

    Leave a comment

    To post a comment, please sign in.

    This item was downloaded 3770 times and viewed 2112 times.

    No Tasks yet on dataset uci-20070111 house_16H

    Submit a new Task for this Data item

    Data

    Sort by

    Disclaimer

    We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.

    Data | Task | Method | Challenge

    Acknowledgements

    This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
    PASCAL Logo
    http://www.pascal-network.org/.