View uci-20070111 kdd_el_nino-small (public)

2011-09-14 15:35 by mldata | Version 1 | Rating Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star
Rating
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Overall (based on 0 votes)
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Interesting
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Documentation
Summary

(No information yet)

License
unknown (from Weka repository)
Dependencies
Tags
arff slurped Weka
Attribute Types
Integer,Floating Point
Download
# Instances: 782 / # Attributes: 9
HDF5 (67.1 KB) XML CSV ARFF LibSVM Matlab Octave

Files are converted on demand and the process can take up to a minute. Please wait until download begins.

Completeness of this item currently: 55%.
You can edit this item to add more meta information and make use of the site's premium features.
Original Data Format
arff
Name
el_nino-small
Version mldata
0
Comment
                             El Nino Data

Data Type

spatio-temporal

Abstract

The data set contains oceanographic and surface meteorological readings taken from a series of buoys positioned throughout the equatorial Pacific. The data is expected to aid in the understanding and prediction of El Nino/Southern Oscillation (ENSO) cycles.

Sources

Original Owner

[1]Pacific Marine Environmental Laboratory National Oceanic and Atmospheric Administration US Department of Commerce

Donor

[2]Dr Di Cook Department of Statistics Iowa State University [3]dicook@iastate.edu

Date Donated: June 30, 1999

Data Characteristics

This data was collected with the Tropical Atmosphere Ocean (TAO) array which was developed by the international Tropical Ocean Global Atmosphere (TOGA) program. The TAO array consists of nearly 70 moored buoys spanning the equatorial Pacific, measuring oceanographic and surface meteorological variables critical for improved detection, understanding and prediction of seasonal-to-interannual climate variations originating in the tropics, most notably those related to the El Nino/Southern Oscillation (ENSO) cycles.

The moorings were developed by National Oceanic and Atmospheric Administration's (NOAA) Pacific Marine Environmental Laboratory (PMEL). Each mooring measures air temperature, relative humidity, surface winds, sea surface temperatures and subsurface temperatures down to a depth of 500 meters and a few a of the buoys measure currents, rainfall and solar radiation. The data from the array, and current updates, can be viewed on the web at the this address .

The data consists of the following variables: date, latitude, longitude, zonal winds (west<0, east>0), meridional winds (south<0, north>0), relative humidity, air temperature, sea surface temperature and subsurface temperatures down to a depth of 500 meters. Data taken from the buoys from as early as 1980 for some locations. Other data that was taken in various locations are rainfall, solar radiation, current levels, and subsurface temperatures.

Variable Characteristics

The latitude and longitude in the data showed that the bouys moved around to different locations. The latitude values stayed within a degree from the approximate location. Yet the longitude values were sometimes as far as five degrees off of the approximate location.

Looking at the wind data, both the zonal and meridional winds fluctuated between -10 m/s and 10 m/s. The plot of the two wind variables showed no linear relationship. Also, the plots of each wind variable against the other three meteorolgical data showed no linear relationships.

The relative humidity values in the tropical Pacific were typically between 70% and 90%.

Both the air temperature and the sea surface temperature fluctuated between 20 and 30 degrees Celcius. The plot of the two temperatures variables shows a positive linear relationship existing. The two temperatures when each plotted against time also have similar plot designs. Plots of the other meteorological variables against the temperature variables showed no linear relationship.

There are missing values in the data. As mentioned earlier, not all buoys are able to measure currents, rainfall, and solar radiation, so these values are missing dependent on the individual buoy. The amount of data available is also dependent on the buoy, as certain buoys were commissioned earlier than others.

All readings were taken at the same time of day.

Other Relevant Information

Background

The El Nino/Southern Oscillation (ENSO) cycle of 1982-1983, the strongest of the century, created many problems throughout the world. Parts of the world such as Peru and the Unites States experienced destructive flooding from increased rainfalls while the western Pacific areas experienced drought and devastating brush fires. The ENSO cycle was neither predicted nor detected until it was near its peak. This highlighted the need for an ocean observing system (i.e. the TAO array) to support studies of large scale ocean-atmosphere interactions on seasonal-to-interannual time scales.

The TAO array provides real-time data to climate researchers, weather prediction centers and scientists around the world. Forcasts for tropical Pacific Ocean temperatures for one to two years in advance can be made using the ENSO cycle data. These forcasts are possible because of the moored buoys, along with drifting buoys, volunteer ship temperature probes, and sea level measurements.

Research Questions

Research questions of interest include: * How can the data be used to predict weather conditions throughout the world? * How do the variables relate to each other? * Which variables have a greater effect on the climate variations? * Does the amount of movement of the buoy effect the reliability of the data?

When performing an analysis of the data, one should pay attention the possible affect of autocorrelation. Using a multiple regression approach to model the data would require a look at autoregression since the weather statistics of the previous days will affect today's weather.

Data Format

The data is stored in an ASCII files with one observation per line. Spaces separate fields and periods (.) denote missing values.

Past Usage

This data was used in the American Statistical Association Statistical Graphics and Computing Sections 1999 Data Exposition.

References and Further Information

More information and data from the TAO array can be found at the Pacific Marine Environmental Laboratory [4]TAO data webpage.

Information on storm data is available [5]here. This site contains data from January 1994 to April 1998 in a chronological listing by state provided by the National Weather Service. The data includes hurricanes, tornadoes, thunderstorms, hail, floods, drought conditions, lightning, high winds, snow, and temperature extremes.

Hurricane tracking data for the Atlantic is available [6]here. The site contains a map showing the paths of the Atlantic hurricanes and also includes the storms winds (in knots), pressure (in millibars), and the category of the storm based on Saffir-Simpson scale.

Another site of interest related to the ENSO cyles is available [7]here. This site contains information on twelve areas of the world that have demonstrated ENSO-precipitation relationships. Included in the site are maps of the areas and time series plots of actual daily precipitation and accumulated normal precipitation for the areas. _

[8]The UCI KDD Archive
[9]Information and Computer Science
[10]University of California, Irvine
Irvine, CA 92697-3425

Last modified: June 30, 1999

References

  1. http://www.pmel.noaa.gov/
  2. http://www.public.iastate.edu/~dicook/
  3. mailto:dicook@iastate.edu
  4. http://www.pmel.noaa.gov/toga-tao/
  5. http://www.ncdc.noaa.gov/pdfs/sd/sd.html
  6. http://wxp.eas.purdue.edu/hur_atlantic/
  7. http://www.cpc.ncep.noaa.gov/products/analysis_monitoring/ensostuff/current_impacts/precip_accum.html
  8. http://kdd.ics.uci.edu/
  9. http://www.ics.uci.edu/
  10. http://www.uci.edu/

Information about the dataset CLASSTYPE: numeric CLASSINDEX: none specific

Names
buoy,day,latitude,longitude,zon_winds,mer_winds,humidity,air_temp,s_s_temp,
Types
  1. nominal:1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59
  2. nominal:1,2,3,4,5,6,7,8,9,10,11,12,13,14
  3. numeric
  4. numeric
  5. numeric
  6. numeric
  7. numeric
  8. numeric
  9. numeric
Data (first 10 data points)
    buoy day lati... long... zon_... mer_... humi... air_... s_s_...
    1.0 1.0 8.96 -140.... -6.3 -6.4 83.5 27.32 27.57
    1.0 2.0 8.95 -140.... -5.7 -3.6 86.4 26.7 27.62
    1.0 3.0 8.96 -140.... -6.2 -5.8 83.0 27.36 27.68
    1.0 4.0 8.96 -140.... -6.4 -5.3 82.2 27.32 27.7
    1.0 5.0 8.96 -140.... -4.9 -6.2 87.3 27.09 27.85
    1.0 6.0 8.96 -140.... -6.3 -4.9 91.5 26.82 27.98
    1.0 7.0 8.97 -140.... -6.7 -3.7 94.1 26.62 28.04
    1.0 8.0 8.96 -140.... -6.3 -4.8 92.0 26.89 27.98
    1.0 9.0 8.97 -140.... -6.3 -4.9 86.9 27.44 28.13
    1.0 10.0 8.97 -140.... -4.2 -2.5 87.3 26.62 28.14
    ... ... ... ... ... ... ... ... ...
Description

A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)

URLs
(No information yet)
Publications
    Data Source
    http://www.ics.uci.edu/~mlearn/MLRepository.html http://kdd.ics.uci.edu/
    Measurement Details
    Usage Scenario
    revision 1
    by mldata on 2011-09-14 15:35

    No one has posted any comments yet. Perhaps you would like to be the first?

    Leave a comment

    To post a comment, please sign in.

    This item was downloaded 4208 times and viewed 2277 times.

    No Tasks yet on dataset uci-20070111 kdd_el_nino-small

    Submit a new Task for this Data item

    Data

    Sort by

    Disclaimer

    We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.

    Data | Task | Method | Challenge

    Acknowledgements

    This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
    PASCAL Logo
    http://www.pascal-network.org/.