1990 US Census Data

Data Type

Multivariate

Abstract

The USCensus1990 data set is a discretized version of the USCensus1990raw data set. Many of the less useful attributes in the original data set have been dropped, the few continuous variables have been discretized and the few discrete variables that have a large number of possible values have been collapsed to have fewer possible values.

Sources

The USCensus1990raw data set was obtained from the (U.S. Department of Commerce) Census Bureau website using the Data Extraction System. This system can be found at http://www.census.gov/DES/www/d es.html.

Donor of database

Chris Meek

Microsoft

Bo Thiesson

Microsoft

David Heckerman

Microsoft

Data Characteristics

The data was collected as part of the 1990 census.

There are 68 categorical attributes. This data set was derived from the USCensus1990raw data set. The attributes are listed in the file USCensus1990.attributes.txt (repeated below) and the coding for the values is described below. Many of the less useful attributes in the original data set have been dropped, the few continuous variables have been discretized and the few discrete variables that have a large number of possible values have been collapsed to have fewer possible values.

More specifically the USCensus1990 data set was obtained from the USCensus1990raw data set by the following sequence of operations;

Other Relevant Information

Hierarchies of values are provided in the file USCensus1990raw.coding.htm and the mapping functions used to transform the USCensus1990raw to the USCensus1990 data sets are giving in the file USCensus1990.mapping.sql.

Data Format

The data is contained in a file called USCensus1990.data.txt. The first row contains the list of attributes. The first attribute is a caseid and should be ignored during analysis. The data is comma delimited with one case per row.

References & Further Information


The UCI KDD Archive
Information and Computer Science
University of California, Irvine
Irvine, CA 92697-3425
Last modified: 6 Nov 2001