Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the following conditions: ((AAGE > 16) && (AGI > 100) && (AFNLWGT > 1) && (HRSWK > 0))

adult

Format

A data frame with 32561 observations on the following 15 variables.

  1. age

  2. workclass

  3. fnlwgt

  4. education

  5. education-num

  6. marital-status

  7. occupation

  8. relationship

  9. race

  10. sex

  11. capital-gain

  12. capital-loss

  13. hours-per-week

  14. native-country

Source

Ronny Kohavi and Barry Becker Data Mining and Visualization Silicon Graphics. e-mail: ronnyk '@' live.com for questions.

Details

Predict whether income exceeds $50K/yr based on census data. Also known as "Census Income" dataset.

References

https://archive.ics.uci.edu/ml/machine-learning-databases/adult/

http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.names

https://archive.ics.uci.edu/ml/datasets/adult