Skip to content

honzaMaly/census_income_data_pyspark_classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Classification with PySpark (Census Income data)

TODO

Getting Started

TODO

  1. Setup a local environment to run PySpark
  2. Python environment should have all libraries specified in 'requirements.txt' and should be able to run Jupyter notebooks
  3. Download the Census Income data at http://archive.ics.uci.edu/ml/datasets/Census-Income+(KDD)
  4. Execute and follow instructions in Jupyter notebook 'main'

About

Example of classification in PySpark done on Census Income data set

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published