romil5044/US-Census-Income-Data-Big-Data-Analysis-

US-Census-Income-Data-Big-Data-Analysis-

Overview - The US Census Income Data is a large dataset containing financial as well as other descriptive information about US citizens. I used this dataset to predict the income of US citizens.

• Integrated & prepared census income data by imputation, binning & discretization of independent variables using PySpark
• Performed feature engineering, exploratory analysis & dimensionality reduction to finalize a list of features for ML modelling
• Achieved an accuracy of 83% in predicting annual income of employees using logistic regression & random forest models
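
The preparation steps named above (imputation, then binning/discretization) can be illustrated with a minimal sketch. It uses plain Python rather than PySpark so that it runs without a Spark installation, and it is not the code from this repository. The column names (`age`, `hours_per_week`, `workclass`), the `?` missing-value marker, the bin edges, and the helper functions are all hypothetical and chosen only for illustration.

```python
from bisect import bisect_right
from collections import Counter
from statistics import median

# Hypothetical sample rows; the column names and values are assumptions,
# not data taken from this repository.
rows = [
    {"age": 39, "hours_per_week": 40, "workclass": "Private"},
    {"age": None, "hours_per_week": 50, "workclass": "?"},
    {"age": 52, "hours_per_week": None, "workclass": "Self-emp"},
    {"age": 23, "hours_per_week": 20, "workclass": "Private"},
]


def impute_numeric(rows, column):
    """Fill missing (None) values in a numeric column with the column median."""
    fill = median(r[column] for r in rows if r[column] is not None)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def impute_categorical(rows, column, missing="?"):
    """Fill a missing-value marker with the most frequent observed category."""
    counts = Counter(r[column] for r in rows if r[column] != missing)
    mode = counts.most_common(1)[0][0]
    return [{**r, column: mode if r[column] == missing else r[column]} for r in rows]


def add_bins(rows, column, edges, labels, new_column):
    """Discretize a numeric column: a value v gets labels[i], where i is the
    number of edges that are <= v (so each edge opens the next bin)."""
    return [{**r, new_column: labels[bisect_right(edges, r[column])]} for r in rows]


# Hypothetical bin edges and labels for the age column.
AGE_EDGES = [25, 45, 65]
AGE_LABELS = ["<25", "25-44", "45-64", "65+"]

cleaned = impute_numeric(rows, "age")
cleaned = impute_numeric(cleaned, "hours_per_week")
cleaned = impute_categorical(cleaned, "workclass")
cleaned = add_bins(cleaned, "age", AGE_EDGES, AGE_LABELS, "age_group")

for row in cleaned:
    print(row)
```

In PySpark, the same steps would typically be expressed with `pyspark.ml.feature.Imputer` for numeric imputation and `pyspark.ml.feature.Bucketizer` for binning; whether this project used those classes is not stated here.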
