Skip to content

Analyzing a dataset from Aadhaar - a unique identity issued to all resident Indians using SparkSQL in Python

Notifications You must be signed in to change notification settings

fdabhi/Aadhar-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Aadhar-Data-Analysis

Analyzing a dataset from Aadhaar - a unique identity issued to all resident Indians using SparkSQL in Python

##Dependencies

##Usage

I choose Aadhaar Dataset which is available at Aadhaar public data portal using SparkSQL in Python to query below questions.

##Queries

  1. Count the number of cards approved by States.
  2. Count the number of cards approved by Enrolment Agency.
  3. Count the number of cards rejected by States.
  4. Count the number of Aadhaar applicants by gender split by States.
spark-submit AadharAnalysis.py

About

Analyzing a dataset from Aadhaar - a unique identity issued to all resident Indians using SparkSQL in Python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages