This repository is to serve as a general overview of the standalone data science projects I've worked on apart from my work. Each project's README.md acts as a full description of the project. All of these projects are in Python and R.
-
Paddy Doctor: Paddy Disease Classification
- Classification challenge on rice paddy images where different dieseases were the class of interest. I spent most of my time on the exploratory data analysis (EDA) and used a deep learning model built on EfficientNet for my submission, where classification accuracy on validation and testing data ranges from 97% to 98%.
-
Happywhale - Whale and Dolphin Identification
- Identify whales and dolphins by unique characteristics
- Clustering
- Active Fire clustering.
- Geospatial
- Visualized geospatial data using pandas, seaborn, matplotlib, numpy and folium.
-
- Chinese Text Analysis
- Data Dashboard
- Created a geospatial dashboard using flexdashboard generate visualizations for Gorillas' home range.
-
Snow Cover Product
- Developed a snow classification scheme on long-term AVHRR satellite data with spatial and temporal filters
- Integration test using pytest