Skip to content

pinkychow1010/machine-learning-project

Repository files navigation

Machine Learning/ Data Science Portfolio by Ka Hei Chow

This repository is to serve as a general overview of the standalone data science projects I've worked on apart from my work. Each project's README.md acts as a full description of the project. All of these projects are in Python and R.

Convolutional Neural Network (CNN)

  • Paddy Doctor: Paddy Disease Classification

    • Classification challenge on rice paddy images where different dieseases were the class of interest. I spent most of my time on the exploratory data analysis (EDA) and used a deep learning model built on EfficientNet for my submission, where classification accuracy on validation and testing data ranges from 97% to 98%.
  • Happywhale - Whale and Dolphin Identification

    • Identify whales and dolphins by unique characteristics

Unsupervised Machine Learning

Visualizations

  • Geospatial
    • Visualized geospatial data using pandas, seaborn, matplotlib, numpy and folium.

Spatial Modelling

Natural Language Processing (NLP)

Dashboard

  • Data Dashboard
    • Created a geospatial dashboard using flexdashboard generate visualizations for Gorillas' home range.

Image Processing

  • Snow Cover Product

    • Developed a snow classification scheme on long-term AVHRR satellite data with spatial and temporal filters
    • Integration test using pytest
  • Time Series Analysis

About

This repo includes my machine learning projects from unsupervised clustering to CNN deep learning. This is part of my data science portfolio, also featured in my GitHub page (https://pinkychow1010.github.io/).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published