Skip to content
View datascisteven's full-sized avatar
Block or Report

Block or report datascisteven

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
datascisteven/README.md

Steven Yan (@datascisteven)

πŸ‘‹ Thanks for visiting my Github page.

Driven by an unwavering commitment to effect positive societal change, I have consistently sought opportunities that promote community engagement and provide avenues to give back to the community. Transitioning from a decade-long career in education management, I've cultivated a growing passion for leveraging data to create impactful solutions as an early career data scientist. At StartOut, I gathered and analyzed both policy (LGBTQ+ legislation and fiscal) and non-policy (demographic, geographic, social) data, integrating them with indicators of high-growth entrepreneurship to train machine learning models. This enabled me to provide actionable insights to stakeholders and policy recommendations to decision-makers. This experience has honed my practical skills in extracting valuable insights from raw data to address pressing social issues.

As a career changer, my background encompasses diverse roles including management, classroom instruction, content writing, training and development, program administration, and career advising. Crafting content and conducting MCAT bootcamps has refined my written and verbal communication skills, enabling me to convey technical concepts clearly and fluently to both technical and non-technical audiences. My collaboration on machine learning models with data scientists around the world at Omdena, coupled with my prior dedication to teaching students from underrepresented backgrounds, has instilled in me cultural humility and competence while allowing me to contribute to meaningful causes.

Fueled by a passion for social good and equipped with a versatile skill set, I am committed to deploying data-driven solutions to address complex challenges within your organization and the broader field of data science.

Check out the project at the StartOut website:

StartOut Index

image

Extracurricular Projects:

1. 🦠 Omdena Local Chapter Challenge: Identifying Diseases in Chest X-Rays & COVID-19 Detection

  • Role: Task Lead
  • Description: I contributed to the Omdena Myanmar chapter as the Tuberculosis team lead for their project to democratize access to resources for the following respiratory lung disorders: tuberculosis, lung cancer, pneumonia, and COVID. Four teams worked alongside each other in building models for each disease for 8 weeks, and within each team, we selected the best model for deployment. We had a team member who was experienced in Streamlit that developed a webapp for the model.

2. 🦠 Omdena Local Chapter Challenge: COVID-19 Detection from Chest X-Ray Images using Deep Learning

  • Role: Task Lead
  • Description: This was an offshoot of the original project above because there was not enough participation in the original group for COVID. I worked with the members in this new challenge to provide what EDA and data preprocessing I had already completed, as well as the dataset. I worked with and supported the group to ensure the goals and deadlines were met, but did not continue to finish the project.

3. πŸ›°οΈ Omdena AI Challenge: Developing an AI model to Identify School Locations in Sudan using Satellite Imagery

  • Role: Lead ML Engineer
  • Description: We collaborated in this OmdenaLore AI challenge with the Giga team, a joint initiative between UNICEF and ITU for two months. We built several Computer Vision and Deep Learning models to detect school locations in Sudan using Satellite Imagery. We did an extensive and thorough analysis of the data and built multiple models using datasets provided by the Giga team to solve this problem.

4. πŸ³οΈβ€πŸŒˆ Essteem Equalithon: Inclusion Impact Index Dashboard developed by StartOut and Socos Lab

  • Technologies: Python, Numpy, Pandas, Tableau
  • Status: Developed timeline feature for dashboard and proposed some visualization changes

5. πŸ’» DataKind DataDive: Broadband Access Project with CDAC at UChicago

  • Technologies: Python, Numpy, Pandas, Geopandas, Tableau, SciPy, Scikit-learn, Matplotlib
  • Status: Contributed Tableau visualizations for EDA and pipeline for data processing

Certifications:

DeepLearning.AI Tensorflow Developer Certificate: πŸ₯‡

  1. Introduction to Tensorflow for Artificial Intelligence, Machine Learning, and Deep Learning: πŸŽ–οΈ
  2. Convolutional Neural Networks in Tensorflow: πŸŽ–οΈ
  3. Natural Language Processing in Tensorflow: πŸŽ–οΈ
  4. Sequences, Time Series and Prediction: πŸŽ–οΈ

DeepLearning.AI Deep Learning Specialization: πŸ₯‡

  1. Neural Networks and Deep Learning: πŸŽ–οΈ
  2. Improving Deep Neural Networks: Hyperparameter Tuning, Regularization, and Optimization: πŸŽ–οΈ
  3. tructuring Machine Learning Projects: πŸŽ–οΈ
  4. Convolutional Neural Networks: πŸŽ–οΈ
  5. Sequence Models: πŸŽ–οΈ

Contact Info:

Email Badge

Github Badge

LinkedIn Badge

Medium Badge

Portfolio Badge

Technology Stack:

Thank you for visiting my page!

Pinned

  1. datascisteven.github.io datascisteven.github.io Public

    My Github landing page for my portfolio of data science projects

    HTML

  2. Melanoma-Image-Classification Melanoma-Image-Classification Public

    Developing a Melanoma Detector with Neural Networks and Flask for Deployment

    Jupyter Notebook 11 6

  3. Automated-Hate-Tweet-Detection Automated-Hate-Tweet-Detection Public

    Developing a classification model to detect hate tweets ready for deployment using various NLP techniques

    Jupyter Notebook 18 7

  4. OmdenaAI/myanmar-chapter-chest-x-rays OmdenaAI/myanmar-chapter-chest-x-rays Public

    Jupyter Notebook 16 13