Skip to content
View hxu47's full-sized avatar
Block or Report

Block or report hxu47

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
hxu47/README.md

Hi there! ๐Ÿ‘‹

I'm Sophie(Huidong). With over 3+ years of experience, I specialize in implementing machine learning algorithms and crafting scalable data-driven applications. My strengths lie in handling vast datasets and creating optimized ETL processes. I'm passionate about the ever-evolving world of data and always eager to learn, adapt, and contribute.

๐Ÿ›  Work Experience:

  • Telematics Data Mastery ๐ŸŒ

    • Managed end-to-end lifecycle of large-scale telematics (GPS/accelerometer) data.
    • Developed ETL processes to manage billions of rows, extracting essential features for risk prediction.
  • Performance Optimization โšก

    • Designed a real-time infrastructure for GLM-based risk prediction, integrating it with auto insurance pricing to exceed industry benchmarks by 200%.
    • Achieved 90% processing time reduction via geospatial data optimization.
  • Automation & Monitoring ๐Ÿค–

    • Introduced automation with Databricks and Apache Airflow.
    • Set up monitoring protocols for consistent data quality and model performance.
  • Cloud Mastery โ˜๏ธ

    • Designed and hosted a web app on AWS EC2 using Streamlit, marking a significant decrease in manual effort.

๐Ÿ“ฆ GitHub Repositories:


๐Ÿ“ซ How to Reach Me

If you'd like to connect, feel free to reach out to me through LinkedIn.

I'm always open to discussing new projects, ideas, or opportunities. Let's get in touch and explore the world of data together!

Pinned

  1. Article-Recommendation-Engine Article-Recommendation-Engine Public

    Building a Article Recommendation Engine using Word2vec and Stanford's GloVe

    Jupyter Notebook

  2. Decision-Tree-from-Scratch Decision-Tree-from-Scratch Public

    Implementation of decision trees for classification and regression as objects similar to sklearn's.

    Python

  3. Feature-Importance Feature-Importance Public

    Investigation of various feature importance strategies.

    Jupyter Notebook

  4. KMeans-from-Scratch KMeans-from-Scratch Public

    Implementation of K-means and K-means++ from scratch.

    Jupyter Notebook

  5. Random-Forest-from-Scratch Random-Forest-from-Scratch Public

    Implementation of Random Forest comparable to scikit-learnโ€™s.

    Python

  6. Sentiment-Analysis-with-Naive-Bayes-from-Scratch Sentiment-Analysis-with-Naive-Bayes-from-Scratch Public

    Implementation of Naive Bayes from scratch and its application to Sentiment Analysis.

    Jupyter Notebook