Skip to content
View hyunjoonbok's full-sized avatar
🐵
Always striving
🐵
Always striving
Block or Report

Block or report hyunjoonbok

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hyunjoonbok/README.md

Hits GitHub stars Maintenance MIT license


HJ github stats

Experience

🔥 Product-focused Data Scientist 🔥

  • Ad Tech (current)
  • Gaming
  • Non-Profit

I worked at small to large-scale companies, so I enjoy getting my hands dirty and solving complex data-driven problems. My passion lies in enabling product growth using data and statistical theories.

  • End-to-End designing of company-wide product success measurement through constant hypothesis validation on user behaviors
  • Production-level software & data tooling development. Projects involve data pipelining, object-oriented design/refactoring, integration-testing, and operational maintenance.
  • Experience with petabyte-scale data handling techniques such as Spark
  • Familiarity with Ad-tech domains; real-time bidding, incrementality testing, brand-lift, AB experimenting, etc

Skills

  • Programming - Python, R, SQL (Snowflake, MySQL, Postgre), Shell, HTML, CSS
  • Cloud Service - AWS, Docker, Kubernetes, Jenkins, Spark,
  • Visualization - Tableau, Dash(python), Power BI, Excel
  • Web - Heap, Google Analytics
  • Version / Collaboration - Git, Wiki, JIRA, Confluence

Featured Repo

Main

Repo Description Link
R Projects R Portforlio in .R or .Rmd files by business topics (i.e. ML, DL, Text Mining, Time Series, etc) Link
Python Projects Python Portforlio mostly in Jupyter Notebooks by ML/DL framework (i.e. Pytorch, Tensorflow, Fast.ai, etc) Link
Medium Blog Tech medium blog that talks about data/ml Link

Data Science (Tools&Knowledges)

Repo Description Link
AWS SageMaker in Production End-to-End curated examples that show how to solve business problems using Amazon SageMaker and it's ML/DL algorithm. Mostly in Jupyter Notebook for easy accessibility Link
PySpark PySpark functions and utilities with Real-world Data examples. Can be used to build complete ETL process of data modeling Link
Recommendation System Production-level Implementations of Recommender System in Pytorch. Clone repo and start training by running 'main.py' Link
Natural Language Processing (NLP) Examples Full implementation examples of several Natural Language Processing methods in Python. Ordered in a personal level of complexity Link

Kaggle

Repo Description Link
Bengali.AI Handwritten Grapheme Classification Classify three constituent elements in the image, given the image of a handwritten Bengali grapheme Link

Linkedin Badge Gmail Badge

Pinned

  1. R-projects R-projects Public

    Portfolio in R

    R 6

  2. Python-Projects Python-Projects Public

    Portfolio in Python

    Jupyter Notebook 37 14

  3. amazon-sagemaker amazon-sagemaker Public

    End-to-End examples that show how to solve business problems using Amazon SageMaker and it's ML/DL algorithm.

    Jupyter Notebook 15 10

  4. Recommendation_System-PyTorch Recommendation_System-PyTorch Public

    Full Implementation of Recommender System in Pytorch (with examples)

    Python 23 4

  5. natural-language-processing natural-language-processing Public

    Ready-to-use Implementation of Natural Language Processing models in Keras/Tensorflow (transformer)

    Jupyter Notebook 4 4

  6. PySpark PySpark Public

    PySpark functions and utilities with examples. Assists ETL process of data modeling

    Jupyter Notebook 89 73