Skip to content
View seoyunion's full-sized avatar
Block or Report

Block or report seoyunion

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
seoyunion/README.md

Portfolio

About

Personal Information

Personal Statement

I am Seoyun Kim, a 2nd year graduate student who is double majored in Data Science & Artificial Intelligence at Sungkyunkwan University, Seoul, Korea. I am currently I am a M.Sc. student in Data eXperience Lab at Sungkyunkwan University, advised by Eunil Park.

As a member of the categorical data analysis team of P-Sat of the Statistical project team of Sungkyunkwan University, I have experience developing various models and handling categorical data. In addition, various projects related to Data Visualization, Prediction ML Model, DL Model such as LSTM, NLP techniques, and Data Preprocessing have been carried out. I also participated in various data analysis/AI researches in DX Lab, including researches using image, text, video, and time series data.

Research Interests

  • Data Analysis
  • Computer Vision
  • Machine Learning and Deep Learning
  • Artificial Intelligence(AI)
  • Multimodal Modeling
  • Social&Affective Computing

Research/Publications

  • D-ViSA: A Dataset for Detecting Visual Sentiment from Art Images pdf github

    • In Proceedings of the IEEE/CVF International Conference on Computer Vision
    • Built abstract art image dataset annotated with dimensional emotion labels, conducting deep learning model experiment for detecting dimensional emotion from art images
  • Understanding mental health issues in different subdomains in social networking services: computational analysis of text-based reddit posts github

    • Journal of medical Internet research
    • Examined and classified the linguistic characteristics of user posts on specific mental disorder subreddit channels (depression, anxiety, bipolar, borderline personality disorder, schizophrenia, autism, and mental health) on Reddit using sentiment analysis and unsupervised clustering methods
  • Micro-Locational Fine Dust Prediction Utilizing Machine Learning and Deep Learning Models github

    • Computer Systems Science and Engineering
    • Predicted micro-locational fine dust concentration from air quality and meteorologucal time-series data using ML/DL models

Experiences

  • M.Sc. in Data Experience Lab at SKKU (2022.02. ~ present)

  • A Member of P-Sat at SKKU (2020.08. ~ 2021.02.)

  • Data Science Team Manager at Dacon (2021.12. ~ 2022.02.)

  • Dam Water Level LSTM Prediction link
    Predicted and Analyzed by using statistical methods and deep leaning RNN model - LSTM - to predict water-level of the dam using multi-variable dataset

  • Real-time news crawling link
    Built real-time news crawling engine including search keyword by using BeautifulSoup and made news data preprocessing module

  • Binalry Classification in Predicting Political Party link
    During 'Theme Analysis' we used statistical metodes including t-test, homogeneous test and variable selection, EDA & feature engineering for Preprocessing
    Modeling using Ensemble model, XGBoost, Light GBM, and Random Forest in order to predict the 'Party' variable

    Building prediction model for Inbalanced dataset using PCA, SMOTE, and various Prediction model such as Ligh GBM and Cross Validation and measured F-1 score in 'Kaggle competition'

  • Predicting Wheter-to-vote link
    Predicted wheter the person will vote or not using psychological survey dataset with XGBoost model

  • Wine Filtering and Recommendation System link
    Built wine recommendation program using QtPy and filtering methods

  • Visualization of Alchol Cunsumption around the Globe link
    Using R, visualized the correlance between happiness score, region, and alcohol consumption

Pinned

  1. 2019-R-visualization 2019-R-visualization Public

    skku data science and R(2019): final R visualization project

    HTML

  2. P-SAT P-SAT Public

    SKKU P-Sat, Department of Statistics of SKKU, Korea / Qualitative Data Analysis Team (Sep. 2020 ~ Feb. 2021)

    Jupyter Notebook

  3. real-time-news-crawling real-time-news-crawling Public

    중앙일보 한겨레일보 크롤링 및 전처리 함수

    Python

  4. Ringle-contest-WordCloud Ringle-contest-WordCloud Public

    English Edu-tech Start-up PM Contest , (주)링글잉글리시에듀케이션서비스, Korea / Award 3rd prize (Dec. 2020 ~ Mar. 2021)

    Jupyter Notebook

  5. water-level-prediction water-level-prediction Public

    2021 Big Contest 홍수zero dam water level prediction.

    Jupyter Notebook 1

  6. wine-recommendation wine-recommendation Public

    2021 Python Boot Camp final project; wine recommendation program.

    Jupyter Notebook