Skip to content
View gscharly's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report gscharly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gscharly/README.md

Hi 👋 My name is Carlos Gomez

Data Scientist / ML Engineer | Python, ML | Help businesses make data-driven decisions and build ML based-data products


  • 🌍 I'm based in Madrid
  • ✉️ You can contact me at carlos.gomez.sanchez94@gmail.com
  • 5+ years in developing end-to-end ML data-based products: gathering initial requirements, discussions with business stakeholders, use cases definition, exploratory analysis, ETL pipelines, ML models, evaluation & A/B testing, model monitoring and production
  • Expertise in applying Machine Learning to solving business use cases, from initial prototypes to production ML-based products following MLOps best practices
  • In-depth understanding of Python
  • Experience in Functional and Object-Oriented Programming styles
  • Highly experienced in dealing with high dimension datasets using Spark
  • Knowledge of Cloud platforms, especially Google Cloud Platform
  • Good exposure to CI/CD best practices using Jenkins and GCP
  • Highly sklled in SQL and working with data warehouses
  • Worked for industry clients in the telco and banking sectors

Skills

PythonGitDockerTensorFlowPyTorchGoogle Cloud

Tools | Frameworks

  • Python ML & DL toolkit: Scikit-learn, Lightgbm, Tensorflow (Keras), Pytorch, Spacy
  • Data processing: Numpy, Pandas
  • ETL orchestration: Luigi, Apache Airflow
  • Data visualization: Matplotlib, Seaborn, Plotly
  • Cloud Services: GCP (Vertex AI, Cloud Dataproc, Cloud Build, BigQuery), AWS (Sagemaker, EMR)
  • Docker
  • IDEs: Jupyter-lab, Pycharm, IntelliJ
  • Web development with Nodejs and Flask

Research thesis

  • Automatic generation of sport news using NLP techniques and ranking systems (Universidad Rey Juan Carlos Madrid, 2021)
  • Design and implementation of initialization techniques for Deep Learning algorithms (Universidad Politécnica de Madrid, 2018)

Socials

Pinned

  1. drl_p3_collaboration_competition drl_p3_collaboration_competition Public

    Multi-agent Reinforcement Learning - Train a pair of agents to play tennis. Part of Udacity DRL Nanodegree

    Python

  2. drl_p2_continous_control drl_p2_continous_control Public

    Policy Based Methods - Train a double-jointed arm to follow target locations. Part of Udacity DRL Nanodegree

    Python

  3. drl_p1_navigation drl_p1_navigation Public

    Value Based Methods - Train an agent to collect bananas. Part of Udacity DRL Nanodegree

    Python

  4. master_ds_tfm master_ds_tfm Public

    Data Science Master degree Final thesis. Automatic generation of sport news using NLP and ranking techniques

    Jupyter Notebook

  5. practica_sistemas_distribuidos practica_sistemas_distribuidos Public

    Sentiment analysis in Spain using Twitter and MRJob. Part of Master Degree on Data Science in URJC Madrid

    Python

  6. chefs_graph chefs_graph Public

    This project uses graph analysis techniques to visualize the relation between different chefs

    HTML