Skip to content
View cdrc1103's full-sized avatar
Block or Report

Block or report cdrc1103

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cdrc1103/README.md

Welcome to my Github Page! ✌️

I am a Data Scientist / Engineer, currently working as consultant for Enterprise Data and Analytics at Capgemini. In my current role, my work is dedicated to analyzing client data, bringing alive data-driven software, implementing data pipelines and machine learning lifecycles, and designing cloud architectures. I focus on Natural Language Processing use cases and help clients to bring their AI prototypes to production by applying automation and MLOps tools.

Below, you can find a digest of the projects I am currently working on or have finished in the past.

Natural Language Processing

Name Use Case Tech Stack
Language Models Experimental use case development and deployment based on Generative AI Huggingface
Speech Transcription Speech transcription using Azure cognitive services Azure SDK, Streamlit
Named Entity Recognition Recognize entities in text using the BERT model Transformers, PyTorch

Academic Projects

Name Project Year Tech Stack Description
Patent Classifier Multi-Label Patent Classification with Deep Neural Networks 2021 TensorFlow, Transformers A comprehensive study to identify, implement and evaluate suitable approaches for the classification of patents using different neural network architectures like CNN, RNN, and Transformers. A domain-specific data set of 200.000 patent documents is used.
Crowdedness Prediction Crowdedness Prediction in Public Transport Under Covid-19 2020 PyTorch During Covid-19 pandemic social distancing in public transport is an important matter to prevent spreading the virus. Thus, it would be beneficial to know when and where there are bottlenecks in the public transport network. Our goal is to reduce the capacity problem by predicting the crowdedness for a specified time interval with RNNs.

Portfolio Optimization

Name Description Hosted Instance
My Portfolio A dashboard to keep track of a stock portfolio's development based on the order history. 📈Streamlit App

Pinned

  1. CrowdednessPrediction CrowdednessPrediction Public

    Crowdedness prediction in public transport under Covid-19

    Jupyter Notebook 1

  2. PatentClassifier PatentClassifier Public

    Multi-Label Classification for Cosmetic Patents Using Neural Networks.

    Jupyter Notebook

  3. NER NER Public

    Jupyter Notebook

  4. ObjectDetection ObjectDetection Public

    A Streamlit app that uses the YOLO model to detect objects in images.

    Python