Data Engineer β’ Cloud Analytics Professional β’ Tech Enthusiast
- π Currently pursuing a Master's in Information Systems at Northeastern University.
- π Skilled in ETL, Data Engineering, Machine Learning, and IoT-driven Solutions.
- π Certified Azure Data Engineer Associate.
- β‘ Fun Fact: After my class hours, youβll find me wrestling with vanishing gradients, taming activation functions, and convincing loss functions to take the hint β all in the name of 'convergence'. π€ππ
- π± What I'm Up To: Currently diving deep into MLOps to explore the building and deployement of end to end Machine Learning pipelines.
1οΈβ£ Programming Languages
2οΈβ£ ETL Tools & Distributed Systems
3οΈβ£ Databases
4οΈβ£ Machine Learning Models
5οΈβ£ Data Modeling
Hereβs a list of repositories from the BigDataTeam5 organization that can be included in your GitHub profile's README:
-
master-financial-database
Repository for managing financial data with Python. -
AI-Info-Extractor_Markdown_Viewer
Forked project for extracting and visualizing AI-related information using markdown. -
Incremental DataPipeline using Snowflake
Developed an efficient ETL pipeline with incremental loading capabilities using Snowflake. -
LiteLLM SummaryGenerator with Q&A
Python-based project for summarization and question answering with LiteLLM. -
Building a RAG Pipeline with Airflow
Implemented RAG concepts to reduce input tokens in a language model pipeline. -
Nvidia-Agentic-Architecture-Workflow
Built workflows to integrate agentic architectures with FastAPI and Streamlit. -
Multi-Agentic Hackathon Project
A multi-agent system for crime analysis reports hosted on Streamlit. -
MarketScope AI-Powered Industry Segment Intelligence Platform
A multifaceted application for healthcare vendors utilizing LangGraph and Airflow.
- Azure Spotify ML Pipeline
Built scalable ETL pipelines and Random Forest models achieving an RΒ² score of 0.82. - Motor Vehicles Crash Analysis
Analyzed crash data using Power BI and Talend, reducing traffic incidents by 35%. - Kansas City Service Request Analysis
Processed 1.56M service requests to optimize resource planning by 25%.