Skip to content

This is a small compilation of data engineering and data pipelining work.

License

Notifications You must be signed in to change notification settings

portfolioRM/DataEngineering-Pipelines

Repository files navigation

Data Engineering and Pipelines Sample

RM Slide Pic

Project Summary:

As part of my current work, I engineer data pipelines to integrate external data providing accessibility within the enterprise data warehouse (EDW). I also create data pipelines within the EDW to transform data for visualization and analysis. The files attached show a sample of one pipeline that uses Python to extract specific data from flat-files. This script is written in an object-oriented style. There is also a sample of a SQL pipeline where I pull and aggregate encounter level data to monthly and fiscal year figures across several measures and from different sources.

Please note some of the code has been obfuscated for privacy reasons.

Files Uploaded:

  1. "RashmiMariyappa_1_Pipeline_External Data"
    This is the Python script, written in an object-oriented style, that extracts specific data from flat-files and uploads it into an EDW.

  2. "RashmiMariyappa_2_Pipeline_KPIPresentation"
    This is an excerpt of a presentation I put together for my team on a complex back-end data pipeline that I re-engineered.

  3. "RashmiMariyappa_3_Pipeline_SQLSamples"
    This is a small sample of SQL work of mine. This last entry is one (of many) views in the KPI pipeline in the presentation in this repository. There are also 2 other sample SQL queries.