Skip to content

paladique/codespaces-etl-basic-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ETL in GitHub Codespaces

Open in GitHub Codespaces

Sign up for Azure

🎓 Students get $100 of credits with Azure for Students! No credit card required: Sign up

Extract, Transform, and Load Operations with Python & Pandas library with Jupyter Notebooks and Azure Cosmos DB

This sample loads a csv file as a Pandas dataframe, filters the records by airports located in the United States, then the filtered data into JSON. A sample of the pared data is then loaded into Azure Cosmos DB.

Instructions

Convert a filtered CSV File into JSON, then insert into Azure Cosmos DB in minutes with GitHub Codespaces

  1. Create a Cosmos DB NoSQL Account, you can stop after creating the resource

  2. After account is created in the Azure Portal, navigate to the resource (you can find it in your notifications)

  3. To the right of the resource overview select Keys. locate the URI, and PRIMARY KEY secrets.

    Azure cosmos db secrets

  4. Set copied secrets it as secrets in your Codespaces settings here. URI should be the COSMOS_ENDPOINT secret and PRIMARY KEY should be the COSMOS_KEY secret codespaces secrets

  5. Run the Notebook

  6. Clean up your Cosmos DB Account Resources after you're done!

Learn More

Releases

No releases published

Packages

No packages published