Deep Learning from End to End

Open Source Learning and the Democratization of AI

The National Center for Supercomputing Applications (NCSA) Center of AI Innovation (CAII) at the University of Illinois Urbana-Champaign has a focus on driving and enabling student-driven research. Their goal is to give access to the state-of-the-art tools and hardware so people can find novel ways to solve unanswered problems. I am one such student that has learned an incredible amount of knowledge and gained a lot of intuition on Artificial Intelligence through my mentors here.

Along with the NCSA, I also want to acknowledge all the amazing open source materials that I have found and learned from over the years. These resources often filled the gap for me between the theory that justified the model architecture and the actual implementation of them. I will do my best to reference all that work throughout, so you can see where I had learned it from initially! The purpose of this repository is to bring together that wealth of knowledge to truly be a one-stop-shop for everyone from beginners to researchers to gain something from and continue to push the boundaries of Open Source Research!!

If you want to contribute, (and please do it you want!!) go ahead an submit a PR and I can review it!

Getting Started

All of these tutorials can easily be run on the HAL system at the NCSA and you can follow the new users instructions here to setup an account.

If you prefer to use Google Colaboratory, that will also work fine! You will just need to setup the environment for specific packages needed (Easy pip installs to get those). For the datasets, you can save them in your Google Drive and access them from there!

Data Prep

We will be using a couple of datasets in our Deep Learning Adventures!!

Ensure you have a /data folder in your root directory of the git repo and run the following to install all datasets

bash download_data.sh

Extra Datasets

There are a few other datasets that we will use but are inconsistent to automatically download and are used in the more advanced architectures! Just download them from the link and save them in the /data folder! These datasets may also be too large to train in Google Drive so keep that in mind!

Foundations

Intro to PyTorch: Exploring the Mechanics
PyTorch Datasets and DataLoaders
Leveraging Pre-Trained Models for Transfer Learning
Intro to PyTorch for Vision: Digging into Convolutions
Going Deeper with Residual Connections
Digging into the LSTM: Sequence Classification
Lets Write a Story: Sequence Models for Text Generation
Large Model Considerations: Distributed Training
Distributed Training with Huggingface 🤗 Accelerate

Computer Vision

Going Deeper with ResNet
UNet for Image Segmentation
Moving from Convolutions: Vision Transformer
Masked Image Modeling with Masked Autoencoders
Self-Supervised Learning with DINO
Hierarchical Vision Transformers with Swin Transformer

Natural Language Processing

Speech Processing

Intro to Audio Processing in PyTorch
Connectionist Temporal Classification Loss
Intro to Automatic Speech Recognition
ASR through Self-Supervised Learning: Wav2Vec2
RNN Transducer as an Alternative to CTC

Generative AI

AutoEncoders
Autoregressive Generation
- PixelCNN
- WaveNet
Generative Adversarial Networks
- Intro to Generative Adversarial Networks
- SuperResolution with SRGAN
- Image2Image Translation with CycleGAN
Diffusion
- Intro to Diffusion
- Text-Conditional Diffusion with Classifier Free Guidance
- Latent-Space Diffusion

MultiModal AI

Building Vision/Language Representations: CLIP
Automatic Image Captioning
Visual Question Answering

Dive into Attention

Barebones Attention Mechanism
Sparse Windowed Attention
Linear Attention

Sequence to Sequence Modeling

Seq2Seq for Language Translation
CNN/RNN for Image Captioning
Attention is All You Need for Language Translation

Reinforcement Learning

Q-Learning
Deep-Q Learning

Name		Name	Last commit message	Last commit date
Latest commit History 350 Commits
PyTorch Basics		PyTorch Basics
PyTorch for Computer Vision		PyTorch for Computer Vision
PyTorch for Generation		PyTorch for Generation
PyTorch for NLP		PyTorch for NLP
data		data
models		models
src/visuals		src/visuals
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
download_data.sh		download_data.sh
prep_data.py		prep_data.py
requirements.txt		requirements.txt

License

priyammaz/HAL-DL-From-Scratch

Folders and files

Latest commit

History

Repository files navigation

Deep Learning from End to End

Open Source Learning and the Democratization of AI

Getting Started

Data Prep

Extra Datasets

Foundations

Computer Vision

Natural Language Processing

Speech Processing

Generative AI

AutoEncoders

Autoregressive Generation

Generative Adversarial Networks

Diffusion

MultiModal AI

Dive into Attention

Sequence to Sequence Modeling

Reinforcement Learning

About

Topics

Resources

License

Stars

Watchers

Forks

Languages