
Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

A Tutorial presented at SIBGRAPI 2021

Moacir Antonelli Ponti, Fernando Pereira dos Santos, Leo Sampaio Ferraz Ribeiro, Gabriel Biscaro Cavallari

Paper (extended version): https://arxiv.org/abs/2109.02752

Abstract: Training deep neural networks can be challenging with real-world data. Using models as black boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps, as well as more recent options, for improving models, in particular but not restricted to supervised learning. It can be particularly useful for datasets that are not as well prepared as those in challenges, and under scarce annotation and/or small data. We describe basic procedures such as data preparation, optimization, and transfer learning, but also recent architectural choices, such as the use of transformer modules, alternative convolutional layers, activation functions, and wide and deep networks, as well as training procedures including curriculum, contrastive, and self-supervised learning.

Content

  1. How to Start: Basic Checklist - slides

    1. Data quality
    2. Normalization (see the sketch after this checklist)
    3. Valid input representation
    4. Loss function and evaluation choice
    5. Model tuning
    6. Feature projection/visualization
    7. Internal and external validation and evaluation

    Notebook:
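
As an illustration of the normalization item, here is a minimal sketch of per-channel standardization in NumPy; the array names and shapes are illustrative assumptions, not code from the tutorial notebook. Note that the statistics are computed on the training set only, so no test-set information leaks into preprocessing.

```python
import numpy as np

def standardize(x_train, x_test):
    # Compute statistics on the training set only, to avoid leaking
    # test-set information into the preprocessing.
    mean = x_train.mean(axis=(0, 1, 2), keepdims=True)      # per-channel mean
    std = x_train.std(axis=(0, 1, 2), keepdims=True) + 1e-8  # avoid div by 0
    return (x_train - mean) / std, (x_test - mean) / std

# Dummy batches of 8-bit RGB images, shape (N, H, W, C)
x_tr = np.random.randint(0, 256, (100, 32, 32, 3)).astype(np.float32)
x_te = np.random.randint(0, 256, (20, 32, 32, 3)).astype(np.float32)
x_tr, x_te = standardize(x_tr, x_te)
```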

  2. Common Issues - slides

    1. Data quality and small datasets
    2. Imbalanced data (see the sketch after this list)
    3. Bias-variance dilemma in DNNs: overfitting/underfitting
    4. Sensitivity to attacks

    Notebooks:
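
For the imbalanced-data item, one common remedy is to weight the loss by inverse class frequency. Below is a minimal PyTorch sketch; the class counts and batch contents are hypothetical, for illustration only.

```python
import torch
import torch.nn as nn

# Hypothetical class counts for a 3-class imbalanced problem.
counts = torch.tensor([5000.0, 500.0, 50.0])
weights = counts.sum() / (len(counts) * counts)  # inverse-frequency weights
criterion = nn.CrossEntropyLoss(weight=weights)  # rare classes weigh more

logits = torch.randn(8, 3)           # dummy model outputs
targets = torch.randint(0, 3, (8,))  # dummy labels
loss = criterion(logits, targets)
```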

  3. Architecture Options - notebook

    • Convolutions
    • Width vs. depth in networks
    • Pooling and subsampling
    • Attention mechanisms and transformers (sketched below)
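
The attention item can be illustrated with a pre-norm transformer encoder block. This is a generic PyTorch sketch with an assumed embedding size and head count, not the tutorial's exact architecture.

```python
import torch
import torch.nn as nn

class AttentionBlock(nn.Module):
    """Pre-norm transformer encoder block: self-attention + MLP, both residual."""
    def __init__(self, dim=64, heads=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):  # x: (batch, tokens, dim)
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # residual attention
        return x + self.mlp(self.norm2(x))                 # residual MLP

tokens = torch.randn(2, 16, 64)  # e.g. 16 image patches embedded in 64 dims
out = AttentionBlock()(tokens)   # shape preserved: (2, 16, 64)
```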
  4. Improving Optimization

    • Algorithms
    • Learning rate scheduling (see the sketch after this list)
    • Layer-wise normalization
    • Regularization and dropout
    • Data augmentation
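
A minimal PyTorch sketch combining an optimizer with cosine learning-rate annealing; the model, hyperparameters, and dummy loss are placeholders for illustration.

```python
import torch

model = torch.nn.Linear(10, 2)  # stand-in for a real network
opt = torch.optim.SGD(model.parameters(), lr=0.1,
                      momentum=0.9, weight_decay=5e-4)
# Cosine annealing decays the learning rate from 0.1 towards zero
# over T_max epochs, often outperforming a fixed learning rate.
sched = torch.optim.lr_scheduler.CosineAnnealingLR(opt, T_max=100)

for epoch in range(100):
    loss = model(torch.randn(4, 10)).pow(2).mean()  # dummy loss
    opt.zero_grad()
    loss.backward()
    opt.step()
    sched.step()  # one scheduler step per epoch
```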
  5. Basic training procedures

    • Transfer learning and fine-tuning (sketched below)
    • Feature extraction
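
Here is a sketch of the feature-extraction recipe using a torchvision backbone (assuming torchvision >= 0.13 for the weights API); the 10-class head is a hypothetical target task, not one from the tutorial.

```python
import torch.nn as nn
from torchvision import models

# Load an ImageNet-pretrained backbone.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# Feature extraction: freeze the pretrained backbone...
for p in model.parameters():
    p.requires_grad = False
# ...and replace the head for a hypothetical 10-class task; the new
# layer is created with requires_grad=True, so only the head trains.
model.fc = nn.Linear(model.fc.in_features, 10)

# For fine-tuning instead, skip the freezing and train all layers,
# typically with a smaller learning rate for the backbone.
```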
  6. Advanced training procedures

    • Curriculum Learning
    • Contrastive Learning - notebook (sketched below)
    • Self-supervised Learning
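
For contrastive learning, a compact sketch of a SimCLR-style NT-Xent loss in PyTorch; the batch size, embedding dimension, and temperature are illustrative assumptions, and the random embeddings stand in for the outputs of an encoder on two augmented views.

```python
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, temperature=0.5):
    """SimCLR-style NT-Xent loss: z1 and z2 are (N, dim) embeddings of two
    augmented views of the same N examples; each row's positive is its
    counterpart in the other view, all other rows act as negatives."""
    z = F.normalize(torch.cat([z1, z2]), dim=1)  # (2N, dim), unit norm
    sim = z @ z.t() / temperature                # scaled cosine similarities
    sim.fill_diagonal_(float("-inf"))            # exclude self-similarity
    n = z1.size(0)
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(n)])
    return F.cross_entropy(sim, targets)

loss = nt_xent(torch.randn(8, 128), torch.randn(8, 128))  # dummy embeddings
```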
  7. Improving predictions

    • Novel activation functions
    • Test-time augmentation (sketched below)
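
Test-time augmentation can be as simple as averaging predictions over a few cheap transforms. A minimal PyTorch sketch using only a horizontal flip, which assumes flipping is label-preserving for the task at hand:

```python
import torch

@torch.no_grad()
def predict_tta(model, images):
    """Average class probabilities over simple test-time augmentations
    (identity + horizontal flip); images: (N, C, H, W)."""
    model.eval()
    probs = model(images).softmax(dim=1)
    probs = probs + model(torch.flip(images, dims=[3])).softmax(dim=1)
    return probs / 2

# Usage with any image classifier, e.g.:
# probs = predict_tta(model, batch)
```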
