[IMBD 2021] Pytorch-2021-IMBD-Reggression

2021全國智慧製造大數據分析競賽

└── README.md 

主要訓練程式碼
├── train.py                執行訓練檔
├── training.yaml           調整訓練參數
├── dataset.py              讀取訓練驗證資料
├── model.py                網路架構
└── checkpoint              訓練完成存模型及log的資料夾

主要測試程式碼   
├── demo.py                 執行預測並匯出結果csv
└── test.py                 執行預測並計算成績(有答案)

其他程式碼
├── utils
|    ├── csv_utils          csv檔相關函式
|    ├── dir_utils          路徑相關函式
|    ├── model_utils        網路模型相關函式
|    └── score_utils        計算分數相關函式
├── csv_data
|    ├── testing            測試csv資料夾    
|    └── training           訓練csv資料夾
└── colab ver_              Colab版本

0. Competition result

Preliminary uploaded data
Preliminary result
- Participating teams: 118
- Official Preliminary list
Finals uploaded data
- Confidentiality agreement which couldn't publish the finals' data!
Finals result
- Participating teams: 31
- Official Finalists

1. Training

1.1 Prepair training data

Official training data: 98072

Data format (Our objective is entering the input features F1 ~ F13 and predict the final Output)

Data number	F1	F2	F3	F4	F5	F6	F7	F8	F9	F10	F11	F12	F13	Output
1	0	23.5	23.6	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	0
2	0	23.5	23.6	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	-0.6
3	0	23.5	23.5	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	0.6
4	0	23.5	23.5	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	-0.6
5	0	23.5	23.6	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	-0.3
......	...	...	...	...	...	...	...	...	...	...	...	...	...	...

CSV to independent data
Because the training data has some deviations which the same input feature values get different output results as showed below:

Data number	F1	F2	F3	F4	F5	F6	F7	F8	F9	F10	F11	F12	F13	Output
1	0	23.5	23.6	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	0
2	0	23.5	23.6	23.6	23.6	23.8	24.3	23.6	23.5	22.6	23.3	23.1	22.3	-0.6
......	...	...	...	...	...	...	...	...	...	...	...	...	...	...

You could run csv_utils.py to let all training data are independent with mean or mediam value of output.

1.2 Set hyperparameters and train

Configuration file: training.yaml

  TRAINING:
    Network: 'MLP'
    EPOCH: 1000
    LR: 0.01
    LR_MIN: 0.0001
    GPU: true
    BATCH: 1000
    VAL_RATE: 0.8  # split validation set from training set
    VAL_AFTER_EVERY: 1  # save the model per ? epoch
    TRAIN_DIR: './csv_data/training/independent_mean.csv'  # path to training data
    SAVE_DIR: './checkpoints'  # path to save models and images

Start training: train.py
```
python train.py
```

1.3 Training and validation loss curve

log file direction: checkpoints -> log folder
```
tensorboard --logdir [log path]
```

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
checkpoints		checkpoints
colab ver.		colab ver.
csv_data		csv_data
figures		figures
final_code		final_code
preliminary file		preliminary file
utils		utils
warmup_scheduler		warmup_scheduler
README.md		README.md
dataset.py		dataset.py
demo.py		demo.py
model.py		model.py
pretrained.pth		pretrained.pth
train.py		train.py
training.yaml		training.yaml

FanChiMao/Competition-2021-Pytorch-Reggression

Folders and files

Latest commit

History

Repository files navigation

[IMBD 2021] Pytorch-2021-IMBD-Reggression

0. Competition result

1. Training

1.1 Prepair training data

1.2 Set hyperparameters and train

1.3 Training and validation loss curve

2. Testing

2.1 Prepair preliminary testing data

2.2 Load the model and test

2.3 Score

3. Reference

About

Topics

Resources

Stars

Watchers

Forks

Languages