This repository contains a PyTorch implementation of the Transformer model as described in the paper "Attention Is All You Need" by Vaswani et al. (2017).
This implementation provides a complete pipeline for training and using Transformer models, including:
- Data preprocessing from Excel files
- Model training with customizable hyperparameters
- Validation during training
- Model architecture following the original paper
## Project Structure

```
.
├── main.py          # Main training script
├── train.py         # Training loop implementation
├── preprocess.py    # Data preprocessing utilities
├── arguments.py     # Command line argument definitions
├── module/          # Core model components
│   ├── Models.py    # Transformer model architecture
│   ├── Constants.py # Constants and special tokens
│   └── Optim.py     # Optimizer with learning rate scheduling
├── data/            # Data directory
└── output/          # Training outputs and model checkpoints
```
## Requirements

- Python 3.6+
- PyTorch
- NumPy
- pandas (for Excel file handling)
## Usage

1. Prepare your data in Excel format (see the example below).
2. Run the training script:

```bash
python main.py --excel path/to/your/data.xlsx --output_dir output
```
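The exact column layout the pipeline expects is defined in `preprocess.py`. As a minimal, hypothetical illustration of step 1 (the `source` and `target` column names below are assumptions, not the script's actual schema), a parallel-text Excel file could be created with pandas:

```python
import pandas as pd

# Hypothetical layout: one source/target sentence pair per row.
# Note: writing .xlsx files with pandas requires the openpyxl package.
pairs = [
    ("hello world", "hallo welt"),
    ("how are you", "wie geht es dir"),
]
df = pd.DataFrame(pairs, columns=["source", "target"])
df.to_excel("data/train.xlsx", index=False)
```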
## Command Line Arguments

- `--excel`: Path to the input Excel file
- `--output_dir`: Directory to save model checkpoints
- `--batch_size`: Batch size for training
- `--d_model`: Model dimension
- `--n_layers`: Number of Transformer layers
- `--n_head`: Number of attention heads
- `--dropout`: Dropout rate
- `--n_warmup_steps`: Number of warmup steps for learning rate scheduling
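`--d_model` and `--n_warmup_steps` parameterize the warmup learning-rate schedule handled by `module/Optim.py`. Assuming it follows the schedule from the paper, a minimal sketch as a standalone function (illustrative only, not the actual `Optim.py` interface):

```python
def noam_lr(step: int, d_model: int, n_warmup_steps: int) -> float:
    """Learning rate schedule from "Attention Is All You Need" (Section 5.3):
    lr = d_model^-0.5 * min(step^-0.5, step * n_warmup_steps^-1.5)
    """
    step = max(step, 1)  # guard against division by zero at step 0
    return d_model ** -0.5 * min(step ** -0.5, step * n_warmup_steps ** -1.5)

# The rate rises linearly for n_warmup_steps, then decays as step^-0.5:
for step in (1, 2000, 4000, 8000):
    print(step, noam_lr(step, d_model=512, n_warmup_steps=4000))
```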
## Model Architecture

The implementation follows the original Transformer architecture, including:

- Multi-head self-attention (see the sketch after this list)
- Position-wise feed-forward networks
- Layer normalization
- Residual connections
- Positional encoding
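For reference, the scaled dot-product attention at the core of multi-head attention can be written in a few lines of PyTorch. This is a self-contained sketch following the paper's equation, not the exact code in `module/Models.py`:

```python
import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = q.size(-1)
    scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(d_k)
    if mask is not None:
        # Positions where mask == 0 are excluded from attention.
        scores = scores.masked_fill(mask == 0, float("-inf"))
    attn = F.softmax(scores, dim=-1)
    return torch.matmul(attn, v), attn
```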
## Acknowledgements

This implementation is based on:

- The original Transformer paper: "Attention Is All You Need" by Vaswani et al. (2017)
- The module implementation by jadore801120