LSTM Quaternion Sequence Extrapolation (QLSTM)

Overview • Screenshots • Installation • Requirements • Acknowledgements • License

Overview

This repository is dedicated to the exploration and implementation of LSTM networks using quaternion data, as part of the research project titled "Quaternion sequences in deep learning". The primary focus is on the application of quaternion algebra in deep learning for predicting sequences, particularly in the context of rotation sequences.

Abstract

The purpose of this paper is to verify the hypothesis that neural networks are capable of working with quaternion data. The main problem of this issue was the absence of common commercial solutions using artificial intelligence based on quaternion data. The decisive factor in this situation is the innovative nature of the solution, the popularity of which in scientific circles has only increased in the early 21st century. Research on systems integrating neural networks with quaternion data is important for the further development of the field of machine learning, making it possible to identify aspects in which the use of quaternions has an impact on the efficiency and precision of the network. For the purposes of the research, a model of the quaternion recurrent network QLSTM was developed, all of whose elements are in the form of quaternion data, and the key processes of machine learning were extended by the algebra of quaternions. A self-developed loss function was also implemented, determining the error based on the angle included between the resulting quaternion and the expected quaternion. The research was conducted on training sets that are quaternion sequences describing joint rotations over time. The experiments focused on comparing the results of networks equipped with Hamilton algebra, with basic recurrent networks in a regression problem. This problem involves predicting the further progress of rotation based on the input sequence of quaternions. The conclusions of the study define the advantage of the QLSTM network in the context of working with quaternion data, also highlighting the problems associated with it.

Key Features

Quaternion Networks: Implementation of LSTM networks using quaternion data.
QALE Loss Function: A custom loss function for evaluating prediction accuracy based on quaternion angles.
Quaternion Data Handling: Techniques for processing and manipulating quaternion data within neural networks.

Screenshots

Animation presentation of an extrapolated skeleton from a motion capture recording in Mokka software.

Installation

Set Up Conda Environment:
- Open a Conda terminal in the repository directory.
- Create the environment using the provided environment.yml file:
```
conda env create -f environment.yml
```
IDE Configuration:
- Open the source code in your preferred IDE.
- Set the Python interpreter to qlstm.

Usage

The repository includes scripts and modules for training and evaluating the QLSTM model.
Detailed usage instructions are provided in the thesis.pdf file.

Requirements

Python 3.10
Conda

Acknowledgements

Special thanks to Titouan Parcollet for the foundational work in quaternion neural networks. His PhD thesis and the code available at Orkis-Research/Pytorch-Quaternion-Neural-Networks were invaluable resources in this project.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Portfolio • GitHub • LinkedIn • YouTube • TikTok

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.vscode		.vscode
data/mockup		data/mockup
lstm_quaternion_sequence_extrapolation		lstm_quaternion_sequence_extrapolation
models		models
runs		runs
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Thesis.pdf		Thesis.pdf
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.vscode

.vscode

data/mockup

data/mockup