OpenAI XOR Warmup

⭐ Train an LSTM to solve the XOR problem: that is, given a sequence of bits, determine its parity. The LSTM should consume the sequence, one bit at a time, and then output the correct answer at the sequence’s end.

Questions

Test the two approaches below:

Generate a dataset of random 100,000 binary strings of length 50. Train the LSTM; what performance do you get?

It converges after 16,000 examples.

Generate a dataset of random 100,000 binary strings, where the length of each string is independently and randomly chosen between 1 and 50. Train the LSTM. Does it succeed? What explains the difference?

Yes. It converges after 4,000 examples. It converges faster is because it is easier to learn shorter sequences.

Getting Started

Install dependencies

pip install -r requirements.txt

Train an LSTM without varying lengths

python train.py --momentum 0.99

Train an LSTM with varying lengths

python train.py --vary_lengths True --momentum 0.9

Train a basic neural network written in Numpy to learn XOR on colab

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py
xor_dataset.py		xor_dataset.py
xor_dataset_test.py		xor_dataset_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

notebooks

notebooks

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

train.py

train.py

utils.py

utils.py

xor_dataset.py

xor_dataset.py

xor_dataset_test.py

xor_dataset_test.py

Repository files navigation

OpenAI XOR Warmup

Questions

Getting Started

About

Releases

Packages

Contributors 2

Languages

License

jperl/xor

Folders and files

Latest commit

History

Repository files navigation

OpenAI XOR Warmup

Questions

Getting Started

About

Topics

Resources

License

Stars

Watchers

Forks

Languages