StockMate

A python based tool to build agents and forecasters for stock price forecasting and trade automation.

StockMate is just a temporary name, I'll change it once I come up with a better one.

What is StockMate

StockMate is a Python based tool where you can create models for stock price prediction and trade automation. Using StockMate you can use the provided frameworks to get the latest stock data, build and test models for forecasting, etc. Currently StockMate's data retrieval is built around India's National Stock Exchange (NSE), and support for other exchanges haven't been built.

Terminology

1. Forecasters

Forecasters are regression models for predicting stock prices. Forecasters can be trained for individual stocks or for entire indices. A few forecasters have been implemented already and can be used out of the box.

2. Agents

Agents are used to automate trading. Agents decide when to buy, sell, or hold stock. Currently there are no free trading apis, so the next best solution for automated trading is to make a chatbot that tells you when to buy and sell.

3. Data Processing Framework (DPF)

The Data Processing Framework was something that had to be built to make the models more portable. Think of the model as a black box; Each model can have its own way of handling

The incoming raw stock data
- Operations like normalization, etc, could be applied to the raw stock data before being fed to the model
The model output
- The model output could be the normalized price values, which then have to get converted back to the original prices

These two functions have been abstracted out of the model and incorporated into a Data Processing Framework (DPF). Basically what this means is that, to create a model you would have define your own DPF by inheriting Core.DataProcessor and overriding the following methods,

inputProcessor()
outputProcessor()

Please check the documentation under Core.DataProcessor to see what the parameters that are passed into it and what the expected outputs are.

Also, a few DPFs have already been implemented for handing univariate and multivariate stock data respectively. You can find them under Examples/Processors/BasicProcessors.py

4. Other Things to Know

forecast - The number of days in the future for which prices are to be predicted
lookBack - The number of days to be used to make forecast predictions.
interval - To specify what kind of data the model is going to train on. 1 day interval data or 5 minute intervals, 1 minute intervals ...

UI

Currently, the implemented features for StockMate include

UI Forecaster Creation

Viewing Forecasters

The following models have been implemented,

Viewing Agent Actions

Chatbot Features

Forecasters

ANN.BasicRegressor
ANN.DenseRegressor
LSTM.BasicLSTM
LSTM.DenseLSTM
CNN.BasicCNN
CNN.WaveNet

Agents

QLearning.BasicDQN

Frameworks

A Framework for creating forecaster models
Framework for agent creation
Data processing framework

Planned Features

Packaging this repo
Creating Agents through the UI
Additional customization for viewing models in the UI
Implementation of additional forecasters and agents

WTF? TF1 and TF2 Together?

Well yes. The Forecasters use TF2 and Agents use TF1. After going through pages of Tensorflow documentation, it looks like they have compromised on speed for the sake of better looking code. Yes, I tried the @tf.function decorators, and yes I tried tf.compat.v1.disable_eager_execution(). No matter what, TF1 was just faster.

How much time did I spend you ask? Well I spent over two week trying to find a way to keep it uniformly TF2. I even raised an issue to Tensorflow. But at the end, for the sake of speed, I sadly had no choice but go with TF1.

To prove my point further, here is a benchmark comparing TF1, TF2 and PyTorch. The benchmark was run only on CPU for 1 epoch of DQN.

Framework	TF1	TF2	PyTorch
Time (s)	9.32	258	42

Although it would have increased the speeds, I did not build and install Tensorflow locally.

Usage

Prerequisites

plotly
selenium
phantomjs driver
tensorflow 2.0 or greater
- As of June 5th 2020, tensorflow 2.0 has an error when loading a saved model containing LSTM layers. So the workaround for this is to install tf-nightly as they have patched this in this version.
lxml
python-telegram-bot

Installation and Setup

It is recommended to use this repo inside a virtual environment. Make sure you are using Python 3.5 or greater.

pip install virtualenv
python -m venv stockmate
source stockmate/bin/activate

Installing all the python dependencies dependencies,

pip install plotly tensorflow lxml selenium flask python-telegram-bot

Clone this repository to some location

git clone https://github.com/AakashSasikumar/StockMate.git

Start the flask server
```
python app.py
```
Go to the Manage Subscriptions page and set up a Telegram Bot for yourself by following the instructions listed

Saving Stock Data

1. Data retrieval for a single stock (TCS)

from DataStore.APIInterface import YFinance
import pandas as pd

ticker = "TCS"

source = YFinance()

oneDayInterval = source.getIntraDay(ticker)
# The data returned is the entire historical data of TCS with 1 day interval
oneMinuteInterval = source.getInterDay(ticker, "1m")
# The data returned is the past 7 days of 1 minute interval
fiveMinuteInterval = source.getInterDay(ticker, "5m")
# The data returned is the past 60 days of 5 minute interval

# All of the data returned are pandas dataframes

# These limits on how far back it goes are set by the yahoo finance api

2. Saving ticker data for an entire index

from DataStore import Indices
from DataStore.APIInterface import YFinance

# For category and index names, check the file saved by running the setup.py file
category = "Broad Market Indices :"
index = "NIFTY 50"

nse = Indices.NSEIndices()
indices = nse.getIndices()

# for a single api key
source = YFinance(autoRotate=True)

constituents = indices["type"][category][index]

for stock in constituents:
    source.saveIntraDay(stock, savePath="DataStore/StockData/)

# The above method saves the stock data in that location directly

AutoRotate is a feature that takes a list of api keys and rotates them so that the daily limit can be breached. It also scrapes a list of proxy addresses so that AlphaVantage doesn't block the source IP.

Forecaster Creation

from DataStore import Indices
from Examples.Processors import MultiVarProcessor
from Models.Forecasters.CNN import WaveNet

indices = Indices.NSEIndices().getIndices()
category = "Broad Market Indices"
index = "NIFTY 50"

constituents = indices["type"][category][index]

# how many days to look back to make prediction
lookBack = 30
# how many days in future to predict
forecast = 5

# target feature is the feature that we want to predict
# in this case it is the closing price
dpf = MultiVarProcessor(tickers=constituents, features=["Open", "High",
                                                        "Low", "Close",
                                                        "Volume"],
                        lookBack=lookBack, forecast=forecast,
                        targetFeature="Close", interval="1d",
                        isSeq2Seq=True)

model = WaveNet()
model.assignDataProcessor(dpf)
model.buildModel(learningRate=1e-5)
model.train(validationSplit=0.9, epochs=1000, batchSize=64)
# By default, models will save in DataStore/SavedModels/
model.saveModel("waveNetTest")
# saved models can be loaded again by calling 
# model.loadModel("name") and trained/inferenced upon

Agent Creation

from Examples.Processors import DQNProcessor
from Models.Agents.QLearning import BasicDQN

dpf = DQNProcessor(tickers=["IOC"], features=["Close"],
                   lookBack=30, interval="1d")

agent = BasicDQN(initialMoney=10000, gamma=0.95,
                 epsilon=0.5, epsilonDecay=0.99,
                 epsilonMin=0.01, batchSize=32)

agent.assignDataProcessor(dpf)
agent.buildModel(learningRate=1e-5)
agent.train(epochs=200)
agent.saveModel("basicAgent")
# This model can now be viewed in the UI, or you may
# choose to plot it yourself

Name		Name	Last commit message	Last commit date
Latest commit History 112 Commits
Core		Core
DataStore		DataStore
Examples/Processors		Examples/Processors
Images		Images
Misc/Benchmarks		Misc/Benchmarks
Models		Models
UI		UI
Utils		Utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
app.py		app.py

AakashSasikumar/StockMate

Folders and files

Latest commit

History

Repository files navigation