Skip to content

A python and common lisp based implementation of Kalman Temporal Difference (KalmanTD, kalman-td)

Notifications You must be signed in to change notification settings

digikar99/kalman-td

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Kalman TD

A python and common lisp based implementation of Kalman TD based on

  1. A Unifying Probabilistic View of Associative Learning, Gershman (2015)
  2. Kalman Temporal Differences, Geist and Pietquin (2014) - specifically algorithm 1
  3. Kalman Filter - Wikipedia

The python version was originally developed as a course project in Computational Cognitive Science (CS786, IIT Kanpur) under Prof. Nisheeth Srivastava, in collaboration with Anish Thankachan and Diksha Yadav. Since then, a python/simulations_helper.py has been added to help simplify the simulations. The report (based on the python version) is available here.

The main interface (similar in python and common lisp) comprises of:

  • file: KalmanTD.py / kalman-td.lisp
  • class: KalmanTD / kalman-td
  • function: predict_reward / predict-reward
  • function: update / update
  • function: default_kalman_td / default-kalman-td

About 9 simulations covered in Gershman (2015) have been coded up in the following files. These also illustrate example usage of the above classes and functions.

Recommended way to run the simulations in common lisp is (in-package :kalman-td). In other cases, one may need to play around with *array-element-type*.

Other Implementations

About

A python and common lisp based implementation of Kalman Temporal Difference (KalmanTD, kalman-td)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published