README file for CME 241 (Winter quarter 2020)

Each week, I release a new version to track the progress of my work.

Lectures

The PDF files summarize the most important notions studied in the lectures.

  • lecture_1.pdf sums up the notions of Markov Chains, State-Value Functions and Optimal Policies studied during lecture 1.
  • lecture_3.pdf sums up the most important notions from the Utility Theory lecture and the main results of the Portfolio application problems for CARA and CRRA utility functions.
  • RL.pdf contains some RL theorems and properties with their proofs.

Notebooks

  • problem1.ipynb (Merton portfolio problem: lecture summary + code application)
  • problem2.ipynb (Option pricing: lecture summary + code illustration)

./code folder

  • processes

This folder contains Python files implementing Markov Processes, Markov Reward Processes and Markov Decision Processes. All these processes are modelled as Python classes; the objective is to define objects that the Dynamic Programming and Reinforcement Learning algorithms can operate on. The class structure is incremental: the _MP_ class is the basis for all the other processes.
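A minimal sketch of that incremental structure, assuming hypothetical class and attribute names (the actual API in this folder may differ):

```python
class MP:
    """Markov Process built from {state: {successor: probability}}."""
    def __init__(self, transitions):
        self.transitions = transitions
        self.states = list(transitions.keys())

class MRP(MP):
    """Markov Reward Process: an MP plus a reward attached to each state."""
    def __init__(self, data, gamma=1.0):
        # data maps state -> ({successor: probability}, reward)
        super().__init__({s: probs for s, (probs, _) in data.items()})
        self.rewards = {s: r for s, (_, r) in data.items()}
        self.gamma = gamma
```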

  • DP (Main Dynamic Programming algorithms)

  • RL (Main Reinforcement Learning algorithms)

    • algorithms for prediction

    • algorithms for control

    • algorithms for value approximation (prediction only)

  • option_pricing.py (code for pricing European and American options)

  • run_predicitions.py (code for running predictions with DP and RL algorithms)

  • utils

    • Helper code for the algorithms above

    • sampling.py

Functions to generate sequences of episodes given an MDP and a Policy; a sketch is given below.
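A minimal sketch of such an episode generator, assuming the MDP and Policy dict formats described in the next section (the function and argument names are hypothetical):

```python
import random

def sample_episode(mdp, policy, start_state, max_steps=100):
    """Sample one episode [(state, action, reward), ...] by drawing
    actions from the policy and successors from the MDP dynamics."""
    episode, state = [], start_state
    for _ in range(max_steps):
        # Draw an action according to the policy's probabilities.
        actions, action_probs = zip(*policy[state].items())
        action = random.choices(actions, weights=action_probs)[0]
        # Look up the transition distribution and reward for (state, action).
        successors, reward = mdp[state][action]
        episode.append((state, action, reward))
        # Draw the next state according to the transition probabilities.
        next_states, next_probs = zip(*successors.items())
        state = random.choices(next_states, weights=next_probs)[0]
    return episode
```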

Type of data

Discrete Markov chains are implemented as Python classes. The data that feed these objects are stored as dicts. Here are some examples:

  • MP :
{1: {2: 0.25, 3: 0.75}, 2: {2: 1}, 3: {2: 0.45, 3: 0.55}}
  • MRP :
{1: ({2: 0.25, 3: 0.75}, 10), 2: ({2: 1}, 15), 3: ({2: 0.45, 3: 0.55}, -5)}
  • Policy :
{1: {'a': 0.4, 'b': 0.6}, 2: {'a': 0.7, 'c': 0.3}, 3: {'b': 1.0}}
  • MDP :
{1: {'a': ({1: 0.3, 2: 0.6, 3: 0.1}, 5.0), 'b': ({2: 0.3, 3: 0.7}, 2.8), 'c': ({1: 0.2, 2: 0.4, 3: 0.4}, -7.2)},
 2: {'a': ({1: 0.3, 2: 0.6, 3: 0.1}, 5.0), 'c': ({1: 0.2, 2: 0.4, 3: 0.4}, -7.2)},
 3: {'a': ({3: 1.0}, 0.0), 'b': ({3: 1.0}, 0.0)}}
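For example, the MP dict above can be turned into the transition matrix shown in the Usage section along these lines (the helper name is mine, not necessarily the one used in mp.py):

```python
import numpy as np

mp_data = {1: {2: 0.25, 3: 0.75}, 2: {2: 1}, 3: {2: 0.45, 3: 0.55}}

def transition_matrix(data):
    """Build the row-stochastic transition matrix from the dict encoding."""
    states = sorted(data)
    index = {s: i for i, s in enumerate(states)}
    P = np.zeros((len(states), len(states)))
    for s, successors in data.items():
        for s_next, prob in successors.items():
            P[index[s], index[s_next]] = prob
    return P

print(transition_matrix(mp_data))
# [[0.   0.25 0.75]
#  [0.   1.   0.  ]
#  [0.   0.45 0.55]]
```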

Usage

  • To illustrate the methods and attributes of an MP object, run: python3 mp.py

The output is:

· states list : {1: {2: 0.25, 3: 0.75}, 2: {2: 1}, 3: {2: 0.45, 3: 0.55}}
· number of states : 3
· sink states : {2}
· matrix transition :
[[0.   0.25 0.75]
 [0.   1.   0.  ]
 [0.   0.45 0.55]]
{1: 0.0, 2: 1.0, 3: 0.0}
stationary : {1: 0.5333333333333333, 2: 0.4666666666666667}
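For reference, a stationary distribution can be approximated by power iteration on the transition matrix; a minimal sketch, assuming a row-stochastic matrix P as built above (this may not be the exact method implemented in mp.py):

```python
import numpy as np

def stationary_distribution(P, n_iter=1000):
    """Approximate a stationary distribution by repeatedly applying
    pi <- pi @ P from a uniform starting vector."""
    pi = np.full(P.shape[0], 1.0 / P.shape[0])
    for _ in range(n_iter):
        pi = pi @ P
    return pi / pi.sum()
```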

The processes folder also contains a Python file policy.py implementing a Policy class, which is used by the _MDP_ class, and a file det_policy.py for deterministic policies, used in the policy-improvement method of MDP objects.
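As a hedged illustration, a policy-improvement step typically computes a deterministic policy that is greedy with respect to the current value function; a sketch in the MDP dict format above (not the repository's exact implementation):

```python
def greedy_policy(mdp, V, gamma=0.9):
    """Return a deterministic policy {state: action} that is greedy
    with respect to the value function V."""
    policy = {}
    for s, actions in mdp.items():
        best_action, best_q = None, float('-inf')
        for a, (successors, reward) in actions.items():
            # One-step lookahead: immediate reward + discounted next value.
            q = reward + gamma * sum(p * V[s2] for s2, p in successors.items())
            if q > best_q:
                best_action, best_q = a, q
        policy[s] = best_action
    return policy
```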
