Skip to content

Latest commit

 

History

History
23 lines (15 loc) · 1.48 KB

incremental-learning-1.md

File metadata and controls

23 lines (15 loc) · 1.48 KB

Reinforcement Learning

  1. Vidhya on Getting ready for AI based gaming agents – Overview of Open Source Reinforcement Learning Platforms
  2. Vidhya on Simple Beginner’s guide to Reinforcement Learning & its implementation
  3. ZipRecruiter on Classifying Job Titles With Noisy Labels Using REINFORCE - Fine-grained job title classification with noisy labels using the REINFORCE algorithm and multi-task learning
    1. this article has a very nice trick in adding a reward component to the loss function in order to mitigate for unbalanced class label problem, instead of the usual balancing.

Q-LEARN

  • Markov chain problem, (state, action, new state, reward)
  • Lots of Exploration in the beginning, then exploitation
  • Returns optimal policy.
  • Refer to youtube here

Deep Learning

  1. A review paper about RL in DL
  2. deep Q-learning\

RLHF

  1. illustrated RLHF by Huggingface