Skip to content
View otmhi's full-sized avatar
  • CREST | CentraleSupelec | ENS
  • Paris, France

Highlights

  • Pro
Block or Report

Block or report otmhi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. sadeqa/Super-Mario-Bros-RL sadeqa/Super-Mario-Bros-RL Public

    This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super Mario Bros

    Jupyter Notebook 75 15

  2. fopo fopo Public

    Source code for the paper "Fast Offline Policy Optimization for Large Scale Recommendation" published at the Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23).

    Python 2

  3. criteo-research/blob criteo-research/blob Public

    Source code for our paper "BLOB: a probabilistic model for recommendation that combines organic and bandit signals" published at KDD 2020.

    Python 15 10

  4. Reward-Optimizing-Reco Reward-Optimizing-Reco Public

    Materials for the "Reward Optimising Recommendation using Deep Learning and Fast Maximum Inner Product Search" tutorial delivered at the 28th SIGKDD Conference on Knowledge Discovery and Data Minin…

    Jupyter Notebook 5 2

  5. olivierjeunen/decision-theory-www-2021 olivierjeunen/decision-theory-www-2021 Public

    Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).

    Jupyter Notebook 11 1

  6. nd7141/recsystutorial nd7141/recsystutorial Public

    Jupyter Notebook 16 1