habanoz
💭 Busy...

Pinned

  1. tezos-reward-distributor-organization/tezos-reward-distributor Public

    Tezos Reward Distributor (TRD): Reward distribution software for Tezos bakers.

    Python 88 50

  2. reinforcement-learning-an-introduction Public

    Solutions to the exercises in the Sutton and Barto book

    TeX 15 2

  3. qlora_templates Public

    Forked from jondurbin/qlora

    QLoRA: Efficient Finetuning of Quantized LLMs (Uses Huggingface Chat Templates)

    Python

  4. cs330-2021-stanford-meta-learning-hw-answers Public

    Coding homework answers for http://cs330.stanford.edu/fall2021

    Python 3

  5. berkeley_rl_hw_answers Public

    Forked from berkeleydeeprlcourse/homework_fall2022

    My answers to the assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)

    Jupyter Notebook 2

  6. rl-algo Public

    Implementations of classic RL control algorithms from the Sutton and Barto book.

    Python 1