In this project, we explore entropy and information in language models and how they can be optimized for generative tasks.
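To make the idea concrete, here is a minimal sketch (my own illustration, not this project's code) of one quantity such a project would track: the Shannon entropy of a model's next-token distribution. The Hugging Face "gpt2" checkpoint and prompt are assumptions used only for the example.

```python
# Hedged illustration: entropy of a language model's next-token distribution.
# The model name "gpt2" is an assumption, not something taken from this project.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tok("The capital of France is", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits[0, -1]           # logits for the next token
probs = torch.softmax(logits, dim=-1)
entropy = -(probs * torch.log(probs + 1e-12)).sum()  # Shannon entropy in nats
print(entropy.item())
```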
Novel technique to fit a target distribution with a class of distributions using SVI (via NumPyro). Unlike standard SVI, our "data" is a distribution rather than a finite collection of samples.
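A rough sketch of the general idea, under my own assumptions (a Student-t target and a Normal variational family; none of this is taken from the repository): with no observed data, NumPyro's Trace_ELBO reduces to the negative KL divergence from the guide to the model, so SVI pulls the parametric family toward the target distribution.

```python
# Hedged sketch: fit a Normal family to a known target density with NumPyro SVI.
# Maximizing the ELBO here is equivalent to minimizing KL(guide || target).
from jax import random
import numpyro
import numpyro.distributions as dist
from numpyro.infer import SVI, Trace_ELBO

def model():
    # The "data" is a distribution: a Student-t target we want to approximate.
    numpyro.sample("x", dist.StudentT(df=3.0, loc=2.0, scale=1.5))

def guide():
    # Parametric family q(x) = Normal(loc, scale) with learnable parameters.
    loc = numpyro.param("loc", 0.0)
    scale = numpyro.param("scale", 1.0, constraint=dist.constraints.positive)
    numpyro.sample("x", dist.Normal(loc, scale))

svi = SVI(model, guide, numpyro.optim.Adam(step_size=1e-2),
          loss=Trace_ELBO(num_particles=32))
result = svi.run(random.PRNGKey(0), 5000)
print(result.params)  # fitted loc/scale of the closest Normal (in KL) to the target
```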
Using Monte Carlo-simulated datasets, a fully transparent Boltzmann machine is trained on 1-D Ising chain data to predict model couplers in the absence of past coupler values, demonstrating machine-learning methods applied to theoretical physics.
A PyTorch implementation of "Generating Sentences from a Continuous Space" (Bowman et al., 2015).
Implementation of the Non-negative Multiple Matrix Factorization (NMMF) algorithm proposed in Takeuchi et al., 2013, with some modifications. There is a native Python version, NMMFlexPy, and an R wrapper, NMMFlexR.
Kullback-Leibler divergence in Python
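For reference, a hedged example of the basic computation (not necessarily this repository's API): the discrete KL divergence D_KL(P || Q) = sum_i P(i) log(P(i)/Q(i)).

```python
# Hedged illustration: KL divergence between two discrete distributions in nats.
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """Return D_KL(P || Q) for discrete distributions p and q (normalized internally)."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sum(p * np.log((p + eps) / (q + eps))))

print(kl_divergence([0.5, 0.5], [0.9, 0.1]))  # ~0.51 nats
```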
This repository contains the lab work for the Coursera course "Generative AI with Large Language Models".
My MSc project on applying, tuning, and modifying the PPO and A2C algorithms for the two-player poker game in the PettingZoo MARL library.
Change point detection using KL divergence
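One common recipe, sketched here under my own assumptions (sliding windows over a 1-D signal and histogram density estimates; this is not claimed to be the repository's method): score each time step by the KL divergence between the empirical distributions of the window before and the window after it, and flag peaks as candidate change points.

```python
# Hedged sketch: sliding-window change point scoring via KL divergence of histograms.
import numpy as np

def kl_hist(a, b, bins=20, eps=1e-9):
    """KL divergence between histogram estimates of two samples on a shared range."""
    lo, hi = min(a.min(), b.min()), max(a.max(), b.max())
    p, _ = np.histogram(a, bins=bins, range=(lo, hi))
    q, _ = np.histogram(b, bins=bins, range=(lo, hi))
    p = (p + eps) / (p + eps).sum()
    q = (q + eps) / (q + eps).sum()
    return float(np.sum(p * np.log(p / q)))

def change_scores(x, window=50):
    # Score each position by KL between the windows immediately before and after it.
    return [kl_hist(x[t - window:t], x[t:t + window])
            for t in range(window, len(x) - window)]

# Synthetic example: a mean shift halfway through shows up as a peak in the scores.
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(0, 1, 500), rng.normal(2, 1, 500)])
scores = change_scores(x)
print(int(np.argmax(scores)) + 50)  # index of the most likely change point (~500)
```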
The Unstable Population Indicator
A PyTorch package for non-negative matrix factorization.
Implementation of a Denoising Diffusion Probabilistic Model with some mathematical background.
Some code to get started with Optimal Relative Transport. This will be updated slowly, as needed.
average-KL-divergence-calculator.py is a Python script that calculates the average KL divergence for each FASTA file in a directory and produces separate output files and a combined output file with the results.
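A hedged sketch of one plausible reading of that task (the script may define "average KL divergence" differently): for a single FASTA file, compare each sequence's nucleotide distribution with the file-wide distribution via KL divergence and average the results. The minimal FASTA parser and the choice of background distribution are assumptions for illustration.

```python
# Hedged sketch: mean KL divergence of per-sequence base frequencies vs. the file-wide
# frequencies for one FASTA file. Not taken from average-KL-divergence-calculator.py.
import numpy as np

BASES = "ACGT"

def read_fasta(path):
    """Yield sequences from a FASTA file (minimal parser, no dependencies)."""
    seq = []
    with open(path) as fh:
        for line in fh:
            if line.startswith(">"):
                if seq:
                    yield "".join(seq)
                    seq = []
            else:
                seq.append(line.strip().upper())
    if seq:
        yield "".join(seq)

def base_freqs(seq, eps=1e-9):
    counts = np.array([seq.count(b) for b in BASES], dtype=float) + eps
    return counts / counts.sum()

def average_kl(path):
    seqs = list(read_fasta(path))
    background = base_freqs("".join(seqs))
    kls = [float(np.sum(p * np.log(p / background))) for p in map(base_freqs, seqs)]
    return float(np.mean(kls))

# print(average_kl("example.fasta"))  # hypothetical input path
```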
The Dirichlet Mechanism for Differentially Private KL Divergence Minimization
Forward sampling and conversion of Bayesian networks (BN).
IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"
Scheduling TRPO's KL Divergence Constraint
PyTorch implementations of the beta divergence loss.
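As a reference point, a small hedged sketch (not necessarily this package's API) of the element-wise beta divergence, which recovers the Itakura-Saito divergence as beta approaches 0, the generalized KL divergence at beta = 1, and half the squared Euclidean distance at beta = 2.

```python
# Hedged sketch: element-wise beta divergence between non-negative tensors.
import torch

def beta_divergence(x, y, beta=1.0, eps=1e-10):
    x = x.clamp_min(eps)
    y = y.clamp_min(eps)
    if beta == 1.0:        # generalized KL divergence
        d = x * (x / y).log() - x + y
    elif beta == 0.0:      # Itakura-Saito divergence
        d = x / y - (x / y).log() - 1.0
    else:                  # general case, beta not in {0, 1}
        d = (x.pow(beta) + (beta - 1.0) * y.pow(beta)
             - beta * x * y.pow(beta - 1.0)) / (beta * (beta - 1.0))
    return d.sum()

a, b = torch.rand(4, 5), torch.rand(4, 5)
print(beta_divergence(a, b, beta=2.0))  # equals 0.5 * squared Euclidean distance
```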
This repository includes detailed proofs of the bias-variance decomposition for KL divergence.