Working on ML projects for: https://www.youtube.com/c/TheAiEpiphany
gordicaleksa/README.md


I'm a former Research Engineer at Google DeepMind and Microsoft. I run The AI Epiphany community, and I'm currently building my first startup, Runa AI. I'm also a proud father of 16 A100s and 16 H100s (generously sponsored by Together AI and Hyperstack, respectively).

Latest:
* YugoGPT - trained a SOTA 7B LLM for the Croatian, Bosnian, Serbian, and Montenegrin languages
* SlovenianGPT - trained a SOTA 7B LLM for the Slovenian language
* YugoChat - talk to YugoGPT
* First Serbian LLM eval
* Open-NLLB - Replicating Meta's "no language left behind" machine translation (MT) project

Most recent OSS contributions:

  • airoboros - synthetic instruction following data generation framework

My older projects:



Pinned

  1. Open-NLLB (Public)

    Effort to open-source NLLB checkpoints.

    Python · 379 stars · 33 forks

  2. get-started-with-JAX (Public)

    The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well a…

    Jupyter Notebook · 570 stars · 92 forks

  3. pytorch-original-transformer (Public)

    My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

    Jupyter Notebook · 937 stars · 156 forks

  4. pytorch-GAT (Public)

    My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…

    Jupyter Notebook · 2.3k stars · 311 forks

  5. pytorch-neural-style-transfer (Public)

    Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the i…

    Python · 348 stars · 77 forks

  6. serbian-llm-eval (Public)

    Serbian LLM Eval.

    Python · 82 stars · 7 forks