ollmer

Follow

Oleh Shliazhko ollmer

Follow

ML Research Engineer, LLMs for Code

45 followers · 33 following

ServiceNow Research

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Block or Report

Block or report ollmer

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

mmlu mmlu Public

Forked from hendrycks/test

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 11 2
wikichat wikichat Public

Talk to wikipedia offline with LLM!

Jupyter Notebook 3 1
lm-evaluation-harness lm-evaluation-harness Public

Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Python
SWE-agent SWE-agent Public

Forked from princeton-nlp/SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Python