Skip to content
View ollmer's full-sized avatar
  • ServiceNow Research
Block or Report

Block or report ollmer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. mmlu mmlu Public

    Forked from hendrycks/test

    Measuring Massive Multitask Language Understanding | ICLR 2021

    Python 11 2

  2. wikichat wikichat Public

    Talk to wikipedia offline with LLM!

    Jupyter Notebook 3 1

  3. lm-evaluation-harness lm-evaluation-harness Public

    Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of autoregressive language models.

    Python

  4. SWE-agent SWE-agent Public

    Forked from princeton-nlp/SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

    Python