Skip to content
View chziakas's full-sized avatar
Block or Report

Block or report chziakas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. athina-ai/ariadne athina-ai/ariadne Public

    LLM Evals for Text Summarization and RAG use-cases.

    Python 33

  2. redeval redeval Public

    Red-teaming LLM applications.

    Python 16

  3. backbone-learn backbone-learn Public

    A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.

    Python 11

  4. node-embeddings-eval node-embeddings-eval Public

    Evaluation protocol for graph embedding methods on link prediction, node classification, and node clustering

    Jupyter Notebook

  5. confident-ai/deepeval confident-ai/deepeval Public

    The LLM Evaluation Framework

    Python 2k 139

  6. humaneval_sample_eval humaneval_sample_eval Public

    This project evaluates OpenAI's GPT-3.5 model on a sample from the HumanEval dataset to assess its code generation capabilities. The implementation is built in a way that can easily integrate new m…

    Python