Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...

Showing runs from all workflows
4,909 workflow runs
4,909 workflow runs
Event

Filter by event

Status

Filter by status

Branch
Actor

Filter by actor

Update polemo2_out.yaml (#1871)
Unit Tests #2410: Commit 70e1de0 pushed by lintangsutawika
May 22, 2024 09:16 5m 33s main
May 22, 2024 09:16 5m 33s
Update polemo2_out.yaml (#1871)
Tasks Modified #2438: Commit 70e1de0 pushed by lintangsutawika
May 22, 2024 09:16 1m 49s main
May 22, 2024 09:16 1m 49s
Update polemo2_out.yaml
Unit Tests #2409: Pull request #1871 opened by zhabuye
May 22, 2024 09:15 5m 44s zhabuye:0
May 22, 2024 09:15 5m 44s
Update polemo2_out.yaml
Tasks Modified #2437: Pull request #1871 opened by zhabuye
May 22, 2024 09:15 1m 34s zhabuye:0
May 22, 2024 09:15 1m 34s
Added tests for Anthropic LLMs
Tasks Modified #2436: Pull request #1868 opened by zafstojano
May 21, 2024 16:03 16m 16s zafstojano:test-coverage-anthropic
May 21, 2024 16:03 16m 16s
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Tasks Modified #2434: Pull request #1867 synchronize by maximegmd
May 21, 2024 12:53 Action required maximegmd:main
May 21, 2024 12:53 Action required
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Unit Tests #2406: Pull request #1867 synchronize by maximegmd
May 21, 2024 12:53 Action required maximegmd:main
May 21, 2024 12:53 Action required
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Tasks Modified #2433: Pull request #1867 opened by maximegmd
May 21, 2024 12:15 Action required maximegmd:main
May 21, 2024 12:15 Action required
fixed docs typos (#1863)
Unit Tests #2404: Commit cb22e50 pushed by lintangsutawika
May 21, 2024 09:56 5m 46s main
May 21, 2024 09:56 5m 46s
fixed docs typos (#1863)
Tasks Modified #2432: Commit cb22e50 pushed by lintangsutawika
May 21, 2024 09:56 16s main
May 21, 2024 09:56 16s
Draft - Support ov models via genai
Unit Tests #2403: Pull request #1862 synchronize by sstrehlk
May 21, 2024 09:15 Action required sstrehlk:support-ov-models-via-genai
May 21, 2024 09:15 Action required
Draft - Support ov models via genai
Tasks Modified #2431: Pull request #1862 synchronize by sstrehlk
May 21, 2024 09:15 Action required sstrehlk:support-ov-models-via-genai
May 21, 2024 09:15 Action required
fixed incorrect check for task type (replace ~ with not) (#1865)
Tasks Modified #2430: Commit 00b7a61 pushed by lintangsutawika
May 21, 2024 09:00 1m 43s main
May 21, 2024 09:00 1m 43s
fixed incorrect check for task type (replace ~ with not) (#1865)
Unit Tests #2402: Commit 00b7a61 pushed by lintangsutawika
May 21, 2024 09:00 5m 32s main
May 21, 2024 09:00 5m 32s
Fix incorrect check for task type
Unit Tests #2401: Pull request #1865 opened by zafstojano
May 21, 2024 08:09 3m 57s zafstojano:fix-check-task-type
May 21, 2024 08:09 3m 57s
Fix incorrect check for task type
Tasks Modified #2429: Pull request #1865 opened by zafstojano
May 21, 2024 08:09 2m 14s zafstojano:fix-check-task-type
May 21, 2024 08:09 2m 14s
mmlu-pro for the Italian language
Tasks Modified #2428: Pull request #1860 synchronize by giux78
May 20, 2024 17:24 2m 9s giux78:mmlu-pro-ita-2
May 20, 2024 17:24 2m 9s
mmlu-pro for the Italian language
Unit Tests #2400: Pull request #1860 synchronize by giux78
May 20, 2024 17:24 6m 0s giux78:mmlu-pro-ita-2
May 20, 2024 17:24 6m 0s
mmlu-pro for the Italian language
Unit Tests #2399: Pull request #1860 synchronize by giux78
May 20, 2024 17:03 5m 15s giux78:mmlu-pro-ita-2
May 20, 2024 17:03 5m 15s
mmlu-pro for the Italian language
Tasks Modified #2427: Pull request #1860 synchronize by giux78
May 20, 2024 17:03 1m 38s giux78:mmlu-pro-ita-2
May 20, 2024 17:03 1m 38s
Fixing typos in docs
Tasks Modified #2426: Pull request #1863 opened by zafstojano
May 20, 2024 14:28 20s zafstojano:fix-docs-typos
May 20, 2024 14:28 20s