Actions: allenai/reward-bench

Showing runs from all workflows
824 workflow runs
Add ArmoRM to RewardBench (#135)
Tests #415: Commit 0851402 pushed by natolambert
May 24, 2024 20:11 3m 11s main
Add ArmoRM to RewardBench (#135)
Quality #415: Commit 0851402 pushed by natolambert
May 24, 2024 20:11 2m 42s main
Add ArmoRM to RewardBench
Tests #414: Pull request #135 synchronize by Haoxiang-Wang
May 24, 2024 17:48 4m 17s Haoxiang-Wang:ArmoRM
Add ArmoRM to RewardBench
Quality #414: Pull request #135 synchronize by Haoxiang-Wang
May 24, 2024 17:48 3m 5s Haoxiang-Wang:ArmoRM
Add ArmoRM to RewardBench
Tests #413: Pull request #135 opened by Haoxiang-Wang
May 24, 2024 03:43 3m 11s Haoxiang-Wang:ArmoRM
Add ArmoRM to RewardBench
Quality #413: Pull request #135 opened by Haoxiang-Wang
May 24, 2024 03:43 2m 32s Haoxiang-Wang:ArmoRM
Fixes to analysis scripts (#131)
Tests #412: Commit 2eacee8 pushed by natolambert
May 22, 2024 18:36 3m 36s main
Fixes to analysis scripts (#131)
Quality #412: Commit 2eacee8 pushed by natolambert
May 22, 2024 18:36 2m 28s main
Gemini prompt for llm-as-a-judge
Tests #411: Pull request #133 synchronize by natolambert
May 22, 2024 18:33 5m 32s gemini
Gemini prompt for llm-as-a-judge
Quality #411: Pull request #133 synchronize by natolambert
May 22, 2024 18:33 2m 31s gemini
Gemini prompt for llm-as-a-judge
Quality #410: Pull request #133 opened by natolambert
May 22, 2024 17:55 2m 32s gemini
Gemini prompt for llm-as-a-judge
Tests #410: Pull request #133 opened by natolambert
May 22, 2024 17:55 3m 20s gemini
Logging some new models
Quality #409: Pull request #132 opened by natolambert
May 20, 2024 17:15 6m 9s dpo_models
Logging some new models
Tests #409: Pull request #132 opened by natolambert
May 20, 2024 17:15 4m 14s dpo_models
Fixes to analysis scripts
Tests #408: Pull request #131 synchronize by natolambert
May 20, 2024 17:05 5m 48s improve_analysis
Fixes to analysis scripts
Quality #408: Pull request #131 synchronize by natolambert
May 20, 2024 17:05 2m 43s improve_analysis
Fixes to analysis scripts
Tests #407: Pull request #131 synchronize by natolambert
May 20, 2024 00:14 4m 11s improve_analysis
Fixes to analysis scripts
Quality #407: Pull request #131 synchronize by natolambert
May 20, 2024 00:14 2m 40s improve_analysis
Fixes to analysis scripts
Quality #406: Pull request #131 opened by natolambert
May 20, 2024 00:11 2m 32s improve_analysis
Fixes to analysis scripts
Tests #406: Pull request #131 opened by natolambert
May 20, 2024 00:11 4m 21s improve_analysis
Mixed bag of fixes / updates (#129)
Tests #405: Commit f87a336 pushed by natolambert
May 19, 2024 22:08 3m 27s main
Mixed bag of fixes / updates (#129)
Quality #405: Commit f87a336 pushed by natolambert
May 19, 2024 22:08 2m 32s main
Mixed bag of fixes / updates
Quality #404: Pull request #129 synchronize by natolambert
May 19, 2024 05:39 3m 32s nits05
Mixed bag of fixes / updates
Tests #404: Pull request #129 synchronize by natolambert
May 19, 2024 05:39 4m 10s nits05
Mixed bag of fixes / updates
Quality #403: Pull request #129 synchronize by natolambert
May 19, 2024 04:53 2m 32s nits05