Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
5,699 workflow runs
5,699 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

separate category for global_mmlu (#2652)
Unit Tests #4096: Commit 5c006ed pushed by baberabb
January 24, 2025 16:00 7m 28s main
January 24, 2025 16:00 7m 28s
separate category for global_mmlu (#2652)
Tasks Modified #4124: Commit 5c006ed pushed by baberabb
January 24, 2025 16:00 1h 18m 54s main
January 24, 2025 16:00 1h 18m 54s
Add loncxt tasks
Unit Tests #4094: Pull request #2629 synchronize by baberabb
January 23, 2025 18:34 7m 28s longcxt
January 23, 2025 18:34 7m 28s
Add loncxt tasks
Tasks Modified #4122: Pull request #2629 synchronize by baberabb
January 23, 2025 18:34 1m 43s longcxt
January 23, 2025 18:34 1m 43s
fix multiple input chat tempalte
Tasks Modified #4121: Pull request #2576 synchronize by baberabb
January 23, 2025 15:59 1m 40s multiple_input
January 23, 2025 15:59 1m 40s
fix multiple input chat tempalte
Unit Tests #4093: Pull request #2576 synchronize by baberabb
January 23, 2025 15:59 7m 37s multiple_input
January 23, 2025 15:59 7m 37s
Add Moral Stories
Tasks Modified #4120: Pull request #2653 opened by upunaprosk
January 23, 2025 14:31 1m 46s upunaprosk:moral_stories
January 23, 2025 14:31 1m 46s
Add Moral Stories
Unit Tests #4092: Pull request #2653 opened by upunaprosk
January 23, 2025 14:31 7m 19s upunaprosk:moral_stories
January 23, 2025 14:31 7m 19s
Easily evaluate models steered by SAEs
Unit Tests #4091: Pull request #2641 synchronize by AMindToThink
January 23, 2025 03:50 Action required AMindToThink:sae_steered
January 23, 2025 03:50 Action required
Easily evaluate models steered by SAEs
Tasks Modified #4119: Pull request #2641 synchronize by AMindToThink
January 23, 2025 03:50 Action required AMindToThink:sae_steered
January 23, 2025 03:50 Action required
separate category for global_mmlu
Tasks Modified #4118: Pull request #2652 opened by bzantium
January 23, 2025 02:06 2h 0m 24s feature/#2649
January 23, 2025 02:06 2h 0m 24s
separate category for global_mmlu
Unit Tests #4090: Pull request #2652 opened by bzantium
January 23, 2025 02:06 7m 5s feature/#2649
January 23, 2025 02:06 7m 5s
Add loncxt tasks
Unit Tests #4089: Pull request #2629 synchronize by baberabb
January 23, 2025 00:53 6m 56s longcxt
January 23, 2025 00:53 6m 56s
Add loncxt tasks
Tasks Modified #4117: Pull request #2629 synchronize by baberabb
January 23, 2025 00:53 1m 53s longcxt
January 23, 2025 00:53 1m 53s
Add loncxt tasks
Tasks Modified #4116: Pull request #2629 synchronize by baberabb
January 22, 2025 23:03 1m 32s longcxt
January 22, 2025 23:03 1m 32s
Add loncxt tasks
Unit Tests #4088: Pull request #2629 synchronize by baberabb
January 22, 2025 23:03 6m 55s longcxt
January 22, 2025 23:03 6m 55s
Add loncxt tasks
Unit Tests #4087: Pull request #2629 synchronize by baberabb
January 22, 2025 22:44 6m 40s longcxt
January 22, 2025 22:44 6m 40s
Add loncxt tasks
Tasks Modified #4115: Pull request #2629 synchronize by baberabb
January 22, 2025 22:44 1m 51s longcxt
January 22, 2025 22:44 1m 51s
Add loncxt tasks
Unit Tests #4086: Pull request #2629 synchronize by baberabb
January 22, 2025 22:25 6m 52s longcxt
January 22, 2025 22:25 6m 52s
Add loncxt tasks
Tasks Modified #4114: Pull request #2629 synchronize by baberabb
January 22, 2025 22:25 1m 43s longcxt
January 22, 2025 22:25 1m 43s
add TransformerLens example
Tasks Modified #4113: Pull request #2651 opened by nickypro
January 22, 2025 17:55 14s nickypro:patch-1
January 22, 2025 17:55 14s
add TransformerLens example
Unit Tests #4085: Pull request #2651 opened by nickypro
January 22, 2025 17:55 7m 16s nickypro:patch-1
January 22, 2025 17:55 7m 16s
humaneval instruct
Unit Tests #4084: Pull request #2650 opened by baberabb
January 22, 2025 16:49 7m 2s humaneval_instruct
January 22, 2025 16:49 7m 2s
humaneval instruct
Tasks Modified #4112: Pull request #2650 opened by baberabb
January 22, 2025 16:49 1m 57s humaneval_instruct
January 22, 2025 16:49 1m 57s