Actions: EleutherAI/lm-evaluation-harness

Unit Tests

2,844 workflow runs

fix gen_prefix
Unit Tests #4022: Pull request #2630 opened by baberabb
January 17, 2025 19:14 5m 56s prefixfix
Add loncxt tasks
Unit Tests #4021: Pull request #2629 synchronize by baberabb
January 17, 2025 18:40 6m 23s longcxt
Add loncxt tasks
Unit Tests #4020: Pull request #2629 synchronize by baberabb
January 17, 2025 18:36 6m 0s longcxt
Add loncxt tasks
Unit Tests #4019: Pull request #2629 synchronize by baberabb
January 17, 2025 17:18 5m 46s longcxt
Add loncxt tasks
Unit Tests #4018: Pull request #2629 synchronize by baberabb
January 17, 2025 17:17 5m 50s longcxt
Add loncxt tasks
Unit Tests #4017: Pull request #2629 synchronize by baberabb
January 17, 2025 17:16 5m 51s longcxt
Add loncxt tasks
Unit Tests #4016: Pull request #2629 synchronize by baberabb
January 17, 2025 16:58 6m 37s longcxt
Add loncxt tasks
Unit Tests #4015: Pull request #2629 synchronize by baberabb
January 17, 2025 14:57 3m 47s longcxt
Add loncxt tasks
Unit Tests #4014: Pull request #2629 synchronize by baberabb
January 17, 2025 14:52 2m 8s longcxt
Add loncxt tasks
Unit Tests #4013: Pull request #2629 synchronize by baberabb
January 17, 2025 14:49 2m 15s longcxt
Add loncxt tasks
Unit Tests #4012: Pull request #2629 opened by baberabb
January 17, 2025 14:18 3m 3s longcxt
Added small fix to split by eos_token_id before decoding
Unit Tests #4011: Pull request #2512 synchronize by EtashGuha
January 16, 2025 21:40 Action required EtashGuha:etashg/tokenize_fix
add hrm8k benchmark for both Korean and English
Unit Tests #4010: Pull request #2627 synchronize by bzantium
January 16, 2025 08:41 6m 27s feature/#2623
Mathvista
Unit Tests #4009: Pull request #2321 synchronize by baberabb
January 16, 2025 02:05 5m 35s mathvista
Mathvista
Unit Tests #4008: Pull request #2321 synchronize by baberabb
January 16, 2025 01:51 5m 28s mathvista
Mathvista
Unit Tests #4007: Pull request #2321 synchronize by baberabb
January 16, 2025 01:47 4m 56s mathvista
Mathvista
Unit Tests #4006: Pull request #2321 synchronize by baberabb
January 16, 2025 01:25 5m 30s mathvista
Mathvista
Unit Tests #4005: Pull request #2321 synchronize by baberabb
January 16, 2025 00:41 5m 11s mathvista
assistant prefill (#2615)
Unit Tests #4004: Commit 703fbff pushed by baberabb
January 15, 2025 23:09 5m 50s main
assistant prefill
Unit Tests #4003: Pull request #2615 synchronize by baberabb
January 15, 2025 23:07 6m 5s prefix
assistant prefill
Unit Tests #4002: Pull request #2615 synchronize by baberabb
January 15, 2025 23:01 5m 50s prefix
assistant prefill
Unit Tests #4001: Pull request #2615 synchronize by baberabb
January 15, 2025 21:28 6m 0s prefix
assistant prefill
Unit Tests #4000: Pull request #2615 synchronize by baberabb
January 15, 2025 21:18 6m 21s prefix
assistant prefill
Unit Tests #3999: Pull request #2615 synchronize by baberabb
January 15, 2025 21:15 6m 14s prefix
Add MLQA (#2622)
Unit Tests #3998: Commit e86cece pushed by baberabb
January 15, 2025 21:14 6m 0s main