Skip to content

Actions: bigcode-project/bigcode-evaluation-harness

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
222 workflow runs
222 workflow runs

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code
CI #622: Pull request #260 opened by q-rz
July 22, 2024 06:42 Action required q-rz:main
July 22, 2024 06:42 Action required
Fix Max New Tokens in HF's Generation Config
CI #617: Pull request #257 opened by mostafaelhoushi
July 18, 2024 19:42 Action required mostafaelhoushi:patch-1
July 18, 2024 19:42 Action required
Merge pull request #255 from bigcode-project/arjunguha-patch-1
CI #616: Commit 0f3e95f pushed by loubnabnl
July 14, 2024 18:07 2m 55s main
July 14, 2024 18:07 2m 55s
Merge pull request #254 from bigcode-project/MultiPL-Ev3
CI #615: Commit a83b1ca pushed by loubnabnl
July 14, 2024 18:07 2m 55s main
July 14, 2024 18:07 2m 55s
[draft]save prompts and tests passed/failed
CI #614: Pull request #253 synchronize by kbmlcoding
July 12, 2024 21:16 Action required kbmlcoding:mk-eval-changes
July 12, 2024 21:16 Action required
[draft]save prompts and tests passed/failed
CI #613: Pull request #253 synchronize by kbmlcoding
July 12, 2024 21:15 Action required kbmlcoding:mk-eval-changes
July 12, 2024 21:15 Action required
[draft]save prompts and tests passed/failed
CI #612: Pull request #253 synchronize by kbmlcoding
July 12, 2024 20:43 Action required kbmlcoding:mk-eval-changes
July 12, 2024 20:43 Action required
Update Dockerfile-multiple
CI #611: Pull request #255 opened by arjunguha
July 12, 2024 14:02 2m 54s arjunguha-patch-1
July 12, 2024 14:02 2m 54s
Update MultiPL-E to v3 prompts
CI #610: Pull request #254 opened by arjunguha
July 12, 2024 13:36 2m 22s MultiPL-Ev3
July 12, 2024 13:36 2m 22s
Merge pull request #244 from meher-m/transformers_fix
CI #577: Commit 334efb7 pushed by loubnabnl
June 24, 2024 09:22 2m 24s main
June 24, 2024 09:22 2m 24s
June 21, 2024 09:33 2m 16s
Merge pull request #238 from Elfsong/mercury
CI #569: Commit f0f2b52 pushed by loubnabnl
May 29, 2024 22:30 2m 36s main
May 29, 2024 22:30 2m 36s
Add a new dataset Mercury
CI #568: Pull request #238 synchronize by Elfsong
May 28, 2024 16:31 2m 18s Elfsong:mercury
May 28, 2024 16:31 2m 18s
May 5, 2024 22:31 2m 43s
Add StudentEval from LLM4Code 2024
CI #562: Commit a1b4a79 pushed by arjunguha
April 23, 2024 18:09 4m 22s main
April 23, 2024 18:09 4m 22s
Add StudentEval from LLM4Code 2024
CI #561: Pull request #229 opened by arjunguha
April 23, 2024 11:10 3m 30s arjunguha:main
April 23, 2024 11:10 3m 30s
Merge pull request #223 from ganler/evalplus-maintain
CI #559: Commit 1b0147c pushed by loubnabnl
April 19, 2024 21:47 2m 24s main
April 19, 2024 21:47 2m 24s
refactor(evalplus): maintain mbpp+ v0.2.0
CI #558: Pull request #223 opened by ganler
April 19, 2024 18:20 9m 33s ganler:evalplus-maintain
April 19, 2024 18:20 9m 33s
Merge pull request #219 from bigcode-project/loubnabnl-patch-9
CI #555: Commit 642c57f pushed by loubnabnl
April 16, 2024 20:06 2m 37s main
April 16, 2024 20:06 2m 37s
Add instruct models prompts
CI #554: Pull request #219 opened by loubnabnl
April 16, 2024 20:06 9m 36s loubnabnl-patch-9
April 16, 2024 20:06 9m 36s
Merge pull request #208 from bigcode-project/aurora-prompt
CI #550: Commit 094c7cc pushed by Muennighoff
March 27, 2024 15:54 2m 30s main
March 27, 2024 15:54 2m 30s