Skip to content

Actions: tatsu-lab/alpaca_eval

test format leaderboard

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
236 workflow runs
236 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add REBEL-Llama-3-8B-Instruct-Armo to AlpacaEval
test format leaderboard #239: Pull request #403 opened by ZhaolinGao
August 28, 2024 18:50 2m 4s ZhaolinGao:main
August 28, 2024 18:50 2m 4s
Add Shopee-SlimMoA-v1 to AlpacaEval
test format leaderboard #238: Pull request #398 synchronize by YannDubs
August 26, 2024 21:32 2m 3s LLM-Alignment-sh:main
August 26, 2024 21:32 2m 3s
Add blendaxai-gm-l6-vo31 to AlpacaEval
test format leaderboard #237: Pull request #399 opened by ym-blendax-ai
August 23, 2024 12:45 2m 7s Blendax-AI:main
August 23, 2024 12:45 2m 7s
Add Shopee-SlimMoA-v1 to AlpacaEval
test format leaderboard #236: Pull request #398 opened by LLM-Alignment-sh
August 23, 2024 11:38 2m 5s LLM-Alignment-sh:main
August 23, 2024 11:38 2m 5s
Added Llama3-PBM-Nova-70B model
test format leaderboard #235: Pull request #395 synchronize by PKU-Baichuan
August 23, 2024 06:09 2m 4s PKU-Baichuan:main
August 23, 2024 06:09 2m 4s
Add blendaxai-gm-l6-vo14 to AlpacaEval
test format leaderboard #233: Pull request #397 synchronize by ym-blendax-ai
August 22, 2024 20:11 1m 58s Blendax-AI:main
August 22, 2024 20:11 1m 58s
Add blendaxai-gm-l6-vo14 to AlpacaEval
test format leaderboard #232: Pull request #397 opened by ym-blendax-ai
August 22, 2024 20:05 2m 13s Blendax-AI:main
August 22, 2024 20:05 2m 13s
Added Llama3-PBM-Nova-70B model
test format leaderboard #231: Pull request #395 synchronize by PKU-Baichuan
August 21, 2024 06:59 2m 11s PKU-Baichuan:main
August 21, 2024 06:59 2m 11s
Added Llama3-PBM-Nova-70B model
test format leaderboard #230: Pull request #395 opened by PKU-Baichuan
August 19, 2024 13:10 2m 11s PKU-Baichuan:main
August 19, 2024 13:10 2m 11s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #229: Pull request #393 synchronize by YannDubs
August 17, 2024 22:48 2m 6s yann/models_rubriceval
August 17, 2024 22:48 2m 6s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #228: Pull request #393 opened by YannDubs
August 17, 2024 22:39 2m 8s yann/models_rubriceval
August 17, 2024 22:39 2m 8s
Add blendaxai-gm-l3-v35 to AlpacaEval
test format leaderboard #227: Pull request #389 synchronize by ym-blendax-ai
August 14, 2024 17:57 2m 8s Blendax-AI:main
August 14, 2024 17:57 2m 8s
Add blendaxai-gm-l3-v35 to AlpacaEval
test format leaderboard #226: Pull request #389 opened by ym-blendax-ai
August 14, 2024 15:32 2m 5s Blendax-AI:main
August 14, 2024 15:32 2m 5s
Change the name of the Infinity-Instruct-7M-0729-Models to Infinity-Instruct-7M-Gen-Models
test format leaderboard #224: Pull request #387 opened by cszhengyh
August 13, 2024 03:32 2m 12s cszhengyh:main
August 13, 2024 03:32 2m 12s
Change the name of the Infinity-Instruct-7M-0729-Models to Infinity-Instruct-7M-Gen-Models
test format leaderboard #223: Pull request #386 synchronize by cszhengyh
August 13, 2024 03:30 2m 6s cszhengyh:main
August 13, 2024 03:30 2m 6s
Change the name of the Infinity-Instruct-7M-0729-Models to Infinity-Instruct-7M-Gen-Models
test format leaderboard #222: Pull request #386 opened by cszhengyh
August 12, 2024 06:49 2m 1s cszhengyh:main
August 12, 2024 06:49 2m 1s
Add gemma-2-9b-it-WPO-HB to AlpacaEval
test format leaderboard #221: Pull request #384 opened by wzhouad
August 8, 2024 21:38 2m 21s wzhouad:main
August 8, 2024 21:38 2m 21s
[ENH] add llama 3.1
test format leaderboard #219: Pull request #378 opened by YannDubs
July 26, 2024 01:04 2m 6s yann/llama31
July 26, 2024 01:04 2m 6s
Add Llama-3-Instruct-8B-WPO-HB-v2 to AlpacaEval
test format leaderboard #218: Pull request #377 opened by wzhouad
July 24, 2024 20:27 1m 54s wzhouad:main
July 24, 2024 20:27 1m 54s
[ENH] add the code to compute instruction_following
test format leaderboard #217: Pull request #371 opened by YannDubs
July 18, 2024 16:24 1m 57s yann/instruction_difficulty
July 18, 2024 16:24 1m 57s
test format leaderboard
test format leaderboard #216: Manually run by YannDubs
July 18, 2024 12:37 1m 51s main
July 18, 2024 12:37 1m 51s
Added Ghost 8B Beta (d0x5) model
test format leaderboard #215: Pull request #366 synchronize by YannDubs
July 18, 2024 11:13 2m 6s lh0x00:AddedGhost8BBetaD0x5
July 18, 2024 11:13 2m 6s
Added Ghost 8B Beta (d0x5) model
test format leaderboard #214: Pull request #366 synchronize by YannDubs
July 18, 2024 11:12 1m 55s lh0x00:AddedGhost8BBetaD0x5
July 18, 2024 11:12 1m 55s
Added Ghost 8B Beta (d0x5) model
test format leaderboard #213: Pull request #366 synchronize by lh0x00
July 18, 2024 06:54 2m 3s lh0x00:AddedGhost8BBetaD0x5
July 18, 2024 06:54 2m 3s