Use `outlines.processors` for `models.llamacpp` #997

lapp0 · 2024-06-21T15:07:40Z

Fixes #965

Problem

in main

generate.fsm only supports SequenceGenerator,
models.llamacpp doesn't support SequenceGenerator, it only supports SequenceGeneratorAdapter
integrations.llamacpp doesn't have a FSM logits processor.

Ensure models.llamacpp uses outlines.processors / SequenceGeneratorAdapter for all generators
Update generate.fsm to use SequenceGeneratorAdapter for all unified models
test_generate.py tests for generate.fsm on MLXLM, llamacpp, and transformers (the three models using outlines.processors)
- fix critical generate.fsm bug discovered through this test in guide.py impacting all models

lapp0 mentioned this pull request Jun 21, 2024

LlamaCpp doesnt work with generate.fsm for custom FSMs #965

Closed

rlouf assigned lapp0 Jun 22, 2024

lapp0 force-pushed the fix-llamacpp-fsm branch 2 times, most recently from 5c7c546 to 6d3b51d Compare June 30, 2024 22:06

Use LogitsProcessors for models.transformers -> outlines.generate.*

cdd49e5

lapp0 force-pushed the fix-llamacpp-fsm branch 2 times, most recently from 3b4f1e6 to 7b8ab97 Compare July 15, 2024 08:10

enable generate.fsm with llamacpp by using outlines.processors

4ead465

lapp0 force-pushed the fix-llamacpp-fsm branch from 7b8ab97 to 4ead465 Compare July 15, 2024 08:48

lapp0 marked this pull request as ready for review July 15, 2024 08:55

rlouf merged commit 5a7f082 into dottxt-ai:main Jul 15, 2024
7 checks passed