
Add CFG to vllm serving #517

Merged
merged 1 commit into from
Jan 12, 2024
Conversation

@mory91 (Contributor) commented on Jan 10, 2024

Hi,
This pull request adds support for CFG in vLLM serving.

@@ -122,6 +127,8 @@ async def stream_results() -> AsyncGenerator[bytes, None]:
# Sets default for the model (`facebook/opt-125m`)
engine = AsyncLLMEngine.from_engine_args(engine_args)

_adapt_tokenizer(engine.engine.tokenizer)
Member commented:

Why are you calling this function here? The result is not used.

@mory91 (Contributor, author) replied:

The tokenizer is changed inside the function anyway. I have now assigned the result back to the tokenizer, though.

Member replied:

Ah, that makes sense. It isn't needed, however, as vLLM handles tokenisation on its end during encoding/decoding.

@mory91 (Contributor, author) replied on Jan 12, 2024:

It's needed because of this line:

self.generation += self.tokenizer.decode([token_id])[0]

outlines expects the tokenizer to return a list, but vLLM tokenizers return a string. Also, I just realized that this is a breaking change to the library. If that is fine with the project maintainers, it's fine; if not, we may need a different approach. For example, we could call _adapt_tokenizer inside the __init__ methods of the logits processors.
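The list-vs-string mismatch described above can be sketched as a small in-place wrapper. This is a hypothetical illustration only, not the actual `_adapt_tokenizer` from the PR; the function and class names below are assumptions made for the example.

```python
# Hypothetical sketch of a decode-adapting wrapper. The real _adapt_tokenizer
# in outlines may differ; this only illustrates the list-vs-string mismatch.
def adapt_tokenizer(tokenizer):
    """Patch a tokenizer in place so decode() returns a list of strings,
    matching what the logits processors expect, and return the tokenizer."""
    original_decode = tokenizer.decode

    def decode(token_ids):
        # vLLM-style tokenizers return a plain string; wrap it in a list
        # so callers can index the result with [0].
        return [original_decode(token_ids)]

    tokenizer.decode = decode
    return tokenizer


class _DummyTokenizer:
    """Minimal stand-in for a vLLM tokenizer whose decode returns a string."""

    def decode(self, token_ids):
        return "".join(chr(97 + i) for i in token_ids)


tok = adapt_tokenizer(_DummyTokenizer())
print(tok.decode([0, 1, 2]))  # a one-element list containing the decoded string
```

Returning the patched tokenizer (rather than mutating it silently) also matches the change discussed above, where the result of the call is assigned back to the tokenizer.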

@rlouf (Member) commented on Jan 12, 2024

Thank you for your contributions! I added some documentation before merging.

@rlouf rlouf merged commit fde61a8 into outlines-dev:main Jan 12, 2024
4 checks passed
@lapp0 mentioned this pull request on Jan 13, 2024
Labels: structured generation (Linked to structured generation), vLLM (Things involving vLLM support)
Linked issue: Add CFG guided generation to vLLM integration
2 participants