Fix vLLM integration #711

saattrupdan · 2024-02-27T16:58:31Z

When integrating Outlines with vLLM I faced the following issues, which are fixed in this PR:

When calling vllm.LLM.generate then within the internals of vLLM a copy.deepcopy of the vLLM SamplingParams is made, which includes the logits processor from Outlines (RegexLogitsProcessor, say). This requires everything to be pickleable, and the RegexLogitsProcessor.fsm.vocabulary is a dict_values object, which doesn't satisfy that. The fix is easy: just convert it to a list. This doesn't affect how this vocabulary variable is being used in the code.
The RegexLogitsProcessor takes an llm argument, which the docstring states should be a vllm.LLM object, but then attempts to extract the underlying tokenizer via llm.tokenizer.tokenizer. The tokenizer of vllm.LLM currently lies in the vllm.LLM.llm_engine.tokenizer.tokenizer attribute, but this is a big mess and isn't backwards compatible with previous vLLM versions. Instead, they have a convenience method, vllm.LLM.get_tokenizer, which fetches the tokenizer. To remain backwards compatibility, in case people have supplied vllm.LLM.llm_engine directly into RegexLogitsProcessor, it falls back to a tokenizer or tokenizer.tokenizer attribute.

I also updated the vLLM example script, as that was outdated as well (used the previous _patched_apply_logits_processors).

Closes #704

rlouf · 2024-02-27T19:45:55Z

Thank you for taking the time to submit a PR. This looks great!

saattrupdan added 5 commits February 27, 2024 17:31

fix: Ensure that RegexFSM.vocabulary is pickleable

e36833b

fix: Get tokenizer properly in vLLM integration

e29afaf

fix: Update vLLM integration example

aeb9470

chore: Typo in file name: "transfomers"

e840f5b

chore: Ignore virtual environment

94034f5

saattrupdan mentioned this pull request Feb 27, 2024

Compatibility issue with vllm 0.32 #704

Closed

fix: Wrap other vocabulary in list()

7a0f557

rlouf merged commit d938678 into outlines-dev:main Feb 27, 2024
5 checks passed

saattrupdan deleted the fix/vllm-integration branch February 27, 2024 22:02

Provide feedback