
Add Beam Search sampler #618

Merged: 3 commits merged into main from beam-search-implementation on Feb 9, 2024
Conversation

@rlouf (Member) commented Feb 6, 2024

Closes #258.

In order to implement Beam Search I had to make a few changes to the samplers. They now:

  • Update the sequences' weights (log-probabilities)
  • Return each new token_id's ancestor, i.e. the sequence to which it needs to be appended. While this is trivial for greedy and multinomial sampling, it is what will allow us to update "beams" in beam search (see the sketch below).
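
To make the new interface concrete, here is a minimal sketch of what a multinomial sampler with these responsibilities could look like. The function name, argument names, and tensor shapes are illustrative assumptions, not the actual outlines code:

import torch

def multinomial_sampler(logits, weights, rng):
    # Hypothetical sketch: `logits` has shape (n_seqs, vocab_size) and
    # `weights` holds each sequence's cumulative log-probability.
    logprobs = torch.log_softmax(logits, dim=-1)
    next_token_ids = torch.multinomial(
        torch.exp(logprobs), num_samples=1, generator=rng
    )
    # Update each sequence's weight with the sampled token's log-probability.
    weights = weights + torch.gather(logprobs, 1, next_token_ids).squeeze(-1)
    # For greedy and multinomial sampling each new token simply extends the
    # sequence it came from, so the ancestors are just 0..n_seqs-1.
    ancestors = torch.arange(logits.shape[0])
    return next_token_ids, ancestors, weights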

token_ids, attention_masks, kv_cache, fsm and fsm_states are now updated using this ancestor information (a rough sketch follows). GenerationState contains the sequences' weights and ancestors so we can inspect the sampling process.
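
As a rough illustration of how that ancestor information could be used, the helper below gathers each piece of state along the batch dimension before appending the new tokens. The helper name and the per-layer (keys, values) cache layout are assumptions made for the example, not the code in this PR:

import torch

def reorder_state(token_ids, attention_masks, kv_cache, ancestors, next_token_ids):
    # Hypothetical helper: every sequence continues from its ancestor's state.
    token_ids = torch.cat([token_ids[ancestors], next_token_ids], dim=-1)
    attention_masks = torch.cat(
        [attention_masks[ancestors], torch.ones_like(next_token_ids)], dim=-1
    )
    # Assumes a per-layer (keys, values) cache with the batch dimension first.
    kv_cache = tuple(
        (keys[ancestors], values[ancestors]) for keys, values in kv_cache
    )
    return token_ids, attention_masks, kv_cache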

I also simplified the get_generated_token_ids method of SequenceGenerator. We should soon do a cleaning pass on this class: remove deprecated init arguments, turn some methods into independent functions, and test them.

@rlouf added the enhancement and transformers (linked to the `transformers` integration) labels on Feb 6, 2024
@rlouf force-pushed the beam-search-implementation branch 10 times, most recently from 137740a to a06f4b5, on February 8, 2024 at 12:32
@rlouf marked this pull request as ready for review on February 8, 2024 at 13:29
@lapp0 (Collaborator) left a comment
Included a few questions and a documentation fix.

Nice to see how straightforward a sampler implementation can be when accompanied by a well-designed SequenceGenerator.

from outlines import models, generate, samplers


model = models.transformers("mistralai/Mistral-7B-0.1")
Collaborator:

Typo: it should be mistralai/Mistral-7B-v0.1.

However, I think we should recommend mistralai/Mistral-7B-Instruct-v0.2 instead.

Review threads (resolved):
  • docs/reference/samplers.md
  • outlines/generate/api.py
  • outlines/generate/generator.py (outdated)
  • outlines/generate/generator.py
  • outlines/generate/generator.py
  • outlines/samplers.py
@rlouf (Member, Author) commented Feb 9, 2024

I overlooked something: some Beam Search implementations clone each beam K times and then down-sample them to preserve some sample diversity; see this implementation, for instance. This can be done in another PR.
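
For reference, one possible reading of the "clone then down-sample" variant is sketched below. This is an assumption about the general idea, not the referenced implementation: each beam samples several candidate continuations stochastically, and the candidate pool is then reduced back to the beam width.

import torch

def stochastic_beam_step(logits, weights, beam_width, rng):
    # Hypothetical sketch: `logits` has shape (n_beams, vocab_size) and
    # `weights` holds each beam's cumulative log-probability.
    logprobs = torch.log_softmax(logits, dim=-1)
    # "Clone": every beam samples `beam_width` candidate tokens.
    sampled = torch.multinomial(torch.exp(logprobs), beam_width, generator=rng)
    candidate_weights = (
        weights.unsqueeze(-1) + torch.gather(logprobs, 1, sampled)
    ).reshape(-1)
    # "Down-sample": keep only the `beam_width` best candidates overall.
    kept_weights, kept = torch.topk(candidate_weights, beam_width)
    ancestors = kept // beam_width
    next_token_ids = sampled.reshape(-1)[kept].unsqueeze(-1)
    return next_token_ids, ancestors, kept_weights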

@rlouf force-pushed the beam-search-implementation branch 2 times, most recently from d23d3fd to c269c0b, on February 9, 2024 at 17:09
@rlouf merged commit 94e2a38 into main on Feb 9, 2024
5 checks passed
@rlouf deleted the beam-search-implementation branch on February 9, 2024 at 17:37