
Ensure models.llamacpp Doesn't Have Implicit max_tokens #996

Merged (1 commit, Jun 24, 2024)

Conversation

lapp0 (Collaborator) commented on Jun 21, 2024

Fixes #973

Problem

In #973 an invalid pattern was being generated, but it was only invalid because it was incomplete. In the `LlamaCpp.generate()` call, `llama_cpp_params["max_tokens"]` was unset, yet the model was still finishing with reason `"length"`:

[{'text': '{ "name": "Grace Lee", "age":32, "', 'index': 0, 'logprobs': None, 'finish_reason': 'length'}]                                                                                                         

It turns out `max_tokens` was being implicitly set to 30.
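
For reference, a minimal reproduction along the lines of #973 looks roughly like this (the schema, GGUF file, and prompt are illustrative, not taken verbatim from the original report):

```python
from pydantic import BaseModel

from outlines import generate, models


class Person(BaseModel):
    name: str
    age: int


# Illustrative model file; #973 used a different GGUF.
model = models.llamacpp("TheBloke/phi-2-GGUF", "phi-2.Q4_K_M.gguf")
generator = generate.json(model, Person)

# Without this fix, generation stops early with finish_reason == "length",
# producing truncated (and therefore invalid) JSON.
print(generator("Describe a person as JSON."))
```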

Solution

Allow 2**30 tokens if `max_tokens` is unset / `None`, because `llama_cpp_params` requires an int for the `max_tokens` field.
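
Sketched out, the guard in the llama.cpp generation path looks something like this (a paraphrase of the change, not the literal diff):

```python
# llama-cpp-python expects an int for max_tokens, so an unset value is
# replaced by a bound large enough to never truncate in practice.
if max_tokens is None:
    llama_cpp_params["max_tokens"] = 2**30  # effectively unlimited
else:
    llama_cpp_params["max_tokens"] = max_tokens
```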

@lapp0 lapp0 added the llama.cpp Related to the `llama.cpp` integration label Jun 21, 2024
@lapp0 lapp0 marked this pull request as ready for review June 22, 2024 22:14
@lapp0 lapp0 requested a review from rlouf June 22, 2024 22:15
@rlouf rlouf merged commit 42206e2 into outlines-dev:main Jun 24, 2024
7 checks passed
Development

Successfully merging this pull request may close these issues.

Invalid generate.json Output with models.llamacpp