
mlx library integration (via mlx-lm) #918

Closed
pngwn opened this issue May 24, 2024 · 3 comments · Fixed by #956

Comments

pngwn commented May 24, 2024

An additional library integration for mlx.

Context

mlx is an ML framework that supports high-performance inference on Apple silicon (amongst other things). It now has a rich ecosystem and a vibrant community of users.

Request

The mlx-lm Python library provides some simple utilities for text generation, and it would be great if there were an integration with outlines.

Extra detail

mlx-lm supports a logit_bias param in its top-level generate function.
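
For reference, a minimal sketch of what that looks like, assuming the logit_bias parameter mentioned above; the model name and token ids are just placeholders:

# minimal sketch of biased sampling via mlx-lm's top-level API
# (model repo and token ids are placeholders)
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.2-4bit")

# bias the logits of specific token ids, analogous to the OpenAI logit_bias param
output = generate(
    model,
    tokenizer,
    prompt="The answer is",
    logit_bias={0: -100.0, 1: 100.0},
)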

Related to #806

pngwn commented May 24, 2024

There is a separate library that integrates outlines with mlx. I thought I would post it here in case it is useful:

https://github.com/sacha-ichbiah/outlines-mlx

I have also pinged the author.

namin commented Jun 8, 2024

In the meantime, note that it's possible to use MLX via the OpenAI API. Of course, one is then limited to the choice and text generators, but I guess this is better than nothing.

This is what I did to get it working:

# see https://github.com/ml-explore/mlx-examples/pull/810 which must be merged
# tested as follows
# mlx_lm.server --model mlx-community/gemma-1.1-7b-it-4bit --port 11435 --chat-template=CHAT_TEMPLATE
# where CHAT_TEMPLATE is as in the tokenizer_config below

from openai import AsyncOpenAI
from outlines.models.openai import OpenAI, OpenAIConfig

from mlx_lm.tokenizer_utils import load_tokenizer
from pathlib import Path
model_path = Path("mlx-community/gemma-1.1-7b-it-4bit")
tokenizer_config = {"chat_template": "{{ bos_token }}{% set ns = namespace(extra_system='') %}{% for message in messages %}{% set role = message['role'] %}{% if (message['role'] == 'assistant') %}{% set role = 'model' %}{% endif %}{% if (role == 'system') %}{% set ns.extra_system = ns.extra_system + message['content'] %}{% else %}{% set message_system = '' %}{% if (role == 'user') %}{% if (ns.extra_system == '') %}{% else %}{% set message_system = 'System: ' + ns.extra_system + '\\n' %}{% set ns.extra_system = '' %}{% endif %}{% endif %}{{ '<start_of_turn>' + role + '\\n' + message_system + message['content'] | trim + '<end_of_turn>\\n' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{'<start_of_turn>model\\n'}}{% endif %}"}

base_url = "http://localhost:11435/v1"
api_key = "not_needed"

config = OpenAIConfig(model="openai/mlx-gemma")
client = AsyncOpenAI(
    base_url=base_url,
    api_key=api_key,
)
tokenizer = load_tokenizer(model_path, tokenizer_config)

model = OpenAI(client, config, tokenizer)
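
With that model object, the restricted set of generators can then be used as usual. A minimal sketch, assuming the setup above; the prompts and choices are just illustrative:

from outlines import generate

# choice-constrained generation through the OpenAI-compatible endpoint
classifier = generate.choice(model, ["positive", "negative"])
label = classifier("Classify the sentiment of: 'I love this library!'")

# free-form text generation also works
writer = generate.text(model)
answer = writer("Write a one-line haiku about Apple silicon.")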

rlouf commented Jun 9, 2024

You will be able to use every generator once #926 is merged.
