llamac2py

llamac2py is a Python package that provides a wrapper for running inference using the Llama-2 Transformer model. The package includes a C executable (run.c) from Karpathy's llama2.c that performs the inference, and the package allows easy inference for the same.

Note: On Windows, use build_msvc.bat in a Visual Studio Command Prompt to build with msvc, or you can use make win64 to use mingw compiler toolchain from linux or windows to build the windows target. MSVC build will automatically use openmp and max threads appropriate for your CPU unless you set OMP_NUM_THREADS env.

On Centos 7, Amazon Linux 2018 use rungnu Makefile target: make rungnu or make runompgnu to use openmp.

Get Started:

Clone the Repository: git clone https://github.com/adarshxs/llamac2py

cd into the Repository: cd llamac2py

download the Model (Will add support for more models):

wget https://huggingface.co/karpathy/tinyllamas/resolve/main/stories15M.bin

Compile the C file: make run

Then in a notebook or a Python script, run:

from llamac2py.wrapper import generate_short_story

# Load your Llama-2 model checkpoint (model.bin) here
checkpoint_file = 'stories15M.bin'

# Generate a short story with a prompt
prompt_text = "Once upon a time, in a faraway land,"
short_story = generate_short_story(prompt_text, checkpoint_file)
print(short_story)

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.github/workflows		.github/workflows
build/lib.win-amd64-cpython-311/llamac2py		build/lib.win-amd64-cpython-311/llamac2py
llamac2py		llamac2py
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
build_msvc.bat		build_msvc.bat
configurator.py		configurator.py
model.py		model.py
requirements.txt		requirements.txt
run.c		run.c
sample.py		sample.py
setup.py		setup.py
tinystories.py		tinystories.py
tokenizer.bin		tokenizer.bin
tokenizer.model		tokenizer.model
win.c		win.c
win.h		win.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llamac2py

llamac2py is a Python package that provides a wrapper for running inference using the Llama-2 Transformer model. The package includes a C executable (run.c) from Karpathy's llama2.c that performs the inference, and the package allows easy inference for the same.

Get Started:

About

Releases

Packages

Contributors 2

Languages

License

adarshxs/llamac2py

Folders and files

Latest commit

History

Repository files navigation

llamac2py

llamac2py is a Python package that provides a wrapper for running inference using the Llama-2 Transformer model. The package includes a C executable (run.c) from Karpathy's llama2.c that performs the inference, and the package allows easy inference for the same.

Get Started:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages