Source for v2 (mobile inference engine) #194
Comments
Please release PowerInfer-2 so that it can be tested on low-resource PCs (like llama.cpp) for a comparison.
PowerInfer-2 will be open-sourced in the future. We're refining it to untangle it from our testing platform and to make it accessible on PCs for the community.
Can't wait to test your amazing work!
Same here! I wish to test it on a low-resource PC with no GPU, or an old and small one.
This is fantastic! The Meta-Llama-3-8B-Instruct-Q4_K_M.gguf model ran on my old smartphone with 6 GB of memory. I hope for v2 in the near future.
When can I use it on an Android phone?
Is it possible for you to release the testing platform and the code entangled with it, so that the reported results can be reproduced?
Hello there!
I came across the v2 paper yesterday, and saw the updates on the project readme.
I am interested in porting the v2 framework to iOS. The goal is to complete a naive port first, and then add Metal shaders.
Any plans on releasing the source and instructions for running v2 on Android?