Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add CMake flag for pipeline parallelism for multi-GPU #940

Conversation

Nexesenex
Copy link

@Nexesenex Nexesenex commented Jun 23, 2024

LCPP Default is set to 4, which is a bit too much in my opinion. Setting to 2 saves VRAM (0.5-1%?), some compute and some electricity if set to 2, at the expense of some potential performance (prompt processing?), that I do not notice in usage. 2 is thus my own setting.

ggerganov#6017

LCPP Default is set to 4, which is a bit too much in my opinion.
Saves VRAM (0.5-1%?), some compute and some electricity if set to 2, at the expense of some potential performance (prompt processing?), that I do not notice in usage. 2 is thus my own setting.
@LostRuins LostRuins merged commit dd5cda0 into LostRuins:concedo_experimental Jun 25, 2024
@Nexesenex Nexesenex deleted the pipeline_parallelism_setting branch June 25, 2024 11:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants