-
Notifications
You must be signed in to change notification settings - Fork 176
Issues: casper-hansen/AutoAWQ
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
error when quantizing my finetuned 405b model using autoawq
#571
opened Aug 5, 2024 by
Atomheart-Father
request: update prereq list to show supported python versions
#569
opened Aug 3, 2024 by
AartBluestoke
ImportError: cannot import name 'initialize_tasks' from 'lm_eval.tasks'
#565
opened Aug 1, 2024 by
kunzeng-ch
Memory-efficient quantization: Load and quantize layer by layer
#561
opened Jul 30, 2024 by
casper-hansen
Quantitative model report wrong, RuntimeError: Expected all tensors to be on the same device
#558
opened Jul 28, 2024 by
ShelterWFF
CUDA error: no kernel image is available for execution on the device
#557
opened Jul 25, 2024 by
AragornHorse
awq quantization is not fully optimized yet. The speed can be slower than non-quantized models
#545
opened Jul 22, 2024 by
jackNhat
Calibration Dataset: how to avoid computing loss on instructions?
#525
opened Jun 28, 2024 by
RanchiZhao
Same AWQ model behaves differently on two similar machines
#518
opened Jun 22, 2024 by
Alf-Z-SymphoMe
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.