Skip to content

Issues: OpenAccess-AI-Collective/axolotl

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

New Optimizer: Implement Adam-Mini optimizer enhancement New feature or request
#1720 opened Jun 30, 2024 by SicariusSicariiStuff
5 tasks done
OOM Training 70B on 4x3090 24GB both FSDP or Deepspeed Zero3 bug Something isn't working
#1717 opened Jun 27, 2024 by Nero10578
6 of 8 tasks
Support loading a local hf dataset with load_dataset enhancement New feature or request
#1715 opened Jun 22, 2024 by ccdv-ai
5 tasks done
Support BAdam Optimizer enhancement New feature or request
#1714 opened Jun 21, 2024 by bratao
5 tasks done
Add a chat_template strategy for DPO datasets enhancement New feature or request
#1708 opened Jun 14, 2024 by fozziethebeat
5 tasks done
Support for Aya 23 models. enhancement New feature or request
#1707 opened Jun 13, 2024 by SoshyHayami
5 tasks done
Zero loss and nan grad_norm when Flash Attention is enabled bug Something isn't working
#1706 opened Jun 13, 2024 by fgdfgfthgr-fox
6 of 8 tasks
ValueError: Expected a cuda device, but got: cpu when using Deepspeed zero3 bug Something isn't working
#1705 opened Jun 13, 2024 by l3utterfly
6 of 8 tasks
Support for GLM Models enhancement New feature or request
#1701 opened Jun 10, 2024 by ashmalvayani
5 tasks done
Llama3-8b: LlamaForCausalLM.forward() got an unexpected keyword argument 'length' bug Something isn't working
#1700 opened Jun 10, 2024 by DMR92
6 of 8 tasks
DeepSpeed Zero3 is Incompatible with Freeze Range Code bug Something isn't working
#1687 opened Jun 6, 2024 by josharian
7 of 8 tasks
Can't use chat_template: phi_3 with type: sharegpt bug Something isn't working
#1683 opened Jun 4, 2024 by ccdv-ai
5 of 8 tasks
Adopt qlora-pipe approaches enhancement New feature or request
#1679 opened Jun 2, 2024 by kallewoof
5 tasks done
Train FAILED. Crashed while training with SIGTERM bug Something isn't working possibly_solved
#1670 opened May 29, 2024 by RodriMora
6 of 8 tasks
Llama Reserved Tokens Initialization enhancement New feature or request
#1666 opened May 28, 2024 by cinjon
5 tasks done
Pulling the image from Docker retuns with ''unauthorized: authentication required'' bug Something isn't working
#1658 opened May 25, 2024 by Fischherboot
6 of 8 tasks
Llama3 Lora training fails to output and save bug Something isn't working
#1650 opened May 23, 2024 by austinm1120
6 of 8 tasks
dataset type sharegpt no longer works bug Something isn't working
#1649 opened May 22, 2024 by thepowerfuldeez
6 of 8 tasks
Llama 3 & Mistral LoRA Examples Error (needs eval_sample_packing: False) bug Something isn't working
#1644 opened May 21, 2024 by VelocityRa
6 of 8 tasks
Llama 3 8b OOM with GaLore on 2x A100s (Mistral 7b is fine?) bug Something isn't working possibly_solved
#1641 opened May 19, 2024 by e-p-armstrong
6 of 8 tasks
Support RecurrentGemma enhancement New feature or request
#1637 opened May 17, 2024 by julien-blanchon
5 tasks done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.