-
-
Notifications
You must be signed in to change notification settings - Fork 749
Issues: OpenAccess-AI-Collective/axolotl
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
New Optimizer: Implement Adam-Mini optimizer
enhancement
New feature or request
#1720
opened Jun 30, 2024 by
SicariusSicariiStuff
5 tasks done
OOM Training 70B on 4x3090 24GB both FSDP or Deepspeed Zero3
bug
Something isn't working
#1717
opened Jun 27, 2024 by
Nero10578
6 of 8 tasks
Support loading a local hf dataset with New feature or request
load_dataset
enhancement
#1715
opened Jun 22, 2024 by
ccdv-ai
5 tasks done
Support BAdam Optimizer
enhancement
New feature or request
#1714
opened Jun 21, 2024 by
bratao
5 tasks done
I get ValueError when trying to run Axoltlt on pretraining dataset
#1713
opened Jun 21, 2024 by
Ahmedn1
Add Documentation on how to perform Multi-Node Finetuning in Slurm
#1709
opened Jun 14, 2024 by
juvi21
3 tasks done
Add a New feature or request
chat_template
strategy for DPO datasets
enhancement
#1708
opened Jun 14, 2024 by
fozziethebeat
5 tasks done
Support for Aya 23 models.
enhancement
New feature or request
#1707
opened Jun 13, 2024 by
SoshyHayami
5 tasks done
Zero loss and nan grad_norm when Flash Attention is enabled
bug
Something isn't working
#1706
opened Jun 13, 2024 by
fgdfgfthgr-fox
6 of 8 tasks
ValueError: Expected a cuda device, but got: cpu when using Deepspeed zero3
bug
Something isn't working
#1705
opened Jun 13, 2024 by
l3utterfly
6 of 8 tasks
Support for GLM Models
enhancement
New feature or request
#1701
opened Jun 10, 2024 by
ashmalvayani
5 tasks done
Llama3-8b: LlamaForCausalLM.forward() got an unexpected keyword argument 'length'
bug
Something isn't working
#1700
opened Jun 10, 2024 by
DMR92
6 of 8 tasks
Using native chat_template from tokenizer config in New feature or request
chat_template
strategy
enhancement
#1689
opened Jun 7, 2024 by
chiragjn
5 tasks done
DeepSpeed Zero3 is Incompatible with Freeze Range Code
bug
Something isn't working
#1687
opened Jun 6, 2024 by
josharian
7 of 8 tasks
Can't use Something isn't working
chat_template: phi_3
with type: sharegpt
bug
#1683
opened Jun 4, 2024 by
ccdv-ai
5 of 8 tasks
Adopt qlora-pipe approaches
enhancement
New feature or request
#1679
opened Jun 2, 2024 by
kallewoof
5 tasks done
Train FAILED. Crashed while training with SIGTERM
bug
Something isn't working
possibly_solved
#1670
opened May 29, 2024 by
RodriMora
6 of 8 tasks
Llama Reserved Tokens Initialization
enhancement
New feature or request
#1666
opened May 28, 2024 by
cinjon
5 tasks done
Pulling the image from Docker retuns with ''unauthorized: authentication required''
bug
Something isn't working
#1658
opened May 25, 2024 by
Fischherboot
6 of 8 tasks
Llama3 Lora training fails to output and save
bug
Something isn't working
#1650
opened May 23, 2024 by
austinm1120
6 of 8 tasks
dataset type sharegpt no longer works
bug
Something isn't working
#1649
opened May 22, 2024 by
thepowerfuldeez
6 of 8 tasks
DPO Prompt Strategies only support single-turn and will fail silently on multi-turn datasets
bug
Something isn't working
#1645
opened May 21, 2024 by
bjoernpl
6 of 8 tasks
Llama 3 & Mistral LoRA Examples Error (needs Something isn't working
eval_sample_packing: False
)
bug
#1644
opened May 21, 2024 by
VelocityRa
6 of 8 tasks
Llama 3 8b OOM with GaLore on 2x A100s (Mistral 7b is fine?)
bug
Something isn't working
possibly_solved
#1641
opened May 19, 2024 by
e-p-armstrong
6 of 8 tasks
Support RecurrentGemma
enhancement
New feature or request
#1637
opened May 17, 2024 by
julien-blanchon
5 tasks done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.