catch ConnectionError when checking dataset from HuggingFace #743

Merged
merged 1 commit into from
Oct 19, 2023

Conversation

Napuh
Contributor

@Napuh Napuh commented Oct 18, 2023

When using axolotl in HPC environments, which commonly have no internet access, the HF_DATASETS_OFFLINE environment variable needs to be set. With it set, data loading, even from a local JSON file, throws an uncaught error that prevents the training run from starting.

This change catches ConnectionError and handles it the same way as FileNotFoundError, so the code goes on to check for local files.
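As a rough illustration of the pattern (not the actual axolotl code), here is a minimal, standalone sketch. The names `resolve_dataset_source` and `hub_check` are hypothetical: `hub_check` stands in for whatever call probes the Hub and is assumed to raise on failure.

```python
from pathlib import Path
from typing import Callable


def resolve_dataset_source(path: str, hub_check: Callable[[str], None]) -> str:
    """Return "hub" if the Hub probe succeeds, "local" if the path exists on disk.

    `hub_check` is a hypothetical stand-in for the Hub lookup; it is assumed
    to raise FileNotFoundError or ConnectionError when the lookup fails.
    """
    try:
        hub_check(path)
        return "hub"
    except (FileNotFoundError, ConnectionError):
        # A ConnectionError (e.g. raised in offline mode) is handled like a
        # missing Hub dataset: fall through to the local-file check.
        pass

    if Path(path).exists():
        return "local"
    raise ValueError(f"dataset {path!r} not found on the Hub or on disk")


def offline_probe(path: str) -> None:
    # Simulates a Hub lookup attempted without network access.
    raise ConnectionError("offline mode is enabled")
```

With `offline_probe` as the probe, a path that exists on disk resolves to `"local"` instead of aborting, which is the behavior this PR is after.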

You can test the difference just by adding HF_DATASETS_OFFLINE=1 to the training command. The run must use a local JSON file.

@winglian winglian merged commit 992d57f into axolotl-ai-cloud:main Oct 19, 2023
4 checks passed