Skip to content

Commit

Permalink
fix: revert local dir dataset load (axolotl-ai-cloud#878)
Browse files Browse the repository at this point in the history
  • Loading branch information
NanoCode012 committed Nov 18, 2023
1 parent b1bcd31 commit 684003e
Showing 1 changed file with 8 additions and 1 deletion.
9 changes: 8 additions & 1 deletion src/axolotl/utils/data.py
Original file line number Diff line number Diff line change
Expand Up @@ -242,7 +242,14 @@ def for_d_in_datasets(dataset_configs):
local_path = Path(config_dataset.path)
if local_path.exists():
if local_path.is_dir():
ds = load_from_disk(config_dataset.path)
# TODO dirs with arrow or parquet files could be loaded with `load_from_disk`
ds = load_dataset(
config_dataset.path,
name=config_dataset.name,
data_files=config_dataset.data_files,
streaming=False,
split=None,
)
elif local_path.is_file():
ds_type = get_ds_type(config_dataset)

Expand Down

0 comments on commit 684003e

Please sign in to comment.