Commit

Correct typos in datasets.py (#639)
felixonmars committed Sep 27, 2023
1 parent 895f0a0 commit d1236f2
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions src/axolotl/datasets.py
@@ -22,7 +22,7 @@ class TokenizedPromptDataset(Dataset):
  """
  Dataset that returns tokenized prompts from a stream of text files.
  Args:
- prompt_tokenizer (PromptTokenizingStrategy): The prompt tokenizing method for proccessing the data.
+ prompt_tokenizer (PromptTokenizingStrategy): The prompt tokenizing method for processing the data.
  dataset (dataset.Dataset): Dataset with text files.
  """

@@ -55,7 +55,7 @@ class ConstantLengthDataset(IterableDataset):
  """
  Iterable dataset that returns constant length chunks of tokens from stream of text files.
  Args:
- tokenizer (Tokenizer): The processor used for proccessing the data.
+ tokenizer (Tokenizer): The processor used for processing the data.
  dataset (dataset.Dataset): Dataset with text files.
  seq_length (int): Length of token sequences to return.
  """
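
For readers unfamiliar with what "constant length chunks of tokens" means in the docstring above, here is a minimal sketch of the general idea. It is not the code in `src/axolotl/datasets.py`: the function name `constant_length_chunks`, the `eos_token_id` separator, and the choice to drop the trailing partial chunk are all assumptions made for illustration.

```python
from typing import Iterable, Iterator, List


def constant_length_chunks(
    token_streams: Iterable[List[int]],
    seq_length: int,
    eos_token_id: int = 0,  # hypothetical separator id, not taken from the commit
) -> Iterator[List[int]]:
    """Yield fixed-size token chunks from a stream of tokenized examples."""
    buffer: List[int] = []
    for tokens in token_streams:
        # Append the example followed by a separator so examples stay distinguishable.
        buffer.extend(tokens)
        buffer.append(eos_token_id)
        # Emit as many full chunks as the buffer currently holds.
        while len(buffer) >= seq_length:
            yield buffer[:seq_length]
            buffer = buffer[seq_length:]
    # Assumption: any leftover tokens shorter than seq_length are dropped.


chunks = list(constant_length_chunks([[1, 2, 3], [4, 5], [6, 7, 8, 9]], seq_length=4))
```

Because the function is a generator, it can consume an unbounded stream without holding more than one example plus one chunk in memory, which is the usual reason to build such a dataset as an iterable rather than an indexed one.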
