
Bump version to 0.7.0 #1063

Merged · 13 commits · Mar 26, 2024
Conversation

@irenedea (Contributor) commented Mar 26, 2024

Bumps version to 0.7.0 and removes deprecated features.

Removals

  • Remove triton
  • Remove prefixlm
  • Remove llama attention patch
  • Remove z-loss
  • Remove text denoising

Regression tests all passed: `llm-foundry-regression-tests-runner-4bPcjd`, `llm-foundry-regression-tests-runner-s1KZPH`.
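For anyone upgrading across these removals, below is a hypothetical pre-flight check (not part of this PR or of llm-foundry itself) that flags configs still using removed options. The key paths `model.attn_config.attn_impl` and `model.name` are assumptions based on typical llm-foundry YAML layouts:

```python
# Hypothetical helper: warn about options removed in llm-foundry 0.7.0.
# The config key paths below are assumptions, not an official API.
from omegaconf import DictConfig, OmegaConf

REMOVED_IN_0_7_0 = {
    'model.attn_config.attn_impl': {'triton'},  # triton attention removed
    'model.name': {'hf_prefix_lm'},             # prefix LM models removed
}

def find_removed_options(cfg: DictConfig) -> list:
    """Return a message for each removed option still present in cfg."""
    problems = []
    for key_path, removed_values in REMOVED_IN_0_7_0.items():
        value = OmegaConf.select(cfg, key_path)  # None if the key is absent
        if value in removed_values:
            problems.append(f'{key_path}={value} was removed in 0.7.0')
    return problems

cfg = OmegaConf.create(
    {'model': {'name': 'mpt_causal_lm', 'attn_config': {'attn_impl': 'triton'}}})
print(find_removed_options(cfg))
# -> ['model.attn_config.attn_impl=triton was removed in 0.7.0']
```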

Files with resolved (outdated) review threads:

  • .github/workflows/pr-cpu.yaml
  • .github/workflows/pr-gpu.yaml
  • .github/workflows/smoketest.yaml
  • .github/workflows/codeql-analysis.yml
  • .github/workflows/code-quality.yaml
@irenedea marked this pull request as ready for review March 26, 2024 15:51
@irenedea requested a review from a team as a code owner March 26, 2024 15:51
@dakinggg (Collaborator) left a comment:
LGTM, will see if Alex can take a quick skim

@irenedea enabled auto-merge (squash) March 26, 2024 23:05
@irenedea merged commit 7f0fdae into main Mar 26, 2024
9 checks passed
@alextrott16 (Contributor) left a comment:
Minor changes requested. Otherwise LGTM.

RIP denoising.
RIP prefix LM.

```python
# Diff excerpt (removed lines; leading context truncated in the page view):
        is_subseq, batch['bidirectional_mask'][j] == 1)],
    skip_special_tokens=False,
    clean_up_tokenization_spaces=True))
```
Contributor comment:
You shouldn't delete these lines. Instead, find a different way to slice out the context. `batch['bidirectional_mask'][j] == 1` was just a convenient way to do that. The context occurs where `attention_mask == 1` and `labels == _HF_IGNORE_INDEX`.
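A minimal sketch of that alternative slicing, assuming `batch`, `j`, and `tokenizer` come from the surrounding test scope and that `_HF_IGNORE_INDEX` is the standard Hugging Face label-masking value of -100:

```python
import torch

_HF_IGNORE_INDEX = -100  # standard HF label-masking value (assumed here)

# Context tokens are attended to (attention_mask == 1) but excluded from
# the loss (labels == _HF_IGNORE_INDEX), so combine both conditions.
context_mask = torch.logical_and(
    batch['attention_mask'][j] == 1,
    batch['labels'][j] == _HF_IGNORE_INDEX,
)
print(
    tokenizer.decode(batch['input_ids'][j, context_mask],
                     skip_special_tokens=False,
                     clean_up_tokenization_spaces=True))
```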

```python
# Diff excerpt (removed lines):
tokenizer.decode(batch['input_ids'][
    j, batch['bidirectional_mask'][j] == 1],
    skip_special_tokens=False,
    clean_up_tokenization_spaces=True))
```
Contributor comment:

See above comment. Don't get rid of this, just do slicing differently.

```diff
@@ -55,43 +55,10 @@ def validate_config(cfg: DictConfig):
     loaders.append(eval_loader)
     for loader in loaders:
         if loader.name == 'text':
-            if cfg.model.name in ['hf_prefix_lm', 'hf_t5']:
+            if cfg.model.name in ['hf_t5']:
```
Contributor comment:
Suggested change:

```diff
-            if cfg.model.name in ['hf_t5']:
+            if cfg.model.name == 'hf_t5':
```

KuuCi pushed a commit that referenced this pull request Apr 18, 2024
* Bump version

* Remove triton (#1062)

* Remove github action workflows for version bumps

* Fix cpu test issues

* code quality

* Fix gpu tests

* Fix gpu tests nicely

* Remove z-loss (#1064)

* Remove prefix lm and denoising (#1065)

* Remove hf_prefix_lm

* Remove prefix_lm from mpt modeling

* Remove bidirectional mask

* Remove text denoising dataloading

* Remove adapt tokenizer

* Remove llama attention patch (#1066)

* Remove bidirectional mask in tests

* Fix test_hf_config_override with patch