Pretrained Models #4

KawshikManikantan · 2022-12-13T15:03:18Z

That is a really interesting idea proposed. Thanks for the release of the codebase. Can you please tell me when the pre-trained models would be released?

Hubotcoder · 2022-12-14T01:44:14Z

I downloaded them from huggingface, and I guess you should put them in the directory mentioned in each conf, for example:
data_dir = ${ASP}/data/conll03_ner/
model_dir = ${ASP}/data/conll03_ner/
log_root = ${ASP}/data/conll03_ner/

KawshikManikantan · 2022-12-14T10:46:51Z

@Hubotcoder @lyutyuh Huggingface contains pretrained T5 but if I understand right there needs to be additional weights (feedforward networks) for the specific tasks like NER? Please correct me if I am wrong.

Hubotcoder · 2022-12-15T01:34:55Z

@Hubotcoder @lyutyuh Huggingface contains pretrained T5 but if I understand right there needs to be additional weights (feedforward networks) for the specific tasks like NER? Please correct me if I am wrong.

You are right. Now I get what you mean. Let's wait for the author.

KawshikManikantan · 2022-12-31T12:43:28Z

@Hubotcoder, tried contacting the authors via mail but no reply even there. How did you advance with this repository? Were you able to train the models?

Thank you

Hubotcoder · 2023-02-10T01:44:52Z

@Hubotcoder, tried contacting the authors via mail but no reply even there. How did you advance with this repository? Were you able to train the models?

Thank you

@KawshikManikantan Hello. Not yet. I am still asking my boss to buy me a GPU.

Niklss · 2023-06-04T21:53:27Z

@Hubotcoder, tried contacting the authors via mail but no reply even there. How did you advance with this repository? Were you able to train the models?

Thank you

May be too late, but I've been able to start training t5_large based ere model with flant5_large_conll04 config (except use_amp, I changed it to false. V100 is not working with bf16). It requires 25870MiB of GPU memory.

The last results are this

06/04/2023 21:50:34 - INFO - /home/jovyan/au/ASP/util/runner.py - *******************EPOCH 34*******************
06/04/2023 21:50:37 - INFO - /home/jovyan/au/ASP/util/runner.py - Step 4940: avg loss 0.26; steps/sec 2.45
06/04/2023 21:50:45 - INFO - /home/jovyan/au/ASP/util/runner.py - Step 4960: avg loss 0.18; steps/sec 2.46
06/04/2023 21:50:54 - INFO - /home/jovyan/au/ASP/util/runner.py - Step 4980: avg loss 0.25; steps/sec 2.41
06/04/2023 21:51:02 - INFO - /home/jovyan/au/ASP/util/runner.py - Step 5000: avg loss 0.06; steps/sec 2.52
06/04/2023 21:51:02 - INFO - /home/jovyan/au/ASP/util/runner.py - Dev
06/04/2023 21:51:02 - INFO - /home/jovyan/au/ASP/util/runner.py - Step 5000: evaluating on 231 samples with batch_size 32
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_Precision: 99.7755
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_Recall: 99.5521
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_F1: 99.6637
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_Precision: 98.5465
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_Recall: 98.8338
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_F1: 98.6900
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_Precision: 98.5465
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_Recall: 98.8338
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_F1: 98.6900
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Test
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval max f1: 98.84
06/04/2023 21:51:34 - INFO - /home/jovyan/au/ASP/util/runner.py - Test max f1: 0.00

As you may see, It doesn't performs evaluation on test dataset.

UPD: it doesn't perform test evaluation on the run, but there is a separate method to run the evaluation.

4000 step result (27 epochs)

06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_Precision: 88.8889
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_Recall: 90.4541
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Ent_F1: 89.6647
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_Precision: 72.5806
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_Recall: 74.6445
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_F1: 73.5981
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_Precision: 72.5806
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_Recall: 74.6445
06/04/2023 22:14:06 - INFO - /home/jovyan/au/ASP/util/runner.py - Eval_Rel_p_F1: 73.5981

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pretrained Models #4

Pretrained Models #4

KawshikManikantan commented Dec 13, 2022

Hubotcoder commented Dec 14, 2022

KawshikManikantan commented Dec 14, 2022 •

edited

Loading

Hubotcoder commented Dec 15, 2022

KawshikManikantan commented Dec 31, 2022

Hubotcoder commented Feb 10, 2023

Niklss commented Jun 4, 2023 •

edited

Loading

Pretrained Models #4

Pretrained Models #4

Comments

KawshikManikantan commented Dec 13, 2022

Hubotcoder commented Dec 14, 2022

KawshikManikantan commented Dec 14, 2022 • edited Loading

Hubotcoder commented Dec 15, 2022

KawshikManikantan commented Dec 31, 2022

Hubotcoder commented Feb 10, 2023

Niklss commented Jun 4, 2023 • edited Loading

UPD: it doesn't perform test evaluation on the run, but there is a separate method to run the evaluation.

KawshikManikantan commented Dec 14, 2022 •

edited

Loading

Niklss commented Jun 4, 2023 •

edited

Loading