
Some descriptions in README not clear #1

Open
withchencheng opened this issue Nov 12, 2022 · 3 comments

@withchencheng

Hi Yue, I don't understand some of the descriptions in this project's README.

1. In https://github.com/yueyu1030/actune#training, the instructions say:

   > Take AG News dataset as an example, run_agnews_finetune.sh is used for running the experiment of standard active learning approaches, and run_agnews_finetune.sh is used for running active self-training experiments as unlabeled data is also used during fine-tuning.

   You use the same file name `run_agnews_finetune.sh` twice. Which one is your paper's method?

2. In https://github.com/yueyu1030/actune#hyperparameter-tuning, what does `pool` stand for?

Thank you!

@yueyu1030
Owner

Thanks for reaching out.

For question #1, `run_agnews.sh` runs our main method (active self-training). We will update the README to avoid confusion.

For question #2, `pool` is the size of the unlabeled data used in self-training. In self-training we usually do not use all of the unlabeled data, because many pseudo-labels may be noisy. A common solution is to first select a subset of examples with low uncertainty (`pool` is the size of that subset) and fine-tune the pretrained language model only on this subset, together with the labeled data. Hope these explanations help.
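
As a rough sketch of this selection step (the function below is illustrative, not code from this repo; it assumes predictive entropy as the uncertainty score):

```python
import torch

def select_low_uncertainty_subset(logits: torch.Tensor, pool_size: int):
    """Pick the `pool_size` unlabeled examples the model is most confident
    about, returning their indices and hard pseudo-labels.

    logits: (num_unlabeled, num_classes) predictions from the current model.
    """
    probs = torch.softmax(logits, dim=-1)
    # Predictive entropy as the uncertainty score (an assumed choice;
    # max-probability or margin-based scores would also work).
    entropy = -(probs * torch.log(probs + 1e-12)).sum(dim=-1)
    # Keep the pool_size lowest-entropy examples.
    selected = torch.argsort(entropy)[:pool_size]
    pseudo_labels = probs[selected].argmax(dim=-1)
    return selected, pseudo_labels
```

The selected pseudo-labeled examples would then be mixed with the labeled set for fine-tuning.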

Best,
Yue

@linhlt-it-ee

So how do you configure this number for different datasets? What value should it take if I test your method with TREC?

@yueyu1030
Owner

Overall, we tune this parameter based on performance on the validation set.
If there is no validation set, we recommend gradually (linearly) increasing the number of unlabeled examples to around 50% of the size of the unlabeled pool.
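
A minimal sketch of that schedule (illustrative only; the function name, the per-round linear growth, and the 50% cap are assumptions based on the recommendation above):

```python
def unlabeled_budget(round_idx: int, num_rounds: int, pool_size: int) -> int:
    """Linearly grow the number of pseudo-labeled examples used per round,
    reaching roughly 50% of the unlabeled pool by the final round."""
    max_budget = pool_size // 2  # cap at about half the unlabeled pool
    return max_budget * (round_idx + 1) // num_rounds

# Example: 10 rounds over a 20,000-example pool gives budgets of
# 1000, 2000, ..., 10000 examples in rounds 0 through 9.
```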
