Multidimensional Evaluation for Text Style Transfer Using ChatGPT. Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)

laihuiyuan/eval-formality-transfer

This repository contains the data and scripts associated with two papers: Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022) and Multidimensional Evaluation for Text Style Transfer Using ChatGPT. If you use any of the following contents, please cite the corresponding paper:

@inproceedings{lai-etal-2022-human,
    title = "Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer",
    author = "Lai, Huiyuan  and
      Mao, Jiali  and
      Toral, Antonio  and
      Nissim, Malvina",
    booktitle = "Proceedings of the 2nd Workshop on Human Evaluation of NLP Systems (HumEval)",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.humeval-1.9",
    pages = "102--115"
}
@misc{lai2023multidimensional,
      title={Multidimensional Evaluation for Text Style Transfer Using ChatGPT}, 
      author={Huiyuan Lai and Antonio Toral and Malvina Nissim},
      year={2023},
      eprint={2304.13462},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Note: All data used in these two papers is stored in the chatgpt directory in JSON format. The sections below cover the automatic evaluation scripts and list text style transfer papers published in the ACL Anthology over the last three years.

Automatic Evaluation

Dataset

  • PT16
  • GYAFC: informal text (0) <-> formal text (1)

Style Strength

# Train a classifier or a regressor (pass either single_label_classification or regression to -task)
python train_style.py -dataset xformal -task single_label_classification/regression

# Evaluation
python eval_style.py -dataset xformal -model BART -task single_label_classification/regression
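
As a rough illustration of what a classifier-based style strength score measures, the sketch below computes the fraction of system outputs whose predicted label matches the target style. It is a minimal sketch, not the logic of eval_style.py: the function name `style_accuracy` and the example predictions are hypothetical, and the labels are assumed to follow the GYAFC convention listed above (0 = informal, 1 = formal).

```python
from typing import Sequence


def style_accuracy(predicted: Sequence[int], target_style: int) -> float:
    """Fraction of outputs whose predicted style label equals the target style."""
    if not predicted:
        raise ValueError("no predictions given")
    hits = sum(1 for label in predicted if label == target_style)
    return hits / len(predicted)


# Hypothetical classifier predictions for five informal-to-formal outputs
# (assumed convention: 0 = informal, 1 = formal).
preds = [1, 1, 0, 1, 1]
print(style_accuracy(preds, target_style=1))
```

A regressor would instead produce a continuous score per sentence, which is typically averaged or correlated with human ratings rather than thresholded.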

Content Preservation

python eval_content.py data/output/BART.human.txt data/output/BART.auto.txt
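
To make the idea of content preservation concrete, the sketch below scores each system output against a reference using token-level F1 overlap and averages over the corpus. This is a deliberately simple stand-in and does not reproduce the metrics computed by eval_content.py. The function names `token_f1` and `corpus_score` are hypothetical, and the inputs are assumed to be line-aligned lists of sentences.

```python
from collections import Counter
from typing import Sequence


def token_f1(candidate: str, reference: str) -> float:
    """Token-level F1 between a candidate and a reference (whitespace tokens, lowercased)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    # Multiset intersection counts each shared token at most as often as it occurs in both.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def corpus_score(candidates: Sequence[str], references: Sequence[str]) -> float:
    """Average token F1 over line-aligned candidates and references."""
    if len(candidates) != len(references):
        raise ValueError("candidates and references must be line-aligned")
    if not candidates:
        raise ValueError("empty input")
    return sum(token_f1(c, r) for c, r in zip(candidates, references)) / len(candidates)


# Hypothetical example sentences.
outputs = ["I am fine , thank you .", "Please let me know ."]
refs = ["I am fine , thank you .", "Please inform me ."]
print(corpus_score(outputs, refs))
```

Surface-overlap scores like this one penalize valid paraphrases, which is one reason learned metrics are commonly compared against human judgements.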

Fluency

# Train a GPT-2-based language model for each style
python train_lm.py -dataset xformal -style 0
python train_lm.py -dataset xformal -style 1

# Evaluation
python eval_fluency.py -model BART 
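
Language-model fluency is commonly reported as perplexity, the exponential of the average negative log-likelihood per token. The sketch below shows that calculation on precomputed token log-probabilities. Whether eval_fluency.py reports exactly this quantity is an assumption, and the function name `perplexity` and the example values are hypothetical.

```python
import math
from typing import Sequence


def perplexity(token_logprobs: Sequence[float]) -> float:
    """Perplexity from natural-log token probabilities: exp(-mean(log p))."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)


# Hypothetical log-probabilities that a language model assigned to four tokens.
logprobs = [math.log(0.5), math.log(0.25), math.log(0.5), math.log(0.25)]
print(perplexity(logprobs))
```

Lower perplexity means the language model finds the sentence more predictable, which is used as a proxy for fluency in the corresponding style.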

Note: All models, including the style regressor, the classifiers, and the GPT-2 language models for fluency, can be found here.

Style Transfer Papers

2021

2020

2019
