
Can CXR-BERT be used / fine-tuned for text generation? #886

Open

PabloMessina opened this issue May 9, 2023 · 3 comments
Labels
hi-ml-multimodal: Issues related to the hi-ml-multimodal package

Comments


PabloMessina commented May 9, 2023

I have many experiments in mind where I need to condition a Transformer Decoder on some input (e.g. image features, discrete binary labels, a one-hot vector representing some concept, a question, etc.) in order to generate an output (e.g. a report, an answer). I have already implemented many of these ideas with my own custom Transformer Decoder built on PyTorch's standard implementation. However, I would now like to leverage existing pre-trained language models instead of a custom implementation that always starts from scratch.

Thus, I was wondering if there is an easy way to adapt CXR-BERT (or any other model you would recommend) for text generation, given some input. For example, say I have a binary vector encoding certain information, and I want to fine-tune CXR-BERT to generate a paragraph verbalizing the information contained in that vector. The paragraph could be, for example, a radiology report, so fine-tuning a model like CXR-BERT for report generation should plausibly outperform a PyTorch Transformer Decoder trained from scratch.
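For reference, this is roughly the kind of conditioning I mean with the standard PyTorch decoder; all sizes, names, and the label-projection layer below are made up purely for illustration:

import torch
import torch.nn as nn

# Toy setup: a decoder that cross-attends to a single "memory" token derived from a binary label vector.
vocab_size, d_model, num_labels = 30522, 768, 14

embed = nn.Embedding(vocab_size, d_model)
label_proj = nn.Linear(num_labels, d_model)              # binary vector -> one memory token
decoder_layer = nn.TransformerDecoderLayer(d_model=d_model, nhead=8, batch_first=True)
decoder = nn.TransformerDecoder(decoder_layer, num_layers=6)
lm_head = nn.Linear(d_model, vocab_size)

tokens = torch.randint(0, vocab_size, (2, 32))           # (batch, seq_len) target report tokens
labels = torch.randint(0, 2, (2, num_labels)).float()    # (batch, num_labels) binary conditioning

memory = label_proj(labels).unsqueeze(1)                 # (batch, 1, d_model), used as cross-attention memory
causal_mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
logits = lm_head(decoder(embed(tokens), memory, tgt_mask=causal_mask))  # (batch, seq_len, vocab_size)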

Questions:

  • Is this something that can be easily accomplished?
  • Are there examples of adapting CXR-BERT for text generation?
  • What if I need a custom input that conditions the text generation, such as a binary vector?

Thank you very much in advance.

ant0nsc added the hi-ml-multimodal label on May 11, 2023

ant0nsc (Collaborator) commented May 11, 2023

@fepegar could you route that question please?

fepegar (Contributor) commented May 11, 2023

@corcra @Shruthi42 @ozan-oktay @qianchu

Could you please share your thoughts?

qianchu commented May 23, 2023

Hello, you can run the following:

from transformers import BertLMHeadModel

# Load the CXR-BERT weights into a BERT model with a language-modelling head,
# configured as a decoder (causal attention).
model = BertLMHeadModel.from_pretrained("<cxr-bert model path>", is_decoder=True)

This initialises a decoder model, which you can then fine-tune for generation.
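For conditioning on something like a binary vector, one possible (untested) sketch is to also enable cross-attention and feed a projected conditioning vector in as encoder_hidden_states. The checkpoint path is the same placeholder as above, and the projection layer and sizes are illustrative rather than anything CXR-BERT ships with:

import torch
from transformers import BertLMHeadModel

# Load the checkpoint as a decoder and add cross-attention layers
# (these are newly/randomly initialised and learned during fine-tuning).
model = BertLMHeadModel.from_pretrained(
    "<cxr-bert model path>",
    is_decoder=True,
    add_cross_attention=True,
)

num_labels = 14                                            # assumed size of the binary vector
condition_proj = torch.nn.Linear(num_labels, model.config.hidden_size)

binary_vec = torch.randint(0, 2, (1, num_labels)).float()  # (batch, num_labels)
encoder_states = condition_proj(binary_vec).unsqueeze(1)   # (batch, 1, hidden) sequence the decoder attends to

input_ids = torch.tensor([[101, 7592, 2088, 102]])         # dummy token ids, just for illustration

outputs = model(
    input_ids=input_ids,
    encoder_hidden_states=encoder_states,
    labels=input_ids,                                       # causal LM loss; labels are shifted internally
)
outputs.loss.backward()

Since the cross-attention weights start from random initialisation, all of the conditioning behaviour has to be learned during fine-tuning; only the self-attention and embedding weights come from the pre-trained checkpoint.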
