Defending Against Disinformation Attacks in Open-Domain Question Answering

This is the official repository for the paper Defending Against Disinformation Attacks in Open-Domain Question Answering.

Overview

This paper proposes to defend against disinformation poisoning attacks in open-domain question answering (e.g. e.g. someone malicously puts a website with fake information to be indexed by search engines). We provide tools and data for generating augmented queries and evaluating the robustness of question answering models with various methods.

Quick Start

Most of the data is pre-generated and available in a Hugging Face dataset: orionweller/Defending-Agaisnt-Disinformation-EACL-24.

Initial Setup

Install requirements:

conda env create --file conda_env.yml
conda activate conflicts

Clone the pre-generated data:

git clone https://huggingface.co/datasets/orionweller/Defending-Agaisnt-Disinformation-EACL-24
mv Defending-Agaisnt-Disinformation-EACL-24/* .

Detailed Instructions

Generate Augmented Queries

Pre-generated augmented queries can be found in data/*/*_w_generations*.json.

To regenerate:

Get questions for GPT-3:
```
python get_questions_from_dataset.py
```

Run GPT-3 paraphrasing:

python prompt_gpt3.py --dataset_name {nq,tqa} --API_TOKEN <YOUR_API_TOKEN>

Run LLama-2 paraphrasing:
```
python prompt_llama2.py
```

Note: GPT-3 Davinci may be unavailable. Pre-generated questions are located in (using TQA as an example):

data/TQA/tqa_w_generations.json (GPT-3)
data/TQA/tqa_w_generations_llama.json (Llama2)

Generate Disinformation Conflicts

To recreate the conflicting data:

Clone the knowledge conflicts repository:

git clone https://github.com/apple/ml-knowledge-conflicts.git
cd ml-knowledge-conflicts

Follow setup instructions:
```
bash setup.sh
```

Generate substitutions for Natural Questions:

PYTHONPATH=. python src/load_dataset.py -d MRQANaturalQuestionsDev -w wikidata/entity_info.json.gz
PYTHONPATH=. python src/generate_substitutions.py --inpath datasets/normalized/MRQANaturalQuestionsDev.jsonl --outpath datasets/substitution-sets/MRQANaturalQuestionsDevType.jsonl corpus-substitution

Generate substitutions for TriviaQA:

PYTHONPATH=. python src/load_dataset.py -d MRQATriviaQADev -w wikidata/entity_info.json.gz
PYTHONPATH=. python src/generate_substitutions.py --inpath datasets/normalized/MRQATriviaQADev.jsonl --outpath datasets/substitution-sets/MRQATriviaQADevType.jsonl corpus-substitution

Search and Predict Using ATLAS and FiD

Download Models

Download FiD models:
```
bash scripts/download_FiD_models.sh
```
Set up FiD dataset:
- Follow instructions in the FiD repo
- Run get-data.sh and copy data to data/
Set up ATLAS:
- Clone the ATLAS repo
- Download ATLAS files:
```
bash scripts/download_atlas_models.sh
```

Generate embedding indices:

bash scripts/generate_all_embeddings.sh {nq,tqa}

Gather Model Retrieval and Predictions

FiD

Prepare retrieval data:

python convert_generations_to_dpr_format.py -p data/NQ/nq_w_generations.json -o artifacts/questions_to_retrieve_nq.json

Retrieve data:

qsub -N ret-fid retrieve_FiD.sh {nq,tqa}

Create poisoned data:
```
bash bulk_create_conflicts.sh
```
Run FiD evaluation:
```
bash evaluate_all_FiD.sh
```

ATLAS

Convert data format:

python convert_generations_to_dpr_format.py -r -p data/NQ/nq_w_generations.json -o artifacts/questions_to_retrieve_nq_atlas.json

Retrieve passages:

qsub -N ret-atlas scripts/retrieve_atlas.sh {nq,tqa}

Create poisoned data:
```
bash bulk_create_conflicts.sh
```
Run ATLAS evaluation:
```
bash scripts/evaluate_all_ATLAS.sh
```

Evaluation

Calculate results:
```
bash scripts/bulk_calculate_results.sh
```

Analyze overall groups:

python3 collect_results_across_percents.py -r results_nq_dev

Citation

If you use this code or data in your research, please cite our paper:

@inproceedings{weller-etal-2024-defending,
    title = "Defending Against Disinformation Attacks in Open-Domain Question Answering",
    author = "Weller, Orion  and
      Khan, Aleem  and
      Weir, Nathaniel  and
      Lawrie, Dawn  and
      Van Durme, Benjamin",
    booktitle = "Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers)",
    month = mar,
    year = "2024",
    address = "St. Julian{'}s, Malta",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.eacl-short.35",
    pages = "402--417",
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
plotting		plotting
scripts		scripts
README.md		README.md
calculate_results.py		calculate_results.py
conda_env.yml		conda_env.yml
convert_generations_to_dpr_format.py		convert_generations_to_dpr_format.py
create_conflicts.py		create_conflicts.py
evaluate_script.py		evaluate_script.py
get_questions_from_datasets.py		get_questions_from_datasets.py
prompt_gpt3.py		prompt_gpt3.py
prompt_llama2.py		prompt_llama2.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Defending Against Disinformation Attacks in Open-Domain Question Answering

Overview

Quick Start

Initial Setup

Detailed Instructions

Download Models

Gather Model Retrieval and Predictions

FiD

ATLAS

Citation

About

Languages

orionw/disinformation-defense

Folders and files

Latest commit

History

Repository files navigation

Defending Against Disinformation Attacks in Open-Domain Question Answering

Overview

Quick Start

Initial Setup

Detailed Instructions

Download Models

Gather Model Retrieval and Predictions

FiD

ATLAS

Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages