
[WWW 2021] Source code and datasets for the paper "Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval".


CRIPAC-DIG/GHRM


GHRM


This is the code for the WWW 2021 paper "Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval".

Usage

Data

We used two datasets in our paper: robust04 and clueweb09. After downloading the full datasets, put queries.tsv and documents.tsv into the corresponding robust04 or clueweb09 subfolder of Data.
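Before preprocessing, it can help to confirm that the files are where the scripts are likely to look for them. The following is a minimal sketch, not part of the repository: the helper name missing_data_files is hypothetical, and it assumes the layout is exactly Data/<dataset>/queries.tsv and Data/<dataset>/documents.tsv as described above.

```python
from pathlib import Path

# File and subfolder names are taken from the README text above.
EXPECTED_FILES = ("queries.tsv", "documents.tsv")
DATASETS = ("robust04", "clueweb09")


def missing_data_files(data_root, dataset):
    """Return the expected files that are absent from data_root/dataset.

    Hypothetical helper for illustration; it is not part of the GHRM code.
    """
    if dataset not in DATASETS:
        raise ValueError(f"unknown dataset: {dataset!r}")
    folder = Path(data_root) / dataset
    return [name for name in EXPECTED_FILES if not (folder / name).is_file()]


if __name__ == "__main__":
    # Run from the repository root so that "Data" resolves correctly.
    for dataset in DATASETS:
        missing = missing_data_files("Data", dataset)
        status = "ok" if not missing else "missing " + ", ".join(missing)
        print(f"{dataset}: {status}")
```

An empty list means both files are present for that dataset.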

Quick Start

You first need to preprocess the data. For example, cd Data and then run the following in order: python preprocess.py, python bulid_dict.py, python gen_word_embeddings.py, python graph_construction.py, and python idf_construction.py.
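The preprocessing steps above can be chained so that a failure in one step stops the rest. This is a sketch under stated assumptions rather than a script shipped with the repository: the function names build_commands and run_all are hypothetical, the script names are copied verbatim from the list above, and each script is assumed to take no arguments and to be run from inside Data.

```python
import subprocess
import sys

# Script names copied verbatim from the README, in the order it lists them.
PREPROCESS_STEPS = [
    "preprocess.py",
    "bulid_dict.py",
    "gen_word_embeddings.py",
    "graph_construction.py",
    "idf_construction.py",
]


def build_commands(python=sys.executable):
    """Return one command (as an argument list) per preprocessing step."""
    return [[python, script] for script in PREPROCESS_STEPS]


def run_all(data_dir="Data"):
    """Run every step inside data_dir, stopping at the first failure."""
    for cmd in build_commands():
        print("running:", " ".join(cmd))
        # check=True raises CalledProcessError on a non-zero exit code,
        # so later steps never run on incomplete output.
        subprocess.run(cmd, cwd=data_dir, check=True)
```

Calling run_all() from the repository root executes the five steps sequentially; pass a different data_dir if the Data folder lives elsewhere.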

Then you can run the file Code/run.py to train the model.

For example: cd Code; python run.py --model GHRM --gpuid 0 --qrl_len 4 --dataset robust04
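To launch the same training command for several settings, the argument list can be assembled programmatically. The sketch below is illustrative only: train_command is a hypothetical helper, the four flags are the ones shown in the example above, and whether run.py accepts other values (for instance clueweb09 for --dataset) is an assumption based on the Data subfolder names.

```python
import sys


def train_command(dataset, model="GHRM", gpuid=0, qrl_len=4,
                  python=sys.executable):
    """Build the run.py argument list using the flags shown in the README."""
    return [
        python, "run.py",
        "--model", model,
        "--gpuid", str(gpuid),
        "--qrl_len", str(qrl_len),
        "--dataset", dataset,
    ]


# Print the command for each dataset named in the Data section.
for name in ("robust04", "clueweb09"):
    print(" ".join(train_command(name, python="python")))
```

The resulting list can be passed to subprocess.run with cwd set to Code, which mirrors the cd Code step above.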

Requirements

  • Python 3.6+
  • PyTorch 1.5.1

Citation

Please cite our paper if you use the code:

@inproceedings{yu2021graph,
title={Graph-based Hierarchical Relevance Matching Signals for Ad-hoc Retrieval},
author={Yu, Xueli and Xu, Weizhi and Cui, Zeyu and Wu, Shu and Wang, Liang},
booktitle={Proceedings of the Web Conference 2021},
pages={778--787},
year={2021}
}
