Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prepare converter from squad to beir #294

Open
5 tasks
laugustyniak opened this issue Aug 9, 2023 · 1 comment
Open
5 tasks

Prepare converter from squad to beir #294

laugustyniak opened this issue Aug 9, 2023 · 1 comment
Assignees

Comments

@laugustyniak
Copy link
Collaborator

laugustyniak commented Aug 9, 2023

some similar ideas:

from beir.datasets.data_loader_hf import HFDataLoader
corpus, queries, qrels = HFDataLoader(hf_repo=f"clarin-knext/{dataset}", streaming=False, keep_in_memory=False).load(split=split)
# Conversion from DataSet
queries = {query['id']: {'text': query['text']} for query in queries}
corpus = {doc['id']: {'title': doc['title'] , 'text': doc['text']} for doc in corpus}
@laugustyniak
Copy link
Collaborator Author

laugustyniak commented Aug 24, 2023

@mkossakowski19 can you link the branch for it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants