Add initial ONNX regressions #2094

Merged
merged 2 commits into from
Apr 5, 2023

lintool
Member

@lintool lintool commented Apr 5, 2023

Building on @ArthurChen189's recent PR.

Some initial ONNX regressions so we can start playing around with it...

These work:

nohup python src/main/python/run_regression.py --index --verify --search --regression msmarco-passage-splade-pp-ed-onnx >& logs/log.msmarco-passage-splade-pp-ed-onnx &
nohup python src/main/python/run_regression.py --index --verify --search --regression msmarco-passage-splade-pp-sd-onnx >& logs/log.msmarco-passage-splade-pp-sd-onnx &

As well as:

nohup python src/main/python/run_regression.py --search-pool 1 --verify --search --regression dl19-passage-splade-pp-ed-onnx >& logs/log.dl19-passage-splade-pp-ed-onnx &
nohup python src/main/python/run_regression.py --search-pool 1 --verify --search --regression dl20-passage-splade-pp-ed-onnx >& logs/log.dl20-passage-splade-pp-ed-onnx &

nohup python src/main/python/run_regression.py --search-pool 1 --verify --search --regression dl19-passage-splade-pp-sd-onnx >& logs/log.dl19-passage-splade-pp-sd-onnx &
nohup python src/main/python/run_regression.py --search-pool 1 --verify --search --regression dl20-passage-splade-pp-sd-onnx >& logs/log.dl20-passage-splade-pp-sd-onnx &

I'm encountering some concurrency issues, so without --search-pool 1, the regressions fail. Will need to debug more.
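The workaround above can be sketched as a small helper that assembles the same command lines with the search pool pinned to 1. This is only an illustration: the script path, flags, regression names, and log naming are copied from the commands in this description, while the helper names (`build_command`, `log_path`) are hypothetical and not part of the repository. It prints the commands instead of running them.

```python
import shlex

# Script path and flags are copied from the commands in this PR description.
SCRIPT = "src/main/python/run_regression.py"

# Regression names taken from this PR description.
TREC_DL_REGRESSIONS = [
    "dl19-passage-splade-pp-ed-onnx",
    "dl20-passage-splade-pp-ed-onnx",
    "dl19-passage-splade-pp-sd-onnx",
    "dl20-passage-splade-pp-sd-onnx",
]


def build_command(regression, search_pool=1):
    """Return the argv list for one regression run (hypothetical helper)."""
    return [
        "python", SCRIPT,
        "--search-pool", str(search_pool),
        "--verify", "--search",
        "--regression", regression,
    ]


def log_path(regression):
    """Mirror the logs/log.<regression> naming used in the commands above."""
    return f"logs/log.{regression}"


if __name__ == "__main__":
    # Print the commands rather than executing them, so this is safe to run anywhere.
    for name in TREC_DL_REGRESSIONS:
        print(shlex.join(build_command(name)), ">&", log_path(name))
```

Keeping the pool size as a parameter that defaults to 1 makes it easy to raise it again once the concurrency issue is understood.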

cc @cadurosar

@lintool lintool requested a review from ronakice April 5, 2023 12:17
@ArthurChen189
Member

ArthurChen189 commented Apr 5, 2023

Just want to make sure we are referring to the same error (on different OSes): did only the RM3 runs fail?
[image]

@lintool
Member Author

lintool commented Apr 5, 2023

@ArthurChen189 yeah, I think that's the concurrency bug... If you try running again, I think you'll observe different runs failing. And with --search-pool 1, it should work fine...
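One way to check the "different runs failing" observation is to record which runs fail in two separate executions and compare the sets. The sketch below assumes the failing run names have already been collected by hand; the function name and the run labels are hypothetical, and nothing here parses real regression logs.

```python
def compare_failures(first_run, second_run):
    """Split failed-run names from two executions into stable and unstable groups."""
    first, second = set(first_run), set(second_run)
    return {
        "failed_both_times": sorted(first & second),
        "failed_only_once": sorted(first ^ second),
    }


# Hypothetical run labels, for illustration only.
result = compare_failures(["run-a", "run-b"], ["run-b", "run-c"])
print(result)
```

A non-empty `failed_only_once` list points to nondeterminism, which is consistent with a concurrency bug but does not prove one.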

@cadurosar
Collaborator

The weirdest part for me is that it is not just a small loss of effectiveness; it is almost like it is losing the original representation.

@lintool
Member Author

lintool commented Apr 5, 2023

> The weirdest part for me is that it is not just a small loss of effectiveness; it is almost like it is losing the original representation.

I think it's a weird concurrency bug...

@ArthurChen189
Member

lgtm!

@lintool lintool merged commit a7df7fc into master Apr 5, 2023
@lintool lintool deleted the regressions branch April 5, 2023 18:31
lintool added a commit that referenced this pull request May 30, 2023
+ Regressions themselves were added in #2094 - this patch adds the documentation.
+ Fix download paths for SPLADE++ ED/SD corpora.
+ Minor tweaks for other docs
RMaarefdoust pushed a commit to RMaarefdoust/anserini that referenced this pull request Sep 16, 2024
There appears to be a concurrency bug associated with regressions, which we address with
python src/main/python/run_regression.py --search-pool 1 ... but we should circle back and
look into it later.
RMaarefdoust pushed a commit to RMaarefdoust/anserini that referenced this pull request Sep 16, 2024
+ Regressions themselves were added in castorini#2094 - this patch adds the documentation.
+ Fix download paths for SPLADE++ ED/SD corpora.
+ Minor tweaks for other docs
3 participants