Bug fixes and tweaks for a stronger baseline #7
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
A few bug fixes and tweaks for a stronger baseline.
This improves MRR from 0.5845 to 0.6155 and NDCG from 0.5070 to 0.5315 on
val
.Changes:
val
intrain.py
.shuffle=True
toDataLoader
).torch.cuda.empty_cache()
. Negligible time hit on single GPU, and fits batch sizes of up to 32 x no. of GPUs. There's some time gain when training with larger batch sizes.I've updated the config yaml. Will likely update with a trained model on
trainval
+ numbers ontest-std
in 2-3 days.