
Add dataloader arg to Trainer.test() #1393

Closed
Anjum48 opened this issue Apr 6, 2020 · 8 comments · Fixed by #1434
Labels
discussion In a discussion stage feature Is an improvement or enhancement help wanted Open to be worked on let's do it! approved to implement priority: 0 High priority task


Anjum48 commented Apr 6, 2020

🚀 Feature

It would be nice if you could use a model for inference using:
Trainer.test(model, test_dataloaders=test_loader)

Motivation

This would match the calling signature of Trainer.fit() and allow test to be run on any dataset, multiple times.

Pitch

Here's a use case: after training a model with 5-fold cross-validation, you may want to stack the 5 checkpoints across multiple models, which requires (a) the out-of-fold (OOF) predictions and (b) the 5 test-set predictions (which are then averaged). It would be cool if (a) and (b) could be generated as follows:

for f in folds:
    # load_from_checkpoint returns a new model instance
    model1 = model1.load_from_checkpoint(f'path/to/model1_fold{f}.ckpt')
    trainer.test(model1, test_dataloaders=valid_loader)  # (a) OOF predictions
    trainer.test(model1, test_dataloaders=test_loader)   # (b) test predictions

    model2 = model2.load_from_checkpoint(f'path/to/model2_fold{f}.ckpt')
    trainer.test(model2, test_dataloaders=valid_loader)
    trainer.test(model2, test_dataloaders=test_loader)

Alternatives

Maybe I'm misunderstanding how test works and there is an easier way? Or perhaps the best way to do this is to write an inference function as you would in pure PyTorch?
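The "pure PyTorch" alternative boils down to running the model in eval mode and collecting outputs per fold; averaging the per-fold test predictions (part b of the pitch) is then plain bookkeeping. A minimal, framework-free sketch of just that averaging step (the helper name is hypothetical):

```python
def average_fold_predictions(fold_preds):
    """Average per-sample predictions across folds.

    fold_preds: one list of predictions per fold, all the same
    length (one float per test sample).
    """
    n_folds = len(fold_preds)
    # zip(*fold_preds) groups the k fold predictions for each sample
    return [sum(sample) / n_folds for sample in zip(*fold_preds)]

# three folds, two test samples
blended = average_fold_predictions([[0.2, 0.8], [0.4, 0.6], [0.6, 0.4]])
```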

Additional context

@Anjum48 Anjum48 added feature Is an improvement or enhancement help wanted Open to be worked on labels Apr 6, 2020

github-actions bot commented Apr 6, 2020

Hi! Thanks for your contribution, great first issue!


Borda commented Apr 8, 2020

I am in favour of adding this option, but first, let's see how it fits the API.
@williamFalcon any strong suggestions against? cc: @PyTorchLightning/core-contributors

@Borda Borda added the discussion In a discussion stage label Apr 8, 2020
@williamFalcon
Contributor

test is meant to ONLY operate on the test set. It's meant to keep people from using the test set when they shouldn't haha (i.e. only right before publication or right before production use).

Additions that I'm not sure align well:

  1. Trainer.test as a static call on the class. Why wouldn't you just init the trainer? Otherwise you won't be able to test in distributed environments or configure the things you need, like apex.

Additions that are good:

  1. Allowing the test function to take in a dataloader; this also aligns with how fit works.
  2. fit should also not take a test dataloader (not sure if it does now).
  3. The current .test already uses the test dataloader defined in the LightningModule, so the ONLY addition we're talking about here is allowing test to ALSO take in a dataloader and use that one only.


Ir1d commented Apr 9, 2020

btw I'm interested in how to "train a model using 5-fold cross-validation" in PL.

@williamFalcon williamFalcon added priority: 0 High priority task let's do it! approved to implement labels Apr 9, 2020
@williamFalcon
Contributor

Let's do this:

  1. Add a test_dataloaders argument to .test()
  2. Remove the test_dataloaders argument from .fit()?
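As a signature-level sketch, the two steps above would look roughly like this (a simplified stub, not the real Trainer):

```python
class Trainer:
    def fit(self, model, train_dataloader=None, val_dataloaders=None):
        # step 2: no test_dataloaders parameter on fit()
        ...

    def test(self, model=None, test_dataloaders=None):
        # step 1: new optional argument; when omitted, .test() keeps
        # using the LightningModule's test_dataloader()
        ...
```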


rohitgr7 commented Apr 9, 2020

btw I'm interested in how to "train a model using 5-fold cross-validation" in PL.

@Ir1d Try this:
https://www.kaggle.com/rohitgr/quest-bert
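Since notebook links can rot, the framework-agnostic core of k-fold CV is just generating the index splits; a stdlib-only sketch (helper name hypothetical):

```python
def kfold_splits(n_samples, k):
    """Yield (train_indices, val_indices) for k roughly equal folds."""
    indices = list(range(n_samples))
    # distribute the remainder over the first n_samples % k folds
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        val = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, val
        start += size
```

Each fold's validation indices are disjoint from its training indices, and the k validation sets together cover every sample exactly once.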

williamFalcon pushed a commit that referenced this issue Apr 10, 2020
* Add test_dataloaders to test method

* Remove test_dataloaders from .fit()

* Fix code comment

* Fix tests

* Add test_dataloaders to test method (#1393)

* Fix failing tests

* Update docs (#1393)
@Borda Borda added this to the 0.7.3 milestone Apr 10, 2020
tullie pushed a commit to tullie/pytorch-lightning that referenced this issue Jun 7, 2020
* Add test_dataloaders to test method

* Remove test_dataloaders from .fit()

* Fix code comment

* Fix tests

* Add test_dataloaders to test method (Lightning-AI#1393)

* Fix failing tests

* Update docs (Lightning-AI#1393)

ArthDh commented Jun 8, 2020

https://www.kaggle.com/rohitgr/quest-bert

Hey @rohitgr7! The link seems to be broken, could you point to any other resource? Thanks!


rohitgr7 commented Jun 8, 2020

@ArthDh Try this one: https://www.kaggle.com/rohitgr/roberta-with-pytorch-lightning-train-test-lb-0-710

@Borda Borda modified the milestones: 0.7.3, v0.7.x Apr 18, 2021