Write integration tests #39

mattersoflight · 2020-04-08T23:50:23Z

What is the bottleneck? Please describe.

Our review process involves processing 2-3 datasets to determine if the pipeline is functional. Running these tests takes 15-20 minutes.

Describe the solution you'd like and alternatives

An automated integration test would speed up the review process. It would make sense to create a dataset of 4 random wells from 3 acquisitions and test that the ODs, intensities, and backgrounds output by the pipeline match a curated reference. If our results become more accurate, we will update the reference.
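
A minimal sketch of what such a reference comparison could look like. The spot names, metric values, and the `compare_to_reference` helper are all hypothetical and only illustrate the idea of checking ODs, intensities, and backgrounds against a curated reference within a tolerance; the real pipeline output format may differ.

```python
import math

# Hypothetical curated reference for one well: spot name -> metrics.
REFERENCE = {
    "A1": {"od": 0.52, "intensity": 1830.0, "background": 6050.0},
    "B2": {"od": 0.11, "intensity": 5400.0, "background": 6100.0},
}


def compare_to_reference(result, reference, rel_tol=1e-3):
    """Return a list of mismatch messages; an empty list means the check passed."""
    mismatches = []
    for spot, ref_metrics in reference.items():
        if spot not in result:
            mismatches.append(f"{spot}: missing from pipeline output")
            continue
        for name, expected in ref_metrics.items():
            actual = result[spot].get(name)
            if actual is None or not math.isclose(actual, expected, rel_tol=rel_tol):
                mismatches.append(f"{spot}.{name}: expected {expected}, got {actual}")
    return mismatches


def test_pipeline_matches_reference():
    # Stand-in for running the real pipeline on a test well (assumption).
    result = {spot: dict(metrics) for spot, metrics in REFERENCE.items()}
    assert compare_to_reference(result, REFERENCE) == []
```

Returning a list of mismatches rather than failing on the first one makes it easier to see whether a change shifted a single spot or every value in a well.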

A good time to add this feature would be after refactoring (#34, #36, #29) but before multiprocessing (#33).

It would also be useful to invoke the integration test from the CLI with a --test flag.
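
One way the flag could be wired up with `argparse`, as a rough sketch. The `--input` option, `run_integration_test`, and `main` are placeholder names, not the project's actual entry point.

```python
import argparse


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Hypothetical pipeline entry point")
    parser.add_argument("--input", help="directory of well images to process")
    parser.add_argument(
        "--test",
        action="store_true",
        help="run the integration test instead of processing data",
    )
    return parser.parse_args(argv)


def run_integration_test():
    """Placeholder: would run the pipeline on test wells and compare to the reference."""
    return 0


def main(argv=None):
    args = parse_args(argv)
    if args.test:
        # Short-circuit: no input directory is needed for the self-test.
        return run_integration_test()
    if args.input is None:
        raise SystemExit("--input is required unless --test is given")
    print(f"Processing {args.input}")
    return 0
```

With this layout, `main(["--test"])` runs only the self-test, while normal runs still require an input directory.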

jennyfolkesson · 2020-04-09T18:01:23Z

Sounds good. I will add another issue to create unit tests as well as continuous integration.

Is it ok to pick some wells from the recent directories? Is it ok to put them on github (make them public)?

And does anyone have any preferences regarding if we use Unittest or pytest?

mattersoflight · 2020-04-09T18:08:45Z

Yes, it is okay to host the images publicly. They are small enough to be hosted in the GitHub repository.
I suggest 3 each from two flu plates, the recent covid-19 plate, and the vanilla ELISA plate to start.

bryantChhun · 2020-04-14T17:33:01Z

> Is it ok to pick some wells from the recent directories? Is it ok to put them on github (make them public)?

When you say "put them on github", do you mean directly in this repo? Or is there another data hosting mechanism? We've kept testing data outside of the repo (gdrive, for example) and downloaded it during testing.

> And does anyone have any preferences regarding if we use Unittest or pytest?

We used pytest in reconstructOrder, but in the past I used unittest, so I will defer to others' preferences here.

jennyfolkesson · 2020-04-14T23:40:06Z

Yes, putting the images in the repo. Testing should be done with continuous integration on the CI server, so either you need to keep the test data in the repo or on a server from where they can be downloaded to the CI server every time a test is done (every time there's a push on GitHub). Since the repo itself is ~100MB and images are <5MB I think it's ok to put 3 images in the repo.
Here's a relevant discussion:
https://softwareengineering.stackexchange.com/questions/257881/should-test-data-be-checked-into-version-control

bryantChhun · 2020-04-14T23:55:30Z

I'm hesitant to include images in the repo, if only because we've done this before and found it can explode the size of the repo quickly and accidentally. Maybe it's a minor technical point, but what happens when you want an integration test to test multiple wells/images?

Our approach was to host the test data on a server -- google drive -- and provide those to the tests via the googledrivedownloader pypi package. I'm not sure how well this plays with CI like github actions.
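
A rough sketch of that download-on-demand pattern, with a local cache so the data is fetched only once per machine. `fetch_test_data` and `download_from_gdrive` are hypothetical helpers; the `googledrivedownloader` call reflects that package's documented usage as far as I know, and the file id is a placeholder that would need to be filled in.

```python
from pathlib import Path


def fetch_test_data(dest_dir, download):
    """Return dest_dir, calling download(dest) only if the directory is empty or missing."""
    dest = Path(dest_dir)
    if dest.is_dir() and any(dest.iterdir()):
        return dest  # already cached, skip the network round trip
    dest.mkdir(parents=True, exist_ok=True)
    download(dest)
    return dest


def download_from_gdrive(dest):
    # Assumption: googledrivedownloader is installed and exposes this call.
    # The import is local so the rest of the module works without the package.
    from google_drive_downloader import GoogleDriveDownloader as gdd

    gdd.download_file_from_google_drive(
        file_id="<FILE_ID>",  # placeholder, not a real id
        dest_path=str(dest / "test_data.zip"),
        unzip=True,
    )
```

A test would then call `fetch_test_data("tests/data", download_from_gdrive)` in its setup. Passing the downloader in as an argument keeps the caching logic testable without network access.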

@mattersoflight @smguo what do you think?

mattersoflight · 2020-04-15T16:44:44Z

I agree with @bryantChhun. I expect that we will update the test images as the data rolls in and the antigen array format evolves, which means the repo can end up with multiple versions of test data and become bloated. We did have to use BFG (https://www.phase2technology.com/blog/removing-large-files-git-bfg) on reconstruct-order.

mattersoflight · 2020-04-15T16:51:53Z

I looked up GitHub Actions and prefer that over Travis. It looks like one can trigger the integration test when there is a pull request into a branch and, once the integration test is merged into master, trigger the test when there is a PR for master.

(https://help.github.com/en/actions/reference/workflow-syntax-for-github-actions).

jennyfolkesson · 2020-04-15T21:01:33Z

I'm fine with keeping images on google drive as long as we only do integration tests when there's a PR, and do unit testing every time there's a push to github.
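
One possible way to enforce that split from inside the test suite, sketched with the standard library. It assumes the tests run under GitHub Actions, which sets the `GITHUB_EVENT_NAME` environment variable to `pull_request` for PR-triggered runs; `is_pull_request` and the test class are hypothetical names.

```python
import os
import unittest


def is_pull_request(env=None):
    """True when the run was triggered by a pull request on GitHub Actions."""
    env = os.environ if env is None else env
    return env.get("GITHUB_EVENT_NAME") == "pull_request"


class IntegrationTest(unittest.TestCase):
    @unittest.skipUnless(is_pull_request(), "integration tests run only on pull requests")
    def test_full_pipeline(self):
        # Placeholder for downloading test data and running the pipeline.
        self.assertTrue(True)
```

Unit tests without the decorator would still run on every push. pytest also collects `unittest.TestCase` classes and honors the skip, so this works with either framework.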

mattersoflight · 2020-04-26T02:22:29Z

I just realized I missed the question of which testing framework we should use. I suggest pytest, since it is the easiest and most succinct. We would like all contributors to write and maintain tests for the relevant bits of code.

jennyfolkesson · 2020-04-27T20:33:35Z

Just a note: as of a month ago, pytest went from 4-5 maintainers to 1 due to internal conflicts.
I think we should still go ahead with pytest. Hopefully they will resolve this and realize that kindness makes teamwork so much better.
https://adrin.info/open-source-coc-conflicts.html

mattersoflight mentioned this issue Apr 8, 2020

updated main script to argparse, started minor testing #37

Merged

smguo mentioned this issue Apr 15, 2020

Move notebooks to separate directory #36

Closed

lenafb closed this as completed Mar 11, 2022
