Split make_SILVA_132_16S_classifier into sequence extraction and training? #147

erikrikarddaniel · 2020-05-05T06:53:03Z

Sequence extraction takes forever on the UPPMAX cluster (4-5 days), and uses little memory, whereas the training step requires more memory (20-25 GiB) and takes much shorter (< 3h). To make it possible to set better cpu and memory limits for this, I would therefore suggest that we split this process into two: The first would run the currently first four steps (unzipping, qiime imports and read extraction) and the second would just be the training step.

d4straub · 2020-10-28T16:03:36Z

Is this still the case (1.1.2/dev)? The whole process make_SILVA_132_16S_classifier takes <3h for me.

erikrikarddaniel · 2020-10-29T21:07:35Z

Den ons 28 okt. 2020 17:03Daniel Strau <notifications@github.com> skrev:

Is this still the case (1.1.2/dev)? The whole process make_SILVA_132_16S_classifier takes <3h for me.

I should think so, but I haven't tried since I've been running a ready made classifier for a long time now. In any case, I can't see that it would hurt to make two processes. Emelie could do this. /D

…

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#147 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AALTHQHQJZYAJSBQHHDCQO3SNA6GZANCNFSM4MZKGYDA> .

d4straub · 2020-10-30T14:28:02Z

Sure, I have no reservations against splitting the process.

d4straub · 2020-11-26T13:42:19Z

This was solved in the linked PR.

erikrikarddaniel added enhancement New feature or request question Further information is requested labels May 5, 2020

erikrikarddaniel added this to the V1.2 Teal Bronze Lion milestone Aug 26, 2020

emnilsson self-assigned this Nov 5, 2020

emnilsson mentioned this issue Nov 24, 2020

Adding a second cutadapt step and splitting "make_classifier" in two #193

Merged

5 tasks

d4straub closed this as completed Nov 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split make_SILVA_132_16S_classifier into sequence extraction and training? #147

Split make_SILVA_132_16S_classifier into sequence extraction and training? #147

erikrikarddaniel commented May 5, 2020

d4straub commented Oct 28, 2020

erikrikarddaniel commented Oct 29, 2020 via email

d4straub commented Oct 30, 2020

d4straub commented Nov 26, 2020

Split make_SILVA_132_16S_classifier into sequence extraction and training? #147

Split make_SILVA_132_16S_classifier into sequence extraction and training? #147

Comments

erikrikarddaniel commented May 5, 2020

d4straub commented Oct 28, 2020

erikrikarddaniel commented Oct 29, 2020 via email

d4straub commented Oct 30, 2020

d4straub commented Nov 26, 2020