Parallel RFI filtering #35

ljgray · 2022-08-16T21:32:11Z

In its current implementation, the RFI Filter step of the daily pipeline takes ~25 minutes. Most of this time is spent taking a rolling median, and it is done in a single process. This PR should address the issue by pre-calculating a median across frequencies to flatten the auto vis. data, then distributing across the frequency axis to distribute the computation across processes.

ch_util/tools.py

ch_util/rfi.py

ljgray · 2022-08-16T23:30:16Z

Although the products aren't quite right yet, this reduces the RFIFilter step down to ~15 seconds in a daily pipeline run

ljgray · 2022-08-17T22:21:03Z

Depends on radiocosmology/caput#211

ljgray · 2022-08-18T00:04:57Z

Interestingly, the mask being produced is different from that produced by the previous method.

ljgray · 2022-08-19T18:22:01Z

@jrs65 this produces the same mask now as previous revisions, so it should be good to go unless there are additional changes.

jrs65

Approved, but please quickly change the comment, and condense the commits (if it makes sense to do so), before merging.

jrs65 · 2022-09-06T20:54:16Z

ch_util/rfi.py

+    limit_range : slice, optional
+        Data is limited to this range in the freqeuncy axis. Defaults to Ellipsis.


Maybe change the comment as the default is not actually an Ellipsis (even though it's effectively the same).

Ah, good catch, that's a remnant of a previous way I had tried implementing it

Running number_deviations across inputs causes the entire operation to take place on a single process. This is modified to be distributed across many nodes.

ljgray force-pushed the ljg/parallel-rfi branch from 59d9ded to 7625cf1 Compare August 16, 2022 21:46

ljgray requested review from jrs65 and tristpinsm and removed request for tristpinsm August 16, 2022 21:53

jrs65 requested changes Aug 16, 2022

View reviewed changes

ljgray force-pushed the ljg/parallel-rfi branch 4 times, most recently from 01aa599 to bba1a5a Compare August 17, 2022 22:19

ljgray force-pushed the ljg/parallel-rfi branch 5 times, most recently from 6b216ed to cb8b593 Compare August 17, 2022 23:43

ljgray requested a review from jrs65 August 18, 2022 00:04

ljgray marked this pull request as ready for review August 18, 2022 00:04

jrs65 previously approved these changes Sep 6, 2022

View reviewed changes

ljgray dismissed jrs65’s stale review via e8d8605 September 6, 2022 21:13

ljgray force-pushed the ljg/parallel-rfi branch from 21a1974 to e8d8605 Compare September 6, 2022 21:13

ljgray added 3 commits September 6, 2022 14:28

feat(rfi): rework number_deviations to be distributed across frequency.

f804f7c

Running number_deviations across inputs causes the entire operation to take place on a single process. This is modified to be distributed across many nodes.

feat(tools): update invert_no_zero to work with MPIArrays

796564d

feat(rfi): add frequency bounds to mad_cut_rolling

2fe5c6d

ljgray force-pushed the ljg/parallel-rfi branch from e8d8605 to 2fe5c6d Compare September 6, 2022 21:29

ljgray merged commit b74fcd5 into master Sep 6, 2022

ljgray deleted the ljg/parallel-rfi branch September 6, 2022 21:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel RFI filtering #35

Parallel RFI filtering #35

ljgray commented Aug 16, 2022

ljgray commented Aug 16, 2022

ljgray commented Aug 17, 2022

ljgray commented Aug 18, 2022 •

edited

Loading

ljgray commented Aug 19, 2022

jrs65 left a comment

jrs65 Sep 6, 2022

ljgray Sep 6, 2022

		limit_range : slice, optional
		Data is limited to this range in the freqeuncy axis. Defaults to Ellipsis.

Parallel RFI filtering #35

Parallel RFI filtering #35

Conversation

ljgray commented Aug 16, 2022

ljgray commented Aug 16, 2022

ljgray commented Aug 17, 2022

ljgray commented Aug 18, 2022 • edited Loading

ljgray commented Aug 19, 2022

jrs65 left a comment

Choose a reason for hiding this comment

jrs65 Sep 6, 2022

Choose a reason for hiding this comment

ljgray Sep 6, 2022

Choose a reason for hiding this comment

ljgray commented Aug 18, 2022 •

edited

Loading