Add error handling to iterators #1243

adaszko · 2020-06-04T14:04:54Z

Before this PR, iterators would simply stop (i.e. produce None) when they hit an error such as an IO error. With this PR, they yield a proper error instead.
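To illustrate the change described above, here is a minimal sketch on toy types. `ToyStore`, `SilentIter` and `ReportingIter` are hypothetical names made up for this example and do not come from the PR; the real iterators read from the beacon node's store.

```rust
#[derive(Debug, PartialEq)]
pub struct StoreError(pub String);

/// Hypothetical store: holds a value for each slot in `0..len`
/// and fails when asked for `fail_at`.
pub struct ToyStore {
    pub len: u64,
    pub fail_at: Option<u64>,
}

impl ToyStore {
    pub fn get(&self, slot: u64) -> Result<Option<u64>, StoreError> {
        if Some(slot) == self.fail_at {
            return Err(StoreError(format!("read failed at slot {}", slot)));
        }
        Ok(if slot < self.len { Some(slot * 10) } else { None })
    }
}

/// Old style: a read error is indistinguishable from the end of the data.
pub struct SilentIter<'a> {
    pub store: &'a ToyStore,
    pub slot: u64,
}

impl<'a> Iterator for SilentIter<'a> {
    type Item = u64;

    fn next(&mut self) -> Option<u64> {
        // `.ok()` throws the error away, so the caller just sees `None`.
        let value = self.store.get(self.slot).ok().flatten()?;
        self.slot += 1;
        Some(value)
    }
}

/// New style: the error is surfaced as an item of the iterator.
pub struct ReportingIter<'a> {
    pub store: &'a ToyStore,
    pub slot: u64,
}

impl<'a> Iterator for ReportingIter<'a> {
    type Item = Result<u64, StoreError>;

    fn next(&mut self) -> Option<Self::Item> {
        // `Result<Option<T>, E>` becomes `Option<Result<T, E>>`.
        let item = self.store.get(self.slot).transpose()?;
        self.slot += 1;
        Some(item)
    }
}
```

With a store that fails at slot 2, `SilentIter` yields two values and then ends as if nothing had gone wrong, while collecting `ReportingIter` into a `Result<Vec<_>, _>` returns the error.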

paulhauner

Looks good to me.

In the future we'll have to keep an eye out and make sure we don't let people do collect::<Vec<Result<_, _>>>(), so they don't end up in an infinite loop. I can't think of a way to protect against this at the type level. This risk seems like a reasonable cost in order to ensure we don't suppress DB errors.

beacon_node/beacon_chain/src/beacon_chain.rs

beacon_node/store/src/forwards_iter.rs

michaelsproul

Nice work!

Just a few minor changes requested from me.

Regarding Paul's point about infinite iteration, I think we discussed this previously and decided that collect will short-circuit on the first Err?

michaelsproul · 2020-06-05T02:05:35Z

beacon_node/beacon_chain/src/beacon_chain.rs

+            .find(|result| match result {
+                Ok((_, slot)) => *slot == target_slot,
+                Err(_) => true,
+            })


It might be cool if we could abstract this in an iterator extension trait like find_result(|slot| *slot == target_slot), but I'm happy to leave that for a future PR. Similarly for take_while.
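One possible shape for such an extension trait is sketched below. This is an assumption about what `find_result` could look like, not code from the PR: the trait name `FindResult`, the method signature, and the choice to return `Result<Option<T>, E>` are all hypothetical.

```rust
/// Hypothetical extension trait for iterators over `Result<T, E>`.
pub trait FindResult<T, E>: Iterator<Item = Result<T, E>> + Sized {
    /// Returns the first `Ok` value matching `predicate`,
    /// or the first `Err` encountered before a match.
    fn find_result<P>(&mut self, mut predicate: P) -> Result<Option<T>, E>
    where
        P: FnMut(&T) -> bool,
    {
        for item in self {
            // Stop at the first error instead of treating it as a match.
            let value = item?;
            if predicate(&value) {
                return Ok(Some(value));
            }
        }
        Ok(None)
    }
}

// Blanket implementation for every iterator whose items are `Result`s.
impl<I, T, E> FindResult<T, E> for I where I: Iterator<Item = Result<T, E>> {}
```

A call site would then read `iter.find_result(|(_, slot)| *slot == target_slot)?`, with the error carried in the return type rather than smuggled through the predicate.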

I think hiding Err(_) => true behind a function call is going to lead to problems. Returning true from the find() predicate signifies that the value has been found, and Err(_) is almost never the value being searched for.

Approaching the abstraction from a slightly different angle can produce slightly cleaner code. That angle is itertools::process_results. For instance, translating the piece of code above:

use itertools::process_results;

pub fn root_at_slot(&self, target_slot: Slot) -> Result<Option<Hash256>, Error> {
    process_results(self.rev_iter_state_roots()?, |mut iter| {
        iter.find(|(_, slot)| *slot == target_slot)
            .map(|(root, _)| root)
    })
}

It has the advantage that you can write your iterator pipeline as if there were no Errs in it, and you get rid of the .transpose() call. The assumption is that the pipeline gets 'folded' into a single value rather than into another iterator.
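To make the mechanism concrete, here is a std-only sketch of the idea behind process_results. It is not the itertools implementation: `StopOnErr` and `process_results_sketch` are hypothetical names, and the real function accepts any `IntoIterator`. The adapter hands the closure plain values, stops at the first `Err`, and remembers it so that it can be returned once the closure finishes.

```rust
/// Hypothetical adapter: yields the `Ok` values of the inner iterator and
/// stops at the first `Err`, stashing it in `error`.
pub struct StopOnErr<'a, I, E> {
    iter: I,
    error: &'a mut Option<E>,
}

impl<'a, I, T, E> Iterator for StopOnErr<'a, I, E>
where
    I: Iterator<Item = Result<T, E>>,
{
    type Item = T;

    fn next(&mut self) -> Option<T> {
        match self.iter.next() {
            Some(Ok(value)) => Some(value),
            Some(Err(e)) => {
                // Record the error and pretend the stream has ended.
                *self.error = Some(e);
                None
            }
            None => None,
        }
    }
}

/// Runs `f` over the error-free view of `iter`. Returns `Err` if the
/// iterator produced an error while `f` was consuming it.
pub fn process_results_sketch<I, T, E, R, F>(iter: I, f: F) -> Result<R, E>
where
    I: Iterator<Item = Result<T, E>>,
    F: FnOnce(StopOnErr<'_, I, E>) -> R,
{
    let mut error = None;
    let output = f(StopOnErr { iter, error: &mut error });
    match error {
        Some(e) => Err(e),
        None => Ok(output),
    }
}
```

Because the closure returns a plain value `R`, the result has to be computed inside the closure. That is the 'folded into a single value' assumption mentioned above: an iterator returned from the closure could not report errors it meets later.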

Another example with take_while() for comparison:

Before:

let state_root = self
    .rev_iter_state_roots()?
    .take_while(|result| match result {
        Ok((_, current_slot)) => *current_slot >= slot,
        Err(_) => true,
    })
    .find(|result| match result {
        Ok((_, current_slot)) => *current_slot == slot,
        Err(_) => true,
    })
    .transpose()?
    .map(|(root, _slot)| root)
    .ok_or_else(|| Error::NoStateForSlot(slot))?;

After:

let state_root = process_results(self.rev_iter_state_roots()?, |iter| {
    iter.take_while(|(_, current_slot)| *current_slot >= slot)
        .find(|(_, current_slot)| *current_slot == slot)
        .map(|(root, _slot)| root)
})?
.ok_or_else(|| Error::NoStateForSlot(slot))?;
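For anyone who wants to experiment with the 'Before' pattern without the rest of the codebase, the sketch below lifts it into a free function over toy data. The function name `state_root_before`, the `u8` roots, the `u64` slots and the two-variant `Error` enum are stand-ins invented for this example.

```rust
#[derive(Debug, PartialEq)]
pub enum Error {
    Db(String),
    NoStateForSlot(u64),
}

/// Toy version of the manual pattern: every adapter has to let `Err`
/// through so that `transpose()?` can surface it at the end.
pub fn state_root_before<I>(iter: I, slot: u64) -> Result<u8, Error>
where
    I: Iterator<Item = Result<(u8, u64), Error>>,
{
    iter.take_while(|result| match result {
        Ok((_, current_slot)) => *current_slot >= slot,
        // Keep errors in the stream rather than ending iteration on them.
        Err(_) => true,
    })
    .find(|result| match result {
        Ok((_, current_slot)) => *current_slot == slot,
        // "Find" the error so that it is returned to the caller.
        Err(_) => true,
    })
    .transpose()?
    .map(|(root, _slot)| root)
    .ok_or_else(|| Error::NoStateForSlot(slot))
}
```

Feeding it a reverse-ordered list with an `Err` in the middle returns that error whenever the error is reached before the target slot, which is the behaviour both versions are meant to share.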

michaelsproul · 2020-06-05T02:09:48Z

beacon_node/store/src/forwards_iter.rs

+        let result = Self { values: values };
+        Ok(result)


Could just be Ok(Self { values })

You dislike having unnecessary local variables, but they do come in handy during debugging, where you generally can't evaluate arbitrary Rust expressions to see what's going on. I've applied your remark nonetheless.

That's true, I normally refactor during debugging, or use dbg!
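As a small illustration of the dbg! approach: the macro prints its argument to stderr along with the file and line, then returns the value, so it can wrap an expression in place without introducing a local. The `ValuesHolder` type below is a hypothetical stand-in, not the struct from forwards_iter.rs.

```rust
#[derive(Debug)]
pub struct ValuesHolder {
    pub values: Vec<u64>,
}

impl ValuesHolder {
    pub fn new(values: Vec<u64>) -> Result<Self, String> {
        // Field init shorthand (`values` instead of `values: values`),
        // wrapped in `dbg!` for temporary inspection while debugging.
        Ok(dbg!(Self { values }))
    }
}
```

Removing the `dbg!(...)` wrapper afterwards leaves the concise `Ok(Self { values })` form.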

beacon_node/store/src/forwards_iter.rs

beacon_node/store/src/iter.rs

adaszko · 2020-06-05T12:56:22Z

Regarding Paul's point about infinite iteration, I think we discussed this previously and decided that collect will short-circuit on the first Err?

That's right! @paulhauner #1132 (comment)

adaszko · 2020-06-05T14:18:20Z

FYI, those currently failing tests pass for me on macOS. I will have to run them on my Linux machine.

paulhauner · 2020-06-08T21:21:10Z

Regarding Paul's point about infinite iteration, I think we discussed this previously and decided that collect will short-circuit on the first Err?

It depends on the type that you collect into. See this playground for an example: https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=27415619b28bdad42c3f70f573561da5
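The difference can be reproduced with a small std-only sketch. This is not the content of the linked playground; the `source` helper and its counter are made up for this example. The counter records how many items each `collect` pulls from the underlying iterator.

```rust
use std::cell::Cell;

/// Hypothetical source: yields five items, the third of which is an error.
/// `pulled` counts how many times the iterator was advanced.
pub fn source<'a>(pulled: &'a Cell<usize>) -> impl Iterator<Item = Result<u32, String>> + 'a {
    (0..5u32).map(move |i| {
        pulled.set(pulled.get() + 1);
        if i == 2 {
            Err(format!("db error at {}", i))
        } else {
            Ok(i)
        }
    })
}

/// Collecting into `Vec<Result<_, _>>` keeps going past the error.
pub fn collect_all(pulled: &Cell<usize>) -> Vec<Result<u32, String>> {
    source(pulled).collect()
}

/// Collecting into `Result<Vec<_>, _>` stops at the first error.
pub fn collect_short_circuit(pulled: &Cell<usize>) -> Result<Vec<u32>, String> {
    source(pulled).collect()
}
```

Here `collect_all` advances the source five times and stores the error as just another element, while `collect_short_circuit` stops after the third item. With an iterator that keeps returning `Err` forever, the first form would never terminate.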

AgeManning

I've had a skim over this and it looks fine to me. I'll defer a more thorough review to @paulhauner and @michaelsproul who know these sections of code better.

michaelsproul

A beautiful application of process_results, it's perfect for this! And it addresses the short-circuit dilemma as well!

Happy to merge once conflicts are addressed 👍

* Add error handling to iterators
* Review feedback
* Leverage itertools::process_results() in few places

adaszko requested review from AgeManning, michaelsproul and paulhauner June 4, 2020 14:07

paulhauner approved these changes Jun 5, 2020

beacon_node/beacon_chain/src/beacon_chain.rs (outdated, resolved)

beacon_node/store/src/forwards_iter.rs (outdated, resolved)

michaelsproul requested changes Jun 5, 2020

adaszko force-pushed the master branch 2 times, most recently from 2ee6f9f to b368c8d Compare June 5, 2020 13:32

adaszko added the waiting-on-author The reviewer has suggested changes and awaits their implementation. label Jun 5, 2020

AgeManning approved these changes Jun 9, 2020

michaelsproul approved these changes Jun 9, 2020

adaszko added 2 commits June 9, 2020 09:12

Add error handling to iterators

ca5e851

Review feedback

e0cdc6b

adaszko force-pushed the master branch 3 times, most recently from 96f34ab to 213ddab Compare June 9, 2020 11:43

adaszko added ready-to-squerge and removed waiting-on-author The reviewer has suggested changes and awaits their implementation. labels Jun 9, 2020

Leverage itertools::process_results() in few places

9312fd7

adaszko force-pushed the master branch from 213ddab to 9312fd7 Compare June 9, 2020 17:40

michaelsproul merged commit 7f036a6 into sigp:master Jun 9, 2020

AgeManning pushed a commit that referenced this pull request Jun 19, 2020

Add error handling to iterators (#1243)

c32d0ba

* Add error handling to iterators
* Review feedback
* Leverage itertools::process_results() in few places

adaszko mentioned this pull request Jul 6, 2020

Handle errors in iterators #1002

Closed
