Free the fork choice #666

ralexstokes · 2019-05-24T20:17:18Z

EDIT: update description to reflect current state of PR, may be helpful to start w/ Summary of changes below

What was wrong?

Big part of implementing LMG GHOST fork choice.

Pulling this out of #532 as it is substantial enough to stand on its own.

In Eth1, the fork scoring function is just a function of the block header. This fact makes it easy to calculate the score as we import new headers, adjusting the canonical head along the way.

In Eth2, the fork scoring function is more complex (a function of the block, the state, and off-chain attestations); moreover, we want to run the fork choice in more situations than just importing a new valid block.

The current design of the fork choice calculation does not readily facilitate these tasks.

How was it fixed?

Summary of changes:

Extend API of persist_block to allow arbitrary scoring
Add set_score API (not technically required for this PR but will need for future fork choice work. easier to include here than patch up the git history... forgive me for the lack of precision :) )
Update the tests!

To address these limitations, we allow the caller of persist_block to provide their own scoring logic.
The DB class now exposes methods to persist the fork choice but does not actively compute it (at least via the new API persist_block_without_scoring). The particular fork choice lives to a particular variation of the StateMachine. For example, the SerenityStateMachine will have the lmd_ghost_fork_scoring rule after it lands in a subsequent PR. For now, they just use the existing higher_slot_scoring rule.

Rationale for putting the fork choice scoring in the StateMachine and using it in the Chain class

The Chain class is what ultimately orchestrates validating incoming data, persisting valid data, running the fork choice (based on the current state machine's scoring rule) and persisting the results of this fork choice (i.e. writing the new block's score and updating the canonical head if necessary).

This configuration is more flexible, exposing a direct path to inject the AttestationPool necessary for LMD GHOST, and avoids the situation where the DB knows about every other object in the system (single responsibility and all that).

I have some concerns mainly around the new API(s). It might be dangerous to expose something like set_score for anyone to call and update_canonical_head_if_needed feels a little clunky still... I would suggest we merge this in and as we gain confidence w/ the new stuff, we can look at deprecating the old stuff (namely, persist_block and its callers).

Other comments

We could make the argument to pull the actual process of selecting which fork to follow (given a "score chain") but this seems low value given that we can map any fork choice to "follow the monotonically increasing score" by warping the fork choice scoring function).

There is another concern around "overloading" the state machine w/ the fork choice -- it seems like the best place to put it for now but readers should note that the state machine is no longer just a transformation from block to block. To justify this change, let's answer the question: if we change the fork choice (at some slot), does that imply a new state machine?

To-Do

I'll add more tests but given there are some design questions up in the air, I'll get this up so you can make a pass at reviewing it if you want...

Add more tests to cover new APIs

Cute Animal Picture

ralexstokes · 2019-05-28T22:00:43Z

@hwwhww given that you mentioned you would make a pass at this -- another option here is to keep
persist_block and just add another parameter which is the fork choice.

EDIT: scratch this -- we still have the issue of tying the fork choice to the block import w/ this route... readers should ignore this comment, the problem that was here has been resolved in this PR :)

~~so persist_block(block, block_class) becomes persist_block(block, block_class, fork_choice_scoring)~~

as i'm writing these tests, i'm having trouble thinking of cases where you would really want to persist a block w/o scoring it and finding the new head... the goal w/ this entire stream of work is so that we can execute arbitrary logic when running the fork choice.

one way to do this is how this PR currently is (expose the pieces and let the client of this code 'deal w/ it'). the other way i'm alluding to in this comment is that we just pass in this code as the fork_choice_scoring.

~~e..g for LMD GHOST, something like:~~

def lmd_ghost_fork_choice_scoring(attestation_pool):
    return lambda block, chaindb: run_lmd_ghost(block, chaindb, attestation_pool)


# something using a `ChainDB`:
scoring = lmd_ghost_fork_choice_scoring(get_attestation_pool())
new_blocks, old_blocks = chaindb.persist_block(block, block.__class__, scoring)

~~you have a preference for one way or the other? i think this latest way avoids some of the security/usability concerns of the way that this PR currently proposes...~~

ralexstokes · 2019-05-28T23:34:37Z

tests/eth2/core/beacon/db/test_beacon_chaindb.py

    return chaindb


-@pytest.fixture(params=[0, 10, 999])
+@pytest.fixture(params=[1, 10, 999])
 def block(request, sample_beacon_block_params):


to not conflict w/ genesis block

ralexstokes · 2019-05-28T23:40:53Z

ok! this is a little messy as it tracked my thinking about the best way to get towards a more flexible fork choice :)

after I rebase, it should be ready to merge pending review. the tests all pass locally but circleCI is down at the moment.

In particular, it does not make sense to persist a block without running the fork choice. We will need some method to update the canonical head when running the fork choice outside of importing a block, but let's prefer to handle that in a downstream PR.

pipermerriam

I won't claim this to be a thorough review, but at a conceptual level, this seems very appropriate. I think cleaning up the eth1 model to use this pattern which I think qualifies as composition indeed does make sense.

ralexstokes · 2019-05-29T23:35:22Z

just to update, this is officially ready for review now :)

ChihChengLiang · 2019-05-30T07:50:52Z

eth2/beacon/state_machines/forks/serenity/__init__.py


    # methods
    @staticmethod
    def create_block_from_parent(parent_block: BaseBeaconBlock,
                                 block_params: FromBlockParams) -> BaseBeaconBlock:
        return create_serenity_block_from_parent(parent_block, block_params)
+
+    def get_fork_choice_scoring(self) -> ForkChoiceScoring:
+        return higher_slot_scoring


How would the lmd_ghost looks like in this method?
Does the method produce new functions for every slot and block?

My imagination:

def get_fork_choice_scoring(self) -> ForkChoiceScoring: def scoring(block): return lmd_ghost_scoring(db=self.chaindb.db, attestation_pool=somewhere.attestation_pool, state=self.state, start_block=somewhere.start_block, target_block=block, config=self.config) return scoring

yeah more or less! modulo those parameters changing, e.g. i don't think the StateMachine has a reference directly to the chain_db and similar for the other dependencies...

i'll think a bit about the best way to keep a unified function interface while still giving us flexibility...

there is also a concern about how often this closure is created -- it would be possible to memoize this function, esp if the attestation pool is a reference to the mutable thing, not just a description of attestations -- this route generally goes against our preference for immutability though so i'd be inclined to find something more, well, immutable :)

@ChihChengLiang latest thinking here:

08523d3

ec29a72

like i say in the PR, i'm still chewing on a clean way to further modularize the fork choice so it is not so tightly bound to a particular StateMachine.

hwwhww

Sorry for the late review!

Rationale for putting the fork choice scoring in the StateMachine and using it in the Chain class

if we change the fork choice (at some slot), does that imply a new state machine?

My two cents:

Agreed that it’s not ideal to embed the fork choice scoring in BeaconChain!
However, regarding the efficiency, at this point I’m not sure if the fork choice scoring upgrade would be necessary for the future. (Comparing to eth1 fork choice rule didn’t change much).
- If we are certain that the fork choice rule upgrade is exactly as the StateMachine (forks) upgrade boundary, we can make sure that in each batch is using the same fork choice rule scoring function (based on the TBD wire protocol), so that we may be able to initialize less state_machines during syncing.
state_machine.get_fork_choice_scoring() function refactoring: instead of returning a Callable, it seems similar to how eth1 eth/vm/forks/ overrides the static methods. For example, the compute_difficulty in Frontier and in HomesteadVM. I think we can utilize this pattern in SerenityStateMachine. Defer to the snake charmers on which pattern is more appropriate in this case. :)

hwwhww · 2019-05-31T08:16:32Z

eth2/beacon/fork_choice/fork_choice_scoring.py

+
+from eth2.beacon.types.blocks import BaseBeaconBlock
+
+ForkChoiceScoring = Callable[[BaseBeaconBlock], int]


What do you think about naming it ForkChoiceScoringFn?

yeah appending Fn seems ok

my thinking was that earlier in this process I wasn't sure this abstraction would just be a Callable.... the ForkChoice could evolve so that it is more convenient to have more state, in which case we could just go ahead w/ a class and ForkChoiceScoring would become an abstract class (for the interface)...

but i think we can get away w/ just a Callable so let's do that for now

tests/core/p2p-proto/bcc/helpers.py

ralexstokes · 2019-06-01T04:44:09Z

* `state_machine.get_fork_choice_scoring()` function refactoring: instead of returning a `Callable`, it seems similar to how eth1 `eth/vm/forks/` overrides the static methods. For example, the `compute_difficulty` [in `Frontier`](https://github.com/ethereum/py-evm/blob/915cec2a475176ad9722869e845dca8bac7a66d8/eth/vm/forks/frontier/__init__.py#L75) and [in `HomesteadVM`](https://github.com/ethereum/py-evm/blob/915cec2a475176ad9722869e845dca8bac7a66d8/eth/vm/forks/homestead/__init__.py#L39). I think we can utilize this pattern in `SerenityStateMachine`. Defer to the snake charmers on which pattern is more appropriate in this case. :)

@hwwhww yeah i went down this route before I landed on the current thing but decided to do this mainly because I ran into issues w/ mypy. and so I just looked at the links you added for the eth1 stuff and guess what... they just # type: ignore.

my thinking was to just do the simple thing to keep moving :)

the one caveat is that w/ LMD GHOST there is basically a "setup" step where we need to fetch the latest justified state and it seems much more ergonomic as an instance method... i'll refer you here: https://github.com/ethereum/trinity/pull/685/files#diff-26594f899e3bcc74e868bbab3824d8f9R46

ralexstokes · 2019-06-01T04:45:32Z

unless anyone has further comment or concern, i'll go ahead and merge this soon... i'll address #666 (comment) in #685 which builds off of this PR

hwwhww · 2019-06-01T04:57:21Z

@ralexstokes

issues w/ mypy

Right, I recalled it was really weird: ethereum/py-evm#1362 (comment) :'(

i'll go ahead and merge this soon... i'll address #666 (comment) in #685 which builds off of this PR

Sounds good to me!

ralexstokes force-pushed the free-the-fork-choice branch from ab3da5a to aadf358 Compare May 24, 2019 20:22

ralexstokes force-pushed the free-the-fork-choice branch from 23373a6 to 8242407 Compare May 28, 2019 23:30

ralexstokes marked this pull request as ready for review May 28, 2019 23:31

ralexstokes commented May 28, 2019

View reviewed changes

ralexstokes added 12 commits May 28, 2019 16:43

Grammar, make singular for one block

9b4b668

reorder property names

bcf789a

Add module for configurable fork choice

31666fe

Add fork_choice_scoring to the state machine

151575c

Expose methods to run fork choice outside DB class

b08fcbf

Refactor chain to manually run the fork choice

9f3d5c4

Ensure function properties are staticmethods

1dd8a81

Prefer simpler inheritance infrastructure to satisfy mypy

3f19e24

Add test for higher_slot fork choice scoring

2e6adc5

Enhance legibility of test

2d27660

Add test for ChainDB.set_score

5ea9caa

ralexstokes force-pushed the free-the-fork-choice branch from 8242407 to 40538dd Compare May 28, 2019 23:45

pipermerriam approved these changes May 29, 2019

View reviewed changes

ralexstokes force-pushed the free-the-fork-choice branch 6 times, most recently from 7e09ea0 to e0b8f3b Compare May 29, 2019 22:18

ralexstokes added 2 commits May 29, 2019 15:33

Allow a caller of persist_block to provide their own fork choice

27fa0e5

reorder imports

d5f9994

ralexstokes force-pushed the free-the-fork-choice branch from e0b8f3b to f67a99d Compare May 29, 2019 22:33

Update sync and test to accommodate new fork choice interface

7cb4903

ralexstokes force-pushed the free-the-fork-choice branch from f67a99d to 7cb4903 Compare May 29, 2019 23:24

ChihChengLiang reviewed May 30, 2019

View reviewed changes

ralexstokes mentioned this pull request May 31, 2019

Add basic LMD GHOST fork choice #685

Merged

3 tasks

hwwhww reviewed May 31, 2019

View reviewed changes

hwwhww approved these changes Jun 1, 2019

View reviewed changes

ChihChengLiang approved these changes Jun 1, 2019

View reviewed changes

NIC619 approved these changes Jun 2, 2019

View reviewed changes

ralexstokes merged commit 9fe8fa8 into ethereum:master Jun 3, 2019

ralexstokes deleted the free-the-fork-choice branch June 3, 2019 21:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Free the fork choice #666

Free the fork choice #666

ralexstokes commented May 24, 2019 •

edited

Loading

ralexstokes commented May 28, 2019 •

edited

Loading

ralexstokes May 28, 2019

ralexstokes commented May 28, 2019

pipermerriam left a comment

ralexstokes commented May 29, 2019

ChihChengLiang May 30, 2019

ralexstokes May 30, 2019

ralexstokes May 31, 2019

hwwhww left a comment •

edited

Loading

hwwhww May 31, 2019

ralexstokes Jun 1, 2019

ralexstokes Jun 1, 2019

ralexstokes commented Jun 1, 2019 •

edited

Loading

ralexstokes commented Jun 1, 2019

hwwhww commented Jun 1, 2019


		from eth2.beacon.types.blocks import BaseBeaconBlock

		ForkChoiceScoring = Callable[[BaseBeaconBlock], int]

Free the fork choice #666

Free the fork choice #666

Conversation

ralexstokes commented May 24, 2019 • edited Loading

What was wrong?

How was it fixed?

Rationale for putting the fork choice scoring in the StateMachine and using it in the Chain class

Other comments

To-Do

Cute Animal Picture

ralexstokes commented May 28, 2019 • edited Loading

ralexstokes May 28, 2019

Choose a reason for hiding this comment

ralexstokes commented May 28, 2019

pipermerriam left a comment

Choose a reason for hiding this comment

ralexstokes commented May 29, 2019

ChihChengLiang May 30, 2019

Choose a reason for hiding this comment

ralexstokes May 30, 2019

Choose a reason for hiding this comment

ralexstokes May 31, 2019

Choose a reason for hiding this comment

hwwhww left a comment • edited Loading

Choose a reason for hiding this comment

hwwhww May 31, 2019

Choose a reason for hiding this comment

ralexstokes Jun 1, 2019

Choose a reason for hiding this comment

ralexstokes Jun 1, 2019

Choose a reason for hiding this comment

ralexstokes commented Jun 1, 2019 • edited Loading

ralexstokes commented Jun 1, 2019

hwwhww commented Jun 1, 2019

ralexstokes commented May 24, 2019 •

edited

Loading

ralexstokes commented May 28, 2019 •

edited

Loading

hwwhww left a comment •

edited

Loading

ralexstokes commented Jun 1, 2019 •

edited

Loading