Add daily scheduled rewards #131

Merged: 67 commits merged into main on Jun 12, 2024
Conversation

@juliangruber (Member) commented Jun 6, 2024

  • Expose scheduled rewards
  • Test API
  • Find all participant addresses
  • Observe scheduled rewards

Blocked by #102
For filecoin-station/desktop#1552

@juliangruber changed the title from "Add participant scheduled rewards" to "Add daily scheduled rewards" on Jun 6, 2024
@juliangruber (Member, Author):

@bajtos do you have an idea how to best test this? I would suggest passing a mocked IE contract and then querying whether scheduled rewards have been recorded in the DB.
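
A minimal sketch of such a test, assuming a mocha-style runner, a test database with migrations applied, and a hypothetical observeScheduledRewards export (the names and import path are placeholders, not an existing API):

import assert from 'node:assert'
import pg from 'pg'
import { observeScheduledRewards } from '../lib/observer.js' // hypothetical export

it('records scheduled rewards for participants', async () => {
  // mock only the contract method the observer calls
  const ieContractMock = {
    rewardsScheduledFor: async (address) => 100n
  }
  const pgPool = new pg.Pool({ connectionString: process.env.DATABASE_URL })
  await observeScheduledRewards(pgPool, ieContractMock)
  const { rows } = await pgPool.query(
    'SELECT address, scheduled_rewards FROM daily_scheduled_rewards'
  )
  assert.strictEqual(rows[0].scheduled_rewards, '100')
  await pgPool.end()
})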

@juliangruber (Member, Author) commented Jun 7, 2024

Another question is how to get the participant addresses.

Query the contract

When we query the contract, we only get the participants with scheduled rewards over 0.1 FIL. That is insufficient for plotting a participant's progress towards the threshold.

Add to spark-api/evaluate

Two potential sources for participants are spark-api, where we see the addresses of all participants, and spark-evaluate, where we see the addresses of all participants with accepted measurements. I would prefer not to add the logic needed for this PR to these repos, because to me they are unrelated concerns. I would like to add code only to spark-stats; if we do add code to spark-api, it should be generic enough to let arbitrary services consume this data. No changes to spark-{api,evaluate} should be necessary if we later decide to change the semantics of "participants".

But how should these services expose the participants in a way that is not opinionated? (E.g. after which period of inactivity should a participant be removed from the list? Should the list be queryable by date? Etc.)

Query setScores() calls

Another option is querying block explorers for setScores() calls, which also include the participant addresses with accepted measurements. This should work similarly to listening for transfer events (#102). However, it makes us dependent on third-party APIs (like Beryx) to expose this data; unfortunately, the contract itself can't expose it.

Lazy approach

Yet another option is the lazy approach: when a participant's scheduled rewards are requested for the first time, we add that participant to the list. This has the upside of the least architectural cost, but the downside that you won't see historic rewards from before your first request (read: before you first opened Station Desktop). This is likely unacceptable.

Consume raw measurements

Another contract-based option is to listen for MeasurementsAdded events, then download the raw measurements from web3.storage, and extract the participant addresses. This should be simple to implement, but it's quite clumsy.
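
A rough sketch of that pipeline (the event signature, the measurement format, and the contract constants here are assumptions; check the deployed ABI):

import { ethers } from 'ethers'

const provider = new ethers.JsonRpcProvider('https://api.node.glif.io/rpc/v1')
// IE_CONTRACT_ADDRESS and IE_CONTRACT_ABI are placeholders for the deployed contract
const ieContract = new ethers.Contract(IE_CONTRACT_ADDRESS, IE_CONTRACT_ABI, provider)

ieContract.on('MeasurementsAdded', async (cid) => {
  // download the raw measurements from a public web3.storage gateway
  const res = await fetch(`https://${cid}.ipfs.w3s.link`)
  const measurements = await res.json()
  // each measurement is assumed to carry the participant's address
  const addresses = new Set(measurements.map(m => m.participantAddress))
  // ...persist the addresses for the scheduled-rewards observer to use
})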

Add contract event

Another option is to add an event to the IE smart contract itself, which will post evaluation results to the chain. This comes at the development cost of deploying a new contract.


There are probably other options here too. @bajtos @patrickwoodhead what do you think?

@juliangruber (Member, Author) commented Jun 7, 2024

I think spark-stats should read this information from the smart contract. The contract is the piece other services should interact with as much as possible, because it lives on the chain and can therefore be trusted.

That means, spark-stats will either:

  • listen for MeasurementsAdded events, download measurements via IPFS, and extract the participant addresses
  • query Beryx for setScores() calls, and extract the participant addresses
  • listen for a new ScoresSet event, and extract the participant addresses, after we deployed a new IE contract with this feature

My thoughts on these options:

  • listen for MeasurementsAdded events, download measurements via IPFS, and extract the participant addresses

Downloading measurements from web3.storage often fails. At the moment only spark-evaluate does this; we would now have a second service that can fail because of it. It's a known problem, but we'd also be investing more into it.

It will double our web3.storage egress traffic. I think that shouldn't be an issue, but it's wasteful.

This will give spark-stats the addresses of all participants, regardless of whether their measurements were accepted. I think that shouldn't be an issue.

This option is quite easy to implement and shouldn't take more than 1 day.

  • query Beryx for setScores() calls, and extract the participant addresses

This couples us to Beryx; if they decide to change their API, we have to hope we can find another one that works.

I'm currently evaluating whether Beryx can expose this information. I've reached out to Beryx to ask whether this could be supported; currently, the API fails when trying to fetch this data.

  • listen for a new ScoresSet event, and extract the participant addresses, after we deployed a new IE contract with this feature

Adding a new event isn't bad, but migrating to a new contract is painful and takes time.

Review comment on observer/lib/observer.js, lines 68 to 81:
try {
scheduledRewards = await ieContract.rewardsScheduledFor(address)
} catch (err) {
console.error('Error querying scheduled rewards for', address, { cause: err })
continue
}
console.log('Scheduled rewards for', address, scheduledRewards)
await pgPool.query(`
INSERT INTO daily_scheduled_rewards (day, address, scheduled_rewards)
VALUES (now(), $1, $2)
ON CONFLICT (day, address) DO UPDATE SET
scheduled_rewards = EXCLUDED.scheduled_rewards
`, [address, scheduledRewards])
}
A Member commented:

This is going to be very expensive. With 5k daily active participants (see Spark Public Dashboard), we are going to make 5k RPC API calls followed by 5k SQL queries.

(1)
Can we group RPC API calls and SQL queries into batches? I believe Ethers v6 supports request batching, and we are configuring the RPC API provider to use batching. Now, we need to find out how to trigger the batching behaviour.
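
For reference, ethers v6 JsonRpcProvider coalesces concurrent JSON-RPC calls into a single HTTP request by default; these are the documented options for tuning that behaviour:

import { ethers } from 'ethers'

// batching is on by default; these options tune it
const provider = new ethers.JsonRpcProvider('https://api.node.glif.io/rpc/v1', undefined, {
  batchMaxCount: 10,    // max calls per batch (default: 100)
  batchStallTime: 10,   // ms to wait for more calls before sending (default: 10)
  batchMaxSize: 1 << 20 // max batch payload in bytes (default: 1 MB)
})
// calls issued concurrently, e.g. via Promise.all, end up in one batched request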

What I would like to see at high level:

const BATCH_SIZE = 10 // for example, we need to tweak this
for (const addrBatch of splitIntoBatches(dailyParticipantAddresses, BATCH_SIZE)) {
  // is this going to trigger Ethers.js batching behaviour?
  const rewards = await Promise.all(
    addrBatch.map(addr => ieContract.rewardsScheduledFor(addr))
  )

  // run a single SQL query to update multiple rows
  await pgPool.query(`
    INSERT INTO daily_scheduled_rewards (day, address, scheduled_rewards)
    VALUES (now(), unnest($1::text[]), unnest($2::numeric[]))
    ON CONFLICT (day, address) DO UPDATE SET
    scheduled_rewards = EXCLUDED.scheduled_rewards
  `, [
    addrBatch,
    rewards
  ])
}

(The query is inspired by an existing query in spark-evaluate here).
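
splitIntoBatches is not an existing utility; a minimal version could look like this:

// split an array into chunks of at most batchSize items
const splitIntoBatches = (items, batchSize) => {
  const batches = []
  for (let i = 0; i < items.length; i += batchSize) {
    batches.push(items.slice(i, i + batchSize))
  }
  return batches
}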

(2)
We must be mindful of how many requests we send to the Glif RPC API. IMO, we shouldn't send all 5k queries as fast as the systems can handle, as that would put too much load on the RPC API provider.

I propose to introduce a small delay between iterations of this loop.

If we batch requests for every 10 participants, we need to send ~500 requests. If we use a 1s delay, then we will finish updating all participants in 500 seconds ≈ 8.3 minutes. I think that's fast enough, since the scheduled rewards are updated every 20 minutes.
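
A sketch of that delay, using a plain promise-based helper and the 1s pause suggested above:

const delay = (ms) => new Promise((resolve) => setTimeout(resolve, ms))

// at the end of each iteration of the batch loop:
await delay(1000)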

@bajtos (Member) commented Jun 9, 2024

@bajtos do you have an idea how to best test this? I would suggest passing a mocked IE contract and then querying whether scheduled rewards have been recorded in the DB.

Not really. A mocked IE contract sounds like a good start to me. We already used that approach in voyager-publisher tests IIRC (see filecoin-station/voyager-api#22)

Another question is how to get the participant addresses.

Great description of different options available 👌🏻

I posted my thoughts in #131 (comment) before I read your comments.

I think spark-stats should read this information from the smart contract. The contract is the piece other services should interact with as much as possible, because it lives on the chain and can therefore be trusted.

💯

Instead of adding a setScores event, I propose adding an event emitted whenever a participant's scheduled rewards change. There are two cases when that happens: setScores and increaseParticipantBalance increase the balance, while transferScheduled decreases it.
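
If such an event existed, say ScheduledRewardsUpdated(address participant, uint256 amount) (the name and fields are hypothetical until the contract change is designed), spark-stats could consume it like this:

// ieContract is an ethers.Contract instance, pgPool a pg.Pool, as elsewhere in this PR
ieContract.on('ScheduledRewardsUpdated', async (participant, amount) => {
  await pgPool.query(`
    INSERT INTO daily_scheduled_rewards (day, address, scheduled_rewards)
    VALUES (now(), $1, $2)
    ON CONFLICT (day, address) DO UPDATE SET
      scheduled_rewards = EXCLUDED.scheduled_rewards
  `, [participant, amount])
})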

Add to spark-api/evaluate

But how should these services expose the participants in a way that is not opinionated? (E.g. after which period of inactivity should a participant be removed from the list? Should the list be queryable by date? Etc.)

As I explained in #131 (comment), spark-evaluate maintains the table daily_participants. This table powers our public dashboard, so I think it's safe to assume it will always interpret the term "participant" correctly.

As for opinions about inactivity and querying by date: I guess the current solution is somewhat opinionated to serve the needs of the dashboard, but I also think it's flexible enough to support additional use cases, e.g. your work in this pull request.

--

Depending on how much time you are willing to spend on this feature, I propose to choose one of the following two ways forward:

  • Least effort: Use the existing table daily_participants. No changes outside of this PR should be needed. (A query sketch follows after this list.)

  • Ideal solution: Implement the new smart contract event. It's more work, but the solution may perform better as we can listen to events instead of repeatedly checking the balance of every participant.
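
A sketch of the least-effort query (the join assumes spark-evaluate's schema, where daily_participants references participants(id); the 3-day window is an arbitrary example):

// fetch addresses of participants seen recently
const { rows } = await pgPool.query(`
  SELECT DISTINCT p.participant_address
  FROM daily_participants d
  JOIN participants p ON p.id = d.participant_id
  WHERE d.day >= now() - interval '3 days'
`)
const dailyParticipantAddresses = rows.map(r => r.participant_address)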

@juliangruber (Member, Author):

Awesome, +1 to using daily_participants <3

I agree that adding the event is the right solution, but I don't think this is the right time for it.

@juliangruber juliangruber requested a review from bajtos June 12, 2024 13:27
Resolved review threads: observer/bin/dry-run.js, observer/test/observer.test.js, README.md, observer/bin/migrate.js, observer/lib/observer.js, observer/package.json, stats/lib/typings.d.ts
@bajtos (Member) left a comment:

:shipit:

@juliangruber juliangruber merged commit e03bcf0 into main Jun 12, 2024
10 checks passed
@juliangruber juliangruber deleted the add/scheduled-rewards branch June 12, 2024 14:16