Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) #287

bajtos · 2024-07-11T12:06:37Z

See https://www.notion.so/spacemeridian/Spark-Tasking-v3-745b0e1020bb4000ac77acafee09e683

The step of building the list of per-station tasks takes 3124ms to complete, increasing the duration of the evaluation by 50% to 9641ms.

Links:

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos · 2024-07-11T14:05:39Z

Analysis of committee sizes before and after this change:

https://www.notion.so/spacemeridian/Spark-Tasking-v3-745b0e1020bb4000ac77acafee09e683?pvs=4#900b01ef2cb148cd9d12a9a82ce3b21a

Conclusion

The proposed design has acceptable performance and significantly improves the quality of committees.

To have enough confidence that a majority of each committee is honest, we need each committee to have at least 40-50 participants. The current algorithm does not meet that requirement for at least 10% of committees. The proposed algorithm seems to meet this requirement in all committees.

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

juliangruber · 2024-07-11T23:43:54Z

lib/typings.d.ts

@@ -31,6 +32,7 @@ export type RecordTelemetryFn = (
 export type FraudAssesment =
  | 'OK'
  | 'INVALID_TASK'
+  | 'TASK_NOT_ALLOWED'


conceptually, what's the difference between invalid_task and task_not_allowed? The first includes the second I think, but maybe we can rename it to be more precise?

Great question! In the current code:

INVALID_TASK means the task does not match any of the 1000 tasks defined for the round.

TASK_NOT_ALLOWED means the task is valid for the round, but this node is not allowed to perform it.

I'd like to keep the distinction. INVALID_TASK is typically used for measurements that are committed to the wrong round. Typically, all measurements in the first batch published after a new round starts are measurements for tasks from the previous round because spark-api & checker nodes haven't noticed the new round yet.

I agree to find more descriptive names 👍🏻

Got it! What about

TASK_NOT_FOUND

TASK_WRONG_NODE

What do you think about this?

TASK_NOT_IN_ROUND

TASK_NOT_FOR_NODE

On the second thought, TASK_WRONG_NODE works too 👍🏻

juliangruber · 2024-07-15T15:24:14Z

lib/evaluate.js

+  logger.log('EVALUATE ROUND %s: using randomness %s', roundIndex, randomness)
+
+  const started = Date.now()
+  const taskWithKeys = await Promise.all(sparkRoundDetails.retrievalTasks.map(async (task) => {


Suggested change

const taskWithKeys = await Promise.all(sparkRoundDetails.retrievalTasks.map(async (task) => {

const tasksWithKeys = await Promise.all(sparkRoundDetails.retrievalTasks.map(async (task) => {

How about keyedTasks?

juliangruber · 2024-07-15T15:24:21Z

lib/evaluate.js

+    return { ...task, key }
+  }))
+
+  const seeker = new closest.Seeker([...taskWithKeys], (targetKey, t) => t.key ^ targetKey)


Suggested change

const seeker = new closest.Seeker([...taskWithKeys], (targetKey, t) => t.key ^ targetKey)

const seeker = new closest.Seeker([...tasksWithKeys], (targetKey, t) => t.key ^ targetKey)

juliangruber · 2024-07-15T15:31:08Z

lib/evaluate.js

+  /* eslint-disable-next-line camelcase */
+  for (const { stationId, participantAddress, inet_group } of measurements) {
+    if (stations.has(stationId)) continue
+    /* eslint-disable-next-line camelcase */
+    stations.set(stationId, { participantAddress, inet_group })


Suggested change

/* eslint-disable-next-line camelcase */

for (const { stationId, participantAddress, inet_group } of measurements) {

if (stations.has(stationId)) continue

/* eslint-disable-next-line camelcase */

stations.set(stationId, { participantAddress, inet_group })

for (const { stationId, participantAddress, inet_group: inetGroup } of measurements) {

if (stations.has(stationId)) continue

/* eslint-disable-next-line camelcase */

stations.set(stationId, { participantAddress, inetGroup })

I think it's better to rename props as soon as you use them

bajtos · 2024-07-31T08:15:16Z

Closing in favour of #296

poc: deterministic tasking using StationID ^ SHA256(Task)

3347a3a

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos requested a review from juliangruber July 11, 2024 12:06

experiment - next-gencommittee distributions

630c011

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos mentioned this pull request Jul 11, 2024

Deterministic task<->node assignment filecoin-station/spark#38

Closed

4 tasks

bajtos added 3 commits July 11, 2024 16:02

hash StationID to create the key

06c415e

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

fixup! remove forgotten process.exit()

40f4e43

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

fixup! tsc

3f0cee9

Signed-off-by: Miroslav Bajtoš <oss@bajtos.net>

bajtos changed the title ~~poc: deterministic tasking using StationID ^ SHA256(Task)~~ Proof of concept: deterministic tasking using StationID ^ SHA256(Task) Jul 11, 2024

bajtos changed the title ~~Proof of concept: deterministic tasking using StationID ^ SHA256(Task)~~ Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) Jul 11, 2024

juliangruber requested changes Jul 15, 2024

View reviewed changes

This was referenced Jul 22, 2024

feat: deterministic tasking filecoin-station/spark#85

Merged

experiment with the new tasking algorithm #234

Closed

This was referenced Jul 30, 2024

test: update integration test to use round 12012 #295

Merged

feat: deterministic task assignment #296

Merged

bajtos closed this Jul 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) #287

Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) #287

bajtos commented Jul 11, 2024 •

edited

Loading

bajtos commented Jul 11, 2024 •

edited

Loading

juliangruber Jul 11, 2024

bajtos Jul 22, 2024

juliangruber Jul 22, 2024

bajtos Jul 23, 2024

juliangruber Jul 15, 2024

bajtos Jul 22, 2024

juliangruber Jul 15, 2024

juliangruber Jul 15, 2024

bajtos commented Jul 31, 2024

	const taskWithKeys = await Promise.all(sparkRoundDetails.retrievalTasks.map(async (task) => {
	const tasksWithKeys = await Promise.all(sparkRoundDetails.retrievalTasks.map(async (task) => {

	const seeker = new closest.Seeker([...taskWithKeys], (targetKey, t) => t.key ^ targetKey)
	const seeker = new closest.Seeker([...tasksWithKeys], (targetKey, t) => t.key ^ targetKey)

Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) #287

Proof of concept: deterministic tasking using SHA256(StationID) ^ SHA256(Task) #287

Conversation

bajtos commented Jul 11, 2024 • edited Loading

bajtos commented Jul 11, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bajtos commented Jul 31, 2024

bajtos commented Jul 11, 2024 •

edited

Loading

bajtos commented Jul 11, 2024 •

edited

Loading