Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Design interface for SPARK as a datasource #19

Open
coreymjames opened this issue Sep 17, 2023 · 6 comments
Open

Design interface for SPARK as a datasource #19

coreymjames opened this issue Sep 17, 2023 · 6 comments

Comments

@coreymjames
Copy link

User interviews has surfaced that some retrieval incentive programs (namely ReputationDAO, FIL+, and others) will need to access SPARK data, meaning that we need:

  1. a simple interface through which this data can be pulled
  2. a clear db solution, with an understanding of where and for how long SPARK data will be stored
@coreymjames
Copy link
Author

Or is this within /spark-api repo?

@juliangruber
Copy link
Member

With ReputationDAO, wouldn't we insert the data into their DB?

@coreymjames
Copy link
Author

coreymjames commented Sep 18, 2023 via email

@MF416
Copy link

MF416 commented Sep 18, 2023

I think the clear next steps are to understand how these various groups would use the data more tactically, which we could accomplish by following up with RG / Torfinn / Caro and digging into details / offering a few options

ReputationDAO --> @coreymjames do they have a technical lead here? we should get answers to questions like:

  • Have they defined their report format / how they generate analytics?
  • Do they have a DB set up that we can look at interfacing with?
  • How are they planning on maintaining this?

FIL+ --> likely that this gets fed into the reports that RG generates and adds to github threads, need to understand how the bot / script he's using would ingest the data

Could quickly outline the options (below) and show them a few formats to get feedback

  • API (have a sample JSON with datapoints / format)
  • Data dump into csv
  • Proposed time and feature list

@coreymjames
Copy link
Author

coreymjames commented Sep 18, 2023 via email

@bajtos
Copy link
Member

bajtos commented Oct 4, 2023

We are discussing integration with Reputation DAO in space-meridian/roadmap#21, you can find a lot of details there.

SPARK does submit measurements to IPFS, but it's raw data before fraud detection.

What we need instead is only measurements that we consider as valid (honest).

To get that, we need to change the Evaluate service to publish the data + fraud assessment to a new datastore.

Is there a simple/easy way for us to create an interface/API for RepDAO,
FIL+, or any other potential project to access SPARK data in the future?

  1. Reputation DAO is push-based - we are pushing data to their DB, no API is needed on our side.
  2. Creating a new API for consuming our measurements is relatively simple. Depending on the required features, of course. The simplest possible option is to publish our data to a SaaS offering DB + API out of the box, e.g. https://supabase.com

Is it possible/easy for this to be paywalled?

It depends on how what we want to charge for.

If the paywall is a simple "has full read access" vs "access not allowed", then it's very easy to implement.

That may not be enough though, we will soon need to add rate limiting mechanism to keep our DB and egress costs manageable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: 🗃 backlog
Development

No branches or pull requests

4 participants