Handle meta events #3070

roman-khimov · 2024-12-25T09:44:28Z

Is your feature request related to a problem? Please describe.

A part of #2878.

Describe the solution you'd like

Add meta event handling:

create a new meta DB (single instance per node), reuse https://pkg.go.dev/github.com/nspcc-dev/[email protected]/pkg/core/storage for it
forget about buckets, use prefixes for everything
listen for events, catch and process them
for each block open a new MemCachedStore, process in-block data immediately
in-block data allows to build an index of OIDs and big/lock indexes
these prefixes are added into https://pkg.go.dev/github.com/nspcc-dev/[email protected]/pkg/core/mpt when we're done with block processing (keep only latest state, we don't need historic)
a full header index (see Implement searchv2 #3058) can be managed in a separate routine and not be a part of MPT
mem cache flushing can be per-block currently

Describe alternatives you've considered

There can be some, but we need this data to be processed fast and we'll need MPT for subsequent synchronization.

The text was updated successfully, but these errors were encountered:

roman-khimov · 2024-12-25T09:45:49Z

@AnnaShaleva and @AliceInHunterland can tell a bit more about the way storage is handled in NeoGo. What we're doing here is very similar to storeBlock except that we work with events and some data needs to be obtained additionally (but it's not a part of the state proper at the same time).

carpawell · 2024-12-26T18:11:45Z

@roman-khimov

So we have a new DB now for our indexes (not sure why it is a reuse of neo-go's one but ok, dont mind) and also have a MemCachedStore memory instance for every block, why? That is exactly "cached" version, not just a memory instance per block, right? Over what is it created, the original disk DB? What does the cache solve?
"listen for events", "for each block". What is this new DB subscribed for? I thought that it should receive only our new events from meta-on-chain TXs but it also needs to process every block?
"these prefixes are added into https://pkg.go.dev/github.com/nspcc-dev/[email protected]/pkg/core/mpt". What exact "these prefixes"? Why we need mpt here?
"full header index ... not be a part of MPT" what is the criterion of what mpt and what is not?

roman-khimov · 2024-12-26T18:57:15Z

It can be done without MemCachedStore using per-block Bolt transactions, but reusing NeoGo storage subsystem allows to simplify some interfaces and make DBs changeable. In future we can also optimize flushing from per-block to something smarter. Memory itself is irrelevant here.
Listen for new events, you need to watch for objects, that's the only thing you care about (epochs are interesting too, but maybe not immediately).
MPT is to identify and synchronize state. Prefixes are those created above, not the full index.
Reproducibility. I can fetch headers and recreate index using data stored in MPT, so adding everything into MPT is a resource waste.

carpawell · 2025-01-20T19:48:32Z

@roman-khimov, @cthulhu-rider, considering #3085 and #3080 we now have two meta storages: first one is a fully new DB that relies only on the information from FSchain and currently stores only this info in MPT (and persisting it to the new DB instance); the second one(s) is the old metabase per every shard that takes info from PUT/REPLICATE RPC directly without any object flow changes.
AFAIU, our target goal is to have some combination of these PRs. We need to have meta-from-chain indexes in MPT but we also need to have an index(es) of what is stored on disk.
We need to discuss:

If we need some third kind of DB that stores meta information but neither in MPT, nor in the old metabases. Indexes for objects that are not stored in the current node but that are also not just MTP data from the FSchain. As I understand ("...a full header index can be managed in a separate routine and not be a part of MPT..."), a full header should also be processed after receiving notification from FSchain, not just be a part of a regular shard's PUT routine.
What code should be shared to prevent repeating? Should there be support for more than just bolt (neo-go supports more storages and works on interfaces, our current metabase code -- no)?
In what order do we release our features?

roman-khimov · 2025-01-21T10:01:04Z

Same DB, different prefixes/handling.
Leave Bolt to the old metabase, reuse DB abstraction provided by NeoGo. But you need to translate headers into KVs and this code can be shared.
Search, meta.

roman-khimov added U1 Critically important to resolve quickly S1 Highly significant I2 Regular impact feature Completely new functionality labels Dec 25, 2024

roman-khimov added this to the v0.45.0 milestone Dec 25, 2024

roman-khimov assigned carpawell Dec 25, 2024

roman-khimov mentioned this issue Dec 25, 2024

Chained metadata, v1 #2878

Open

roman-khimov mentioned this issue Dec 27, 2024

API to fetch all notifications from a block (with filters) nspcc-dev/neo-go#3779

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle meta events #3070

Handle meta events #3070

roman-khimov commented Dec 25, 2024

roman-khimov commented Dec 25, 2024

carpawell commented Dec 26, 2024

roman-khimov commented Dec 26, 2024

carpawell commented Jan 20, 2025

roman-khimov commented Jan 21, 2025

Handle meta events #3070

Handle meta events #3070

Comments

roman-khimov commented Dec 25, 2024

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

roman-khimov commented Dec 25, 2024

carpawell commented Dec 26, 2024

roman-khimov commented Dec 26, 2024

carpawell commented Jan 20, 2025

roman-khimov commented Jan 21, 2025