[Remote State] Upload each metadata attributes file instead of single global metadata file #12468

shiv0408 · 2024-02-26T18:41:09Z

Is your feature request related to a problem? Please describe

Currently, we store the remote cluster state as a single global metadata file plus one index metadata file per index, and we track all of them in a manifest file. As the cluster grows and the cluster state grows with usage, the global metadata file grows too.
Every cluster state update requires writing the updated metadata to the remote store. If the global metadata file is large and the incoming change touches global metadata, we upload the whole file again, even for a small settings change, which increases cluster state update latency.

Describe the solution you'd like

We propose splitting the global metadata file into the following components:

global-metadata/
       | --> coordination__metadata
       | --> settings__metadata
       | --> templates__metadata
       | --> custom__<type1>__metadata
       | --> custom__<type2>__metadata
       | --> ...

This way, if only a setting is modified, we don't update the other files. If multiple files need to be updated, they are uploaded in parallel, which is still better than uploading the full file.
This change should significantly decrease cluster state update latency in large clusters.
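
To illustrate the idea, here is a minimal Python sketch of "split, diff, upload only what changed, in parallel". It is not the OpenSearch implementation (that is in PR #12190). The function names, the dictionary-shaped metadata, the checksum-based change detection, and the `upload` callback are all assumptions made for the example. Only the file names follow the layout proposed above.

```python
import hashlib
import json
from concurrent.futures import ThreadPoolExecutor


def split_global_metadata(metadata):
    """Map each top-level attribute to its own file name (hypothetical helper)."""
    files = {
        "coordination__metadata": metadata["coordination"],
        "settings__metadata": metadata["settings"],
        "templates__metadata": metadata["templates"],
    }
    # One file per custom metadata type, following custom__<type>__metadata.
    for custom_type, custom in metadata.get("customs", {}).items():
        files[f"custom__{custom_type}__metadata"] = custom
    return files


def checksum(component):
    """Stable content hash; assumes the component is JSON-serializable."""
    payload = json.dumps(component, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def upload_changed(metadata, previous_checksums, upload):
    """Upload only the attribute files whose content changed.

    `upload(name, body)` stands in for the remote store write.
    Returns the new checksums and the sorted names that were uploaded.
    """
    files = split_global_metadata(metadata)
    current = {name: checksum(body) for name, body in files.items()}
    changed = [n for n in current if previous_checksums.get(n) != current[n]]
    # Changed files are written concurrently rather than as one large file.
    with ThreadPoolExecutor(max_workers=max(1, len(changed))) as pool:
        list(pool.map(lambda name: upload(name, files[name]), changed))
    return current, sorted(changed)


# Usage: the first write uploads every file; a settings-only change uploads one.
state = {
    "coordination": {"term": 1},
    "settings": {"cluster.max_shards_per_node": 1000},
    "templates": {},
    "customs": {"type1": {"enabled": True}},
}
uploaded = []
checksums, first = upload_changed(state, {}, lambda n, b: uploaded.append(n))
state["settings"]["cluster.max_shards_per_node"] = 2000
checksums, second = upload_changed(state, checksums, lambda n, b: uploaded.append(n))
```

In this sketch the second call uploads only `settings__metadata`. A real implementation would also need to record the uploaded file names in the manifest file, which is omitted here.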

Related component

Cluster Manager

Describe alternatives you've considered

No response

Additional context

No response


peternied · 2024-02-28T16:34:36Z

[Triage - attendees 1 2 3 4 5]
@shiv0408 Thanks for creating this issue; however, it isn't being accepted because it is not clear what the problem is or how the proposal addresses it. Please feel free to open a new issue after addressing this.

sarthakaggarwal97 · 2024-03-13T02:31:42Z

@shiv0408 Thanks for elaborating on the issue. Please share some numbers on the improvement as well. Thanks!

shiv0408 · 2024-03-13T07:17:51Z

Thanks for reopening the issue @sarthakaggarwal97. I have added the benchmark results in the linked PR.

We see around a 50-70% improvement in writing incremental metadata; see PR #12190 for more details.

andrross · 2024-03-13T15:36:17Z

[Triage - attendees 1 2 3]
@shiv0408 Thanks for filing this issue. Looking forward to seeing progress here.

shiv0408 added the enhancement and untriaged labels Feb 26, 2024

shiv0408 mentioned this issue Feb 26, 2024 in "Split the remote global metadata file to metadata attribute files" #12190 (merged)

github-actions bot added the Cluster Manager label Feb 26, 2024

peternied closed this as completed Feb 28, 2024

sarthakaggarwal97 reopened this Mar 13, 2024

andrross removed the untriaged label Mar 13, 2024

shwetathareja closed this as completed in #12190 May 16, 2024

shiv0408 self-assigned this May 16, 2024
