Add concurrency to the find-large-objects scrubber subcommand #8291

arpad-m · 2024-07-05T17:38:18Z

The find-large-objects scrubber subcommand is quite fast if you run it in an environment with low latency to the S3 bucket (say an EC2 instance in the same region). However, the higher the latency gets, the slower the command becomes. Therefore, add a concurrency param and make it parallelized. This doesn't change that general relationship, but at least lets us do multiple requests in parallel and therefore hopefully faster.

Running with concurrency of 64 (default):

2024-07-05T17:30:22.882959Z  INFO lazy_load_identity [...]
[...]
2024-07-05T17:30:28.289853Z  INFO Scanned 500 shards. [...]

With concurrency of 1, simulating state before this PR:

2024-07-05T17:31:43.375153Z  INFO lazy_load_identity [...]
[...]
2024-07-05T17:33:51.987092Z  INFO Scanned 500 shards. [...]

In other words, to list 500 shards, speed is increased from 2:08 minutes to 6 seconds.

Follow-up of #8257, part of #5431

storage_scrubber/src/find_large_objects.rs

skyzh

LGTM

storage_scrubber/src/find_large_objects.rs

github-actions · 2024-07-05T18:34:55Z

3042 tests run: 2927 passed, 0 failed, 115 skipped (full report)

Code coverage* (full report)

functions: 32.6% (6934 of 21279 functions)
lines: 50.0% (54491 of 108996 lines)

* collected from Rust tests only

_{The comment gets automatically updated with the latest test results
9c5e2f3 at 2024-07-05T20:22:56.387Z :recycle:}

The atomic wasn't required after all

skyzh · 2024-07-05T19:27:27Z

created a tracking issue for test lsn lease flaky: #8293

they are not needed

The find-large-objects scrubber subcommand is quite fast if you run it in an environment with low latency to the S3 bucket (say an EC2 instance in the same region). However, the higher the latency gets, the slower the command becomes. Therefore, add a concurrency param and make it parallelized. This doesn't change that general relationship, but at least lets us do multiple requests in parallel and therefore hopefully faster. Running with concurrency of 64 (default): ``` 2024-07-05T17:30:22.882959Z INFO lazy_load_identity [...] [...] 2024-07-05T17:30:28.289853Z INFO Scanned 500 shards. [...] ``` With concurrency of 1, simulating state before this PR: ``` 2024-07-05T17:31:43.375153Z INFO lazy_load_identity [...] [...] 2024-07-05T17:33:51.987092Z INFO Scanned 500 shards. [...] ``` In other words, to list 500 shards, speed is increased from 2:08 minutes to 6 seconds. Follow-up of #8257, part of #5431

Add concurrency to the find-large-objects scrubber subcommand

a6975f0

arpad-m requested review from problame and skyzh July 5, 2024 17:38

arpad-m mentioned this pull request Jul 5, 2024

Epic: pageserver image layer compression #5431

Open

20 tasks

skyzh reviewed Jul 5, 2024

View reviewed changes

storage_scrubber/src/find_large_objects.rs Outdated Show resolved Hide resolved

storage_scrubber/src/find_large_objects.rs Show resolved Hide resolved

skyzh approved these changes Jul 5, 2024

View reviewed changes

storage_scrubber/src/find_large_objects.rs Show resolved Hide resolved

arpad-m added 2 commits July 5, 2024 21:12

Address Chi's review comment

8eca2b2

Add back tenant_ctr

aa48651

The atomic wasn't required after all

arpad-m added 2 commits July 5, 2024 21:30

Remove expect

beeb8b7

Remove atomics completely

9c5e2f3

they are not needed

arpad-m enabled auto-merge (squash) July 5, 2024 19:42

arpad-m merged commit 0a937b7 into main Jul 5, 2024
65 checks passed

arpad-m deleted the arpad/scrubber_ls_larger branch July 5, 2024 20:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add concurrency to the find-large-objects scrubber subcommand #8291

Add concurrency to the find-large-objects scrubber subcommand #8291

arpad-m commented Jul 5, 2024 •

edited

Loading

skyzh left a comment

github-actions bot commented Jul 5, 2024 •

edited

Loading

skyzh commented Jul 5, 2024

Add concurrency to the find-large-objects scrubber subcommand #8291

Add concurrency to the find-large-objects scrubber subcommand #8291

Conversation

arpad-m commented Jul 5, 2024 • edited Loading

skyzh left a comment

Choose a reason for hiding this comment

github-actions bot commented Jul 5, 2024 • edited Loading

3042 tests run: 2927 passed, 0 failed, 115 skipped (full report)

Code coverage* (full report)

skyzh commented Jul 5, 2024

arpad-m commented Jul 5, 2024 •

edited

Loading

github-actions bot commented Jul 5, 2024 •

edited

Loading