Simplify rustc_serialize by dropping support for decoding into JSON #93839

Mark-Simulacrum · 2022-02-09T23:22:03Z

This PR currently bundles two (somewhat separate) tasks.

First, it removes the JSON Decoder trait impl, which permitted going from JSON to Rust structs. For now, we keep supporting JSON deserialization, but only to Json (an equivalent of serde_json::Value). The primary hard to remove user there is for custom targets -- which need some form of JSON deserialization -- but they already have a custom ad-hoc pass for moving from Json to a Rust struct.

A comment there suggests that it would be impractical to move them to a Decodable-based impl, at least without backwards compatibility concerns. I suspect that if we were widely breaking compat there, it would make sense to use serde_json at this point which would produce better error messages; the types in rustc_target are relatively isolated so we would not particularly suffer from using serde_derive.

The second part of the PR (all but the first commit) is to simplify the Decoder API by removing the non-primitive read_* functions. These primarily add indirection (through a closure), which doesn't directly cause a performance issue (the unique closure types essentially guarantee monomorphization), but does increase the amount of work rustc and LLVM need to do. This could be split out to a separate PR, but is included here in part to help motivate the first part.

Future work might consist of:

Specializing enum discriminant encoding to avoid leb128 for small enums (since we know the variant count, we can directly use read/write u8 in almost all cases)
Adding new methods to support faster deserialization (e.g., access to the underlying byte stream)
- Currently these are somewhat ad-hoc supported by specializations for e.g. Vec<u8>, but other types which could benefit don't today.
Removing the Decoder trait entirely in favor of a concrete type -- today, we only really have one impl of it modulo wrappers used for specialization-based dispatch.

Highly recommend review with whitespace changes off, as the removal of closures frequently causes things to be de-indented.

rust-highfive · 2022-02-09T23:22:06Z

r? @jackh726

(rust-highfive has picked a reviewer for you, use r? to override)

Mark-Simulacrum · 2022-02-09T23:23:51Z

@bors try @rust-timer queue -- I'm interested to see if the simplification has any effect on compiler bootstrap times or performance.

Follows on groundwork laid by #93681, #93680 to remove JSON usage from a few places. It may merit an MCP, which likely would replicate the PR description fairly closely, but will leave that decision up to the reviewer.

rust-timer · 2022-02-09T23:23:53Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-02-09T23:24:06Z

⌛ Trying commit 5c90ffea61c1b2fb8b732e56afa71dcc1cdeab87 with merge 413eebf76cb8e173694bdcc35c93c55542ec76a0...

bors · 2022-02-10T00:56:12Z

☀️ Try build successful - checks-actions
Build commit: 413eebf76cb8e173694bdcc35c93c55542ec76a0 (413eebf76cb8e173694bdcc35c93c55542ec76a0)

rust-timer · 2022-02-10T00:56:14Z

Queued 413eebf76cb8e173694bdcc35c93c55542ec76a0 with parent e7aca89, future comparison URL.

rust-timer · 2022-02-10T02:27:37Z

Finished benchmarking commit (413eebf76cb8e173694bdcc35c93c55542ec76a0): comparison url.

Summary: This benchmark run shows 6 relevant improvements 🎉 but 9 relevant regressions 😿 to instruction counts.

Average relevant regression: 1.1%
Average relevant improvement: -2.2%
Largest improvement in instruction counts: -3.3% on incr-unchanged builds of externs debug
Largest regression in instruction counts: 1.9% on incr-unchanged builds of clap-rs check

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR led to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

Mark-Simulacrum · 2022-02-10T19:09:09Z

I'm pretty sure the small amount of regressions (and improvements, for that matter) in compiler performance is just optimizer noise due to a bunch of different inlining decisions as things get shuffled earlier or later due to simpler Decode impls.

On the other hand, the compiler bootstrap performance improvement is likely not noise, and is a nice ~6 seconds overall. IMO the cleanup here is well-motivated regardless of the comparatively minor regressions in some benchmarks, and primarily incr-unchanged benchmarks at that.

est31 · 2022-02-11T00:56:05Z

Related: #85993

Mark-Simulacrum · 2022-02-11T02:20:04Z

Yes, this is intentionally intended as a version that does not cause any practical impact (i.e., no features removed) with the goal of enabling it to land without too much discussion that #85993 requires due to dropping support for -Zast-json and the like.

compiler/rustc_query_system/src/dep_graph/serialized.rs

jackh726 · 2022-02-12T03:55:42Z

r? @cjgillot

nnethercote · 2022-02-17T22:20:01Z

I have a weak preference for preserving read_seq and read_map, because they're providing some useful structure. But either way, this is a lovely change, r=me once you've decided what to do.

Mark-Simulacrum · 2022-02-17T23:17:21Z

@bors r=nnethercote

read_seq and read_map are both reading just a length from the buffer prior to being called with the current impl, which means that they're tail-calling the passed closure, and I at least find it a little clearer to read the read_usize in plain text (inlined, sort of) than not.

I also think that as we go further along this path (e.g., moving to a Decoder struct) that calculus might shift a little, particularly if some of the users of read_seq start wanting to do something like read_seq_leb128 or so.

bors · 2022-02-17T23:17:22Z

📌 Commit 5b6932da8c30c5c010e6291b5fc758954a2f3700 has been approved by nnethercote

ehuss · 2022-02-21T04:10:32Z

@bors ping

bors · 2022-02-21T04:10:33Z

😪 I'm awake I'm awake

ehuss · 2022-02-21T04:10:42Z

@bors retry

auto (dist-aarch64-apple, ./x.py dist --stage 2, --build=x86_64-apple-darwin --host=aarch64-apple...
Received request to deprovision: The request was cancelled by the remote provider.

bors · 2022-02-21T07:22:39Z

⌛ Testing commit c6ad61a with merge 6bdcc5730ff1028182339ecf33e8cf98464128a2...

bors · 2022-02-21T08:11:13Z

💔 Test failed - checks-actions

rust-log-analyzer · 2022-02-21T08:12:24Z

A job failed! Check out the build log: (web) (plain)

Click to see the possible cause of the failure (guessed by this bot)

Mark-Simulacrum · 2022-02-21T12:34:36Z

@bors retry dist-apple-various didn't start

bors · 2022-02-22T07:54:30Z

⌛ Testing commit c6ad61a with merge 58a721a...

bors · 2022-02-22T10:47:47Z

☀️ Test successful - checks-actions
Approved by: nnethercote
Pushing 58a721a to master...

rust-timer · 2022-02-22T12:18:02Z

Finished benchmarking commit (58a721a): comparison url.

Summary: This benchmark run shows 3 relevant improvements 🎉 but 6 relevant regressions 😿 to instruction counts.

Average relevant regression: 0.8%
Average relevant improvement: -0.5%
Largest improvement in instruction counts: -0.5% on incr-unchanged builds of ctfe-stress-4 check
Largest regression in instruction counts: 1.0% on full builds of externs opt

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression

Cleanup a few Decoder methods This is just some simple follow up to rust-lang#93839. r? `@nnethercote`

rust-highfive assigned jackh726 Feb 9, 2022

rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Feb 9, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Feb 9, 2022

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Feb 9, 2022

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Feb 10, 2022

Mark-Simulacrum force-pushed the delete-json-rust-deserialization branch from 5c90ffe to eac0eae Compare February 10, 2022 15:36

This comment has been minimized.

Sign in to view

Mark-Simulacrum force-pushed the delete-json-rust-deserialization branch from eac0eae to 6674f0f Compare February 10, 2022 16:34

Mark-Simulacrum added the perf-regression-triaged The performance regression has been triaged. label Feb 10, 2022

cjgillot reviewed Feb 11, 2022

View reviewed changes

compiler/rustc_query_system/src/dep_graph/serialized.rs Show resolved Hide resolved

Mark-Simulacrum force-pushed the delete-json-rust-deserialization branch from 6674f0f to 5b6932d Compare February 11, 2022 21:37

rust-highfive assigned cjgillot and unassigned jackh726 Feb 12, 2022

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 21, 2022

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Feb 21, 2022

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 21, 2022

bors added the merged-by-bors This PR was explicitly merged by bors. label Feb 22, 2022

bors merged commit 58a721a into rust-lang:master Feb 22, 2022

rustbot added this to the 1.61.0 milestone Feb 22, 2022

Mark-Simulacrum deleted the delete-json-rust-deserialization branch February 22, 2022 12:50

Mark-Simulacrum mentioned this pull request Feb 23, 2022

Cleanup a few Decoder methods #94288

Merged

matthiaskrgr added a commit to matthiaskrgr/rust that referenced this pull request Feb 24, 2022

Rollup merge of rust-lang#94288 - Mark-Simulacrum:ser-opt, r=nnethercote

ae27c4a

Cleanup a few Decoder methods This is just some simple follow up to rust-lang#93839. r? `@nnethercote`

nnethercote mentioned this pull request Apr 22, 2022

Remove all json handling from rustc_serialize #85993

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify rustc_serialize by dropping support for decoding into JSON #93839

Simplify rustc_serialize by dropping support for decoding into JSON #93839

Mark-Simulacrum commented Feb 9, 2022 •

edited

Loading

rust-highfive commented Feb 9, 2022

Mark-Simulacrum commented Feb 9, 2022

rust-timer commented Feb 9, 2022

bors commented Feb 9, 2022

This comment has been minimized.

bors commented Feb 10, 2022

rust-timer commented Feb 10, 2022

rust-timer commented Feb 10, 2022

This comment has been minimized.

Mark-Simulacrum commented Feb 10, 2022

est31 commented Feb 11, 2022

Mark-Simulacrum commented Feb 11, 2022

jackh726 commented Feb 12, 2022

nnethercote commented Feb 17, 2022

Mark-Simulacrum commented Feb 17, 2022

bors commented Feb 17, 2022

ehuss commented Feb 21, 2022

bors commented Feb 21, 2022

ehuss commented Feb 21, 2022

bors commented Feb 21, 2022

bors commented Feb 21, 2022

rust-log-analyzer commented Feb 21, 2022

Mark-Simulacrum commented Feb 21, 2022

bors commented Feb 22, 2022

bors commented Feb 22, 2022

rust-timer commented Feb 22, 2022

Simplify rustc_serialize by dropping support for decoding into JSON #93839

Simplify rustc_serialize by dropping support for decoding into JSON #93839

Conversation

Mark-Simulacrum commented Feb 9, 2022 • edited Loading

rust-highfive commented Feb 9, 2022

Mark-Simulacrum commented Feb 9, 2022

rust-timer commented Feb 9, 2022

bors commented Feb 9, 2022

This comment has been minimized.

bors commented Feb 10, 2022

rust-timer commented Feb 10, 2022

rust-timer commented Feb 10, 2022

This comment has been minimized.

Mark-Simulacrum commented Feb 10, 2022

est31 commented Feb 11, 2022

Mark-Simulacrum commented Feb 11, 2022

jackh726 commented Feb 12, 2022

nnethercote commented Feb 17, 2022

Mark-Simulacrum commented Feb 17, 2022

bors commented Feb 17, 2022

ehuss commented Feb 21, 2022

bors commented Feb 21, 2022

ehuss commented Feb 21, 2022

bors commented Feb 21, 2022

bors commented Feb 21, 2022

rust-log-analyzer commented Feb 21, 2022

Mark-Simulacrum commented Feb 21, 2022

bors commented Feb 22, 2022

bors commented Feb 22, 2022

rust-timer commented Feb 22, 2022

Mark-Simulacrum commented Feb 9, 2022 •

edited

Loading