[Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … #7727

shuoli84 · 2023-02-17T17:33:39Z

…top sort. reduce mem alloc

Objective

Reduce alloc count.
Improve code quality.

Solution

use TarjanScc::run directly, which calls a closure with each scc, in closure, we can detect cycles and flatten nodes

…top sort. reduce mem alloc

alice-i-cecile · 2023-02-17T17:38:51Z

Please remove the Changelog and Migration Guide completely for this PR :) It's less distracting, and avoids any automated tools picking them up.

maniwani · 2023-02-17T18:02:10Z

Factoring this logic into a topsort_graph makes the code easier to read, but none of this really reduces memory allocation.

tarjan_scc produces a Vec<Vec<G::NodeId>>. The iterator version, TarjanScc::run, still produces a Vec<G::NodeId> for each strongly-connected component, and then you push them into cycles. So you end up with the same result. The outer vector may end up smaller, or it might not since capacity usually grows exponentially (e.g. capacity doubles each realloc).

Also, I don't understand why you renamed strongly_connected_components into cycles and nodes_with_cycles. That's less clear. SCCs contain cycles. They are not cycles themselves. I would change it back (or sccs_with_cycles).

james7132 · 2023-02-17T18:33:54Z

tarjan_scc produces a Vec<Vec<G::NodeId>>. The iterator version, TarjanScc::run, still produces a Vec<G::NodeId> for each strongly-connected component, and then you push them into cycles. So you end up with the same result. The outer vector may end up smaller, or it might not since capacity usually grows exponentially (e.g. capacity doubles each realloc).

Is there a way for us to early return (without catching a panic) as we're processing each SCC? It'd reduce the total memory usage to the maximum number of nodes in each SCC.

maniwani · 2023-02-17T21:28:34Z

Is there a way for us to early return (without catching a panic) as we're processing each SCC? It'd reduce the total memory usage to the maximum number of nodes in each SCC.

Can you clarify what you're asking? The number of G::NodeId you get from tarjan_scc is always the number of nodes in the graph. They're just grouped into strongly-connected components.

If the question is: "Can we return as soon as we find an SCC with 2+ nodes?" I would ask, which do you prefer? Do you want all cycles reported, as in #7463? Then no. If you (and most users) are OK with maybe having to rebuild over and over to find and fix every cycle, then yes.

It's technically possible to avoid allocating a Vec<G::NodeId> when the SCC is a single node, using an enum:

enum StronglyConnectedComponent {
    Single(NodeId),
    Multiple(Vec<NodeId>),
}

But that would have to wait for bevy_graph.

The underlying algorithm doesn't actually need to allocate these while it's running. Putting the nodes of an SCC together in a Vec is just a "post-processing step". You can allocate each time you find one or wait until they're all found.

…ycles => scc_with_cycles

shuoli84 · 2023-02-18T04:08:44Z

Factoring this logic into a topsort_graph makes the code easier to read, but none of this really reduces memory allocation.

Maybe I missed something, so just made a test suit to verify whether mem alloc reduced. https://github.com/shuoli84/test_petgraph_scc_alloc. From the run, the mem alloc for schedule building reduced by around 30%, with system count 900, main alloc 8395 times, vs the optimized version 5690.

tarjan_scc produces a Vec<Vec<G::NodeId>>. The iterator version, TarjanScc::run, still produces a Vec<G::NodeId> for each strongly-connected component, and then you push them into cycles. So you end up with the same result. The outer vector may end up smaller, or it might not since capacity usually grows exponentially (e.g. capacity doubles each realloc).

Also, I don't understand why you renamed strongly_connected_components into cycles and nodes_with_cycles. That's less clear. SCCs contain cycles. They are not cycles themselves. I would change it back (or sccs_with_cycles).

renamed it to sccs_with_cycles.

EDIT: format

maniwani · 2023-02-19T06:37:02Z

shuoli84/test_petgraph_scc_alloc. From the run, the mem alloc for schedule building reduced by around 30%, with system count 900, main alloc 8395 times, vs the optimized version 5690.

Oh, I see now. You prepare the topsorted vector in one allocation. I spoke too soon. Sorry about that! Good catch!

// This is allocated once and never needs to reallocate.
let mut rev_top_sorted_nodes = Vec::<NodeId>::with_capacity(graph.node_count());
/* ... */

tarjan_scc.run(graph, |scc| {
    /* ... */
    // If this graph is acyclic, then we can just reverse this when we're done.
    rev_top_sorted_nodes.extend_from_slice(scc);
});

Yeah, the flattening and collect in sccs.into_iter().flatten().rev().collect::<Vec<_>>() that's in main is definitely way more expensive since we're squashing N vectors into 1.

Will approve once you resolve the CI errors.

alice-i-cecile · 2023-02-19T16:20:21Z

bors r+

#7727) …top sort. reduce mem alloc # Objective - Reduce alloc count. - Improve code quality. ## Solution - use `TarjanScc::run` directly, which calls a closure with each scc, in closure, we can detect cycles and flatten nodes

bors · 2023-02-19T16:50:44Z

Pull request successfully merged into main.

Build succeeded:

build-and-install-on-iOS
build-android
build (macos-latest)
build (ubuntu-latest)
build-wasm
build (windows-latest)
build-without-default-features (bevy)
build-without-default-features (bevy_ecs)
build-without-default-features (bevy_reflect)
check-compiles
check-doc
check-missing-examples-in-docs
ci
markdownlint
msrv
run-examples
run-examples-on-wasm
run-examples-on-windows-dx12

bevyengine#7727) …top sort. reduce mem alloc # Objective - Reduce alloc count. - Improve code quality. ## Solution - use `TarjanScc::run` directly, which calls a closure with each scc, in closure, we can detect cycles and flatten nodes

extract topsort logic to a new method, one pass to detect cycles and …

9befeab

…top sort. reduce mem alloc

alice-i-cecile added A-ECS Entities, components, systems, and events C-Performance A change motivated by improving speed, memory usage or compile times labels Feb 17, 2023

alice-i-cecile requested a review from maniwani February 17, 2023 17:39

alice-i-cecile added the C-Code-Quality A section of code that is hard to understand or change label Feb 17, 2023

vec::new => vec::with_capacity to reduce realloc. rename nodes_with_c…

3d5729e

…ycles => scc_with_cycles

trigger ci run

760a686

maniwani added this to the 0.10 milestone Feb 19, 2023

fix clippy

553e8f6

alice-i-cecile approved these changes Feb 19, 2023

View reviewed changes

maniwani approved these changes Feb 19, 2023

View reviewed changes

alice-i-cecile added the S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it label Feb 19, 2023

bors bot changed the title ~~extract topsort logic to a new method, one pass to detect cycles and …~~ [Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … Feb 19, 2023

bors bot closed this Feb 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … #7727

[Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … #7727

shuoli84 commented Feb 17, 2023 •

edited

Loading

alice-i-cecile commented Feb 17, 2023

maniwani commented Feb 17, 2023 •

edited

Loading

james7132 commented Feb 17, 2023

maniwani commented Feb 17, 2023 •

edited

Loading

shuoli84 commented Feb 18, 2023 •

edited

Loading

maniwani commented Feb 19, 2023 •

edited

Loading

alice-i-cecile commented Feb 19, 2023

bors bot commented Feb 19, 2023

[Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … #7727

[Merged by Bors] - extract topsort logic to a new method, one pass to detect cycles and … #7727

Conversation

shuoli84 commented Feb 17, 2023 • edited Loading

Objective

Solution

alice-i-cecile commented Feb 17, 2023

maniwani commented Feb 17, 2023 • edited Loading

james7132 commented Feb 17, 2023

maniwani commented Feb 17, 2023 • edited Loading

shuoli84 commented Feb 18, 2023 • edited Loading

maniwani commented Feb 19, 2023 • edited Loading

alice-i-cecile commented Feb 19, 2023

bors bot commented Feb 19, 2023

shuoli84 commented Feb 17, 2023 •

edited

Loading

maniwani commented Feb 17, 2023 •

edited

Loading

maniwani commented Feb 17, 2023 •

edited

Loading

shuoli84 commented Feb 18, 2023 •

edited

Loading

maniwani commented Feb 19, 2023 •

edited

Loading