LDS access proposal #29

Jasper-Bekkers · 2020-09-11T16:09:23Z

No description provided.

Jasper-Bekkers · 2020-09-11T16:09:56Z

Rendered: https://github.com/EmbarkStudios/rust-gpu/blob/lds-barriers/rfcs/000-safe-lds-access.md

nhaehnle · 2020-09-11T16:12:31Z

rfcs/000-safe-lds-access.md

+// sample 3:
+let lds = LdsWriter::<u32>::new();
+
+lds.write_thread_idx(0);
+lds.write_thread_idx(666); // race?
+```


This is fine. It's just consecutive writes to the same address, the later write wins due to program order.

Yeah - this was mostly to discuss what the potential rules from the Rust point of view are. However, since this shouldn't ever lead to invalid data I think we're fine. We could probably elide the first store entirely in this case.

nhaehnle · 2020-09-11T16:13:24Z

rfcs/000-safe-lds-access.md

+let wrt = rdr.barrier();
+wrt.write_thread_idx(12234);
+```


This part seems fine as long as rdr.barrier() does an OpControlBarrier. The part above with barriers in non-uniform control flow is a problem, but it's orthogonal.

nhaehnle · 2020-09-11T16:17:48Z

Non-uniform control flow is a problem, but I think if the implementation of barrier() guarantees that it will only proceed if all threads in the workgroup reach the barrier, then you're fine. This is not a guarantee provided by OpControlBarrier, so it would have to be added somehow.

My understanding is that safe Rust can still deadlock, which is what may happen in the examples where barriers are inside of control flow.

Jasper-Bekkers · 2020-09-11T16:27:38Z

@Tobski proposed this in chat as an alternative:

struct BufferWriter;

impl BufferWriter {
    fn write_thread_idx(&mut self, value: T);
}

struct BufferReader;


impl BufferReader {
    fn read(&self, idx: usize) -> T;
    fn partition_readers(self, ranges: [(usize, usize)]) -> [BufferReader];
    fn partition_writers(self, ranges: [(usize, usize)]) -> [BufferWriter];
}

fn join_writers(a: BufferWriter, b: BufferWriter) -> BufferReader;
fn join_readers(a: BufferReader, b: BufferReader) -> BufferReader;

the basic idea is that each pass of the bitonic sort owns a different subset of the same buffer in each iteration, and you kind of need a way to express that, which is what I'm trying to do above.

something like

get 2 elements
read
compare
write
join with neighbour to get 4 elements
read
compare
write
join with neighbour to get 8 elements
read
compare
write
partition back to 2 elements
read
compare
write
...etc.

Jasper-Bekkers · 2020-09-11T16:42:37Z

Non-uniform control flow is a problem, but I think if the implementation of barrier() guarantees that it will only proceed if all threads in the workgroup reach the barrier, then you're fine. This is not a guarantee provided by OpControlBarrier, so it would have to be added somehow.

Would it be sensible for the compiler to move the OpControlBarrier out of the controlflow? I'm expecting problems with deeply nested control flow but it might be worth exploring.

My understanding is that safe Rust can still deadlock, which is what may happen in the examples where barriers are inside of control flow.

Rust can still deadlock - it's not one of it's safety guarantees. A quick search gives this:

fn main() {
    let (tx1, rx1) = channel();
    let (tx2, rx2) = channel();

    spawn(proc() {
        println!("Waiting for result 1");
        rx1.recv();
    
        println!("Sending result 2");
        tx2.send(());
    });

    println!("Waiting for result 2");
    rx2.recv();

    println!("Sending result 1");
    tx1.send(());
}

Jasper-Bekkers · 2020-09-11T16:55:08Z

#8 earlier discussion

nhaehnle · 2020-09-12T15:13:15Z

Non-uniform control flow is a problem, but I think if the implementation of barrier() guarantees that it will only proceed if all threads in the workgroup reach the barrier, then you're fine. This is not a guarantee provided by OpControlBarrier, so it would have to be added somehow.

Would it be sensible for the compiler to move the OpControlBarrier out of the controlflow? I'm expecting problems with deeply nested control flow but it might be worth exploring.

I think you need to define what code means at the Rust level. It's possible that perhaps there's a reasonable definition which, as a consequence, requires moving OpControlBarriers out of control flow. But defining the language via "it behaves in way XYZ because that happens to be the result of the compiler transform" generally doesn't lead to a happy place.

That said, the OpControlBarrier is inside of a function. Here, that function can be inlined, and in the medium term everything is going to be inlined anyway because that's the status quo. But relying on the visibility of the OpControlBarrier seems likely to cause problems in the longer term.

XAMPPRocky · 2020-10-20T12:35:15Z

As an non graphics programmer, I think it would nice to include more context in the RFC, as right now there's not a lot. The RFC doesn't explain what "LDS" is for example, and googling "LDS graphics" brings up graphics for the Church of Latter Day Saints, nothing about "Local Data Share".

khyperia · 2020-10-20T12:42:57Z

hey, the utah teapot is graphics-y and is clearly from utah, so church of LDS is of course related to graphics, right? :D (sorry)

Jasper-Bekkers · 2020-10-20T18:05:08Z

"LDS shader" gives heaps of resources that will explain every detail to you if you want, I don't think we should need to explain these things in these kinds of discussions, and I suggest we don't derail this one further.

However, if you came here looking for an introduction to GPU programming, be sure to check out https://gpuopen.com/learn/optimizing-gpu-occupancy-resource-usage-large-thread-groups/ and https://anteru.net/blog/2018/even-more-compute-shaders/

hrydgard · 2020-10-20T18:35:09Z

What's maybe a little bit confusing even for non-console graphics programmers is that LDS is an AMD GCN-specific abbreviation, it might be better to use a more generic term like groupshared memory.

khyperia · 2020-10-20T18:54:59Z

I don't think we should need to explain these things in these kinds of discussions

I disagree - we need to keep in mind both the audience to this repository as well as contributors. Not everyone has the same expertise areas, and the real magic happens when cross-expertise discussion happens - and that means giving background and context. It doesn't have to be long, or take much time to write - for example, the second paragraph in this comment would have been immensely helpful for both me and others, and would have saved everyone time. In any case, on a pure value basis rather than culture, explaining context and definitions opens up new collaborators who may not have been able to contribute before, and that's always good.

khyperia · 2021-04-01T13:48:07Z

Closing this due to inactivity, if we'd like to start pushing on this again, we can reopen.

nhaehnle reviewed Sep 11, 2020

View reviewed changes

Jasper-Bekkers mentioned this pull request Sep 11, 2020

SPIR-V wishlist #17

Open

Jasper-Bekkers added 2 commits September 11, 2020 18:10

LDS access proposal

68b81db

Tobias's proposal and ThreadIdx

dbfa0ac

Tobski mentioned this pull request Nov 9, 2020

Add barrier proposal #216

Closed

khyperia closed this Apr 1, 2021

XAMPPRocky deleted the lds-barriers branch April 30, 2021 07:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LDS access proposal #29

LDS access proposal #29

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020 •

edited

Loading

nhaehnle Sep 11, 2020

Jasper-Bekkers Sep 11, 2020

nhaehnle Sep 11, 2020

nhaehnle commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

nhaehnle commented Sep 12, 2020

XAMPPRocky commented Oct 20, 2020

khyperia commented Oct 20, 2020

Jasper-Bekkers commented Oct 20, 2020

hrydgard commented Oct 20, 2020 •

edited

Loading

khyperia commented Oct 20, 2020

khyperia commented Apr 1, 2021

LDS access proposal #29

LDS access proposal #29

Conversation

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020 • edited Loading

nhaehnle Sep 11, 2020

Choose a reason for hiding this comment

Jasper-Bekkers Sep 11, 2020

Choose a reason for hiding this comment

nhaehnle Sep 11, 2020

Choose a reason for hiding this comment

nhaehnle commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

Jasper-Bekkers commented Sep 11, 2020

nhaehnle commented Sep 12, 2020

XAMPPRocky commented Oct 20, 2020

khyperia commented Oct 20, 2020

Jasper-Bekkers commented Oct 20, 2020

hrydgard commented Oct 20, 2020 • edited Loading

khyperia commented Oct 20, 2020

khyperia commented Apr 1, 2021

Jasper-Bekkers commented Sep 11, 2020 •

edited

Loading

hrydgard commented Oct 20, 2020 •

edited

Loading