Locals aligned to greater than page size can cause unsound behavior #70143

retep998 · 2020-03-19T09:50:43Z

Forked from #70022

Minimal example

#[repr(align(0x10000))]
struct Aligned(u8);

fn main() {
    let x = Aligned(0);
    println!("{:#x}", &x as *const _ as usize);
}

Aligning the stack is done after the stack probe. Because stacks grow downwards and aligning the stack shifts it downwards, it can cause the end of the stack to extend past the guard page and cause invalid access exceptions or worse when those sections of the stack are touched.

~~Only confirmed that this occurs on pc-windows-msvc~~ (pnkfelix edit: see comment thread, its a more general problem.)

The text was updated successfully, but these errors were encountered:

Amanieu · 2020-03-19T09:52:36Z

This is most likely an LLVM bug.

RalfJung · 2020-04-22T13:12:30Z

Could someone who knows the technical details here report this bug upstream with LLVM?

LeSeulArtichaut · 2020-06-17T21:01:13Z

@rustbot ping llvm

rustbot · 2020-06-17T21:01:15Z

Hey LLVM ICE-breakers! This bug has been identified as a good
"LLVM ICE-breaking candidate". In case it's useful, here are some
instructions for tackling these sorts of bugs. Maybe take a look?
Thanks! <3

cc @camelid @comex @cuviper @DutchGhost @dyncat @hanna-kruppe @hdhoang @heyrutvik @JOE1994 @jryans @mmilenko @nagisa @nikic @Noah-Kennedy @SiavoshZarrasvand @spastorino @vertexclique

ghost · 2020-07-12T11:30:12Z

I think this is platform-specific but it still all happens to varying degrees on x86(-64). This safe code causes a stack overflow on Ubuntu:

#[repr(align(0x800000))]
struct Aligned(usize);

fn main() {
    let x = Aligned(2);
    println!("{}", x.0);
}

This safe code causes a segfault on Ubuntu:

#[repr(align(0x10000000))]
struct Aligned(usize);

fn main() {
    let x = Aligned(2);
    println!("{}", x.0);
}

This isn't specific to Windows, then, and can be used to cause stack overflows and/or segfaults in safe code.

I don't know how this would be done, but maybe doing such alignments should only be allowed if unsafe is used to keep it out of safe code? Embedded might depend on high alignments, so I don't think removing the ability to align with high numbers entirely would be a good solution.

RalfJung · 2020-07-12T11:40:44Z

Note that we do install a stack guard and associated handler to check for stack overflows. So just getting a stack overflow or segfault does not necessarily demonstrate a problem. Rust cannot statically make sure that your stack is always big enough, so being able to trigger a stack overflow through big locals or deep recursion is entirely expected behavior.

The bug here is about the locals not having the alignment that was requested via repr. As far as I can see, your tests do not demonstrate that this happens here.

retep998 · 2020-07-12T20:16:04Z

The bug here is about the locals not having the alignment that was requested via repr.

The locals do absolutely have the required alignment. The issue is with the stack aligning prologue which can cause the function's frame to exist entirely past the guard page so unrelated memory can be stomped on.

cuviper · 2020-07-13T17:21:04Z

I just tried this on godbolt: https://rust.godbolt.org/z/8xfn4v

#[repr(align(0x10000000))]
struct Aligned(usize);

pub fn foo() -> usize {
    Aligned(2).0
}

The output for x86_64-pc-windows-msvc:

example::foo:
        push    rbp
        mov     eax, 536870896    # 0x1ffffff0
        call    __chkstk
        sub     rsp, rax
        lea     rbp, [rsp + 128]
        and     rsp, -268435456    # 0xfffffffff0000000
        mov     qword ptr [rsp], 2
        mov     rax, qword ptr [rsp]
        lea     rsp, [rbp + 536870768]    # 0x1fffff70
        pop     rbp
        ret

In the worst case, the sub could move rsp to the very top of an aligned block, or anywhere more than a page size away from the aligned base. Then the large and will round this down, which might jump over the stack guard page, and this wasn't tested by __chkstk.

The output for x86_64-unknown-linux-gnu:

example::foo:
        push    rbp
        mov     rbp, rsp
        and     rsp, -268435456    # 0xfffffffff0000000
        mov     eax, 536870912    # 0x20000000
        call    __rust_probestack
        sub     rsp, rax
        mov     qword ptr [rsp], 2
        mov     rax, qword ptr [rsp]
        mov     rsp, rbp
        pop     rbp
        ret

Here the alignment is done first, but it's still bad. We might start with rsp more than a page size unaligned, and then again the large and can jump over the guard page. Then __rust_probestack could be probing non-stack memory and think everything is fine.

Here's one more Linux test I ran locally, using LLVM 11's "probe-stack"="inline-asm":

example::foo:
        pushq   %rbp
        movq    %rsp, %rbp
        andq    $-268435456, %rsp    # 0xfffffffff0000000
        movq    %rsp, %r11
        subq    $536870912, %r11    # 0x20000000
.LBB0_1:
        subq    $4096, %rsp
        movq    $0, (%rsp)
        cmpq    %r11, %rsp
        jne     .LBB0_1
        movq    $2, (%rsp)
        movq    (%rsp), %rax
        movq    %rbp, %rsp
        popq    %rbp
        retq

Again, the initial alignment could jump the guard page, then the probe loop may be clobbering non-stack memory.
(cc @serge-sans-paille)

Amanieu · 2020-07-13T20:31:11Z

It seems that the ideal behavior here would be to add the alignment to the size passed to __chkstk/__rust_probestack and perform the stack alignment masking after the stack probing.

cuviper · 2020-07-13T20:33:40Z

Yeah, I've been talking privately with Serge, and he's going to try implementing something like that.

RalfJung · 2020-07-14T06:41:50Z

I thought this is what already happens?

Aligning the stack is done after the stack probe.

And the fix would be to align first and then run the stack probe?

retep998 · 2020-07-14T23:06:42Z

I thought this is what already happens?

Aligning the stack is done after the stack probe.

And the fix would be to align first and then run the stack probe?

The fix is to first probe the stack for the frame size plus the alignment, and then align the stack. Currently stack probes only check the frame size without adding the alignment to that size.

cuviper · 2020-07-14T23:14:02Z

Note that the size of the alignment adjustment is a dynamic value -- the offset from whatever incoming stack pointer we get to an aligned pointer below that. Then the actual frame size is allocated after we have known alignment. If the stack guard page is anywhere in that total range, we need to fault, which is what the probing does.

RalfJung · 2020-07-15T06:28:15Z

I guess I am wondering why we can't first align and then probe for the already computed new stack size, that sounds like it avoids doing the aligning twice. But it probably doesn't work for other reasons I cannot see.

retep998 · 2020-07-17T01:19:18Z

@RalfJung Because after aligning the current stack pointer can be past the guard page in a completely unrelated section of memory, that could be completely valid memory for the full size of the stack frame that the stack probe could succeed on, but it's not our stack and we'd be trampling that unrelated memory!

We don't need to do the aligning twice as we can do a simple conservative stack probe of frame size + alignment, which is always sound, but it can trigger stack overflows when there is still a bit of space left. A smarter combination of stack probing + aligning can also avoid computing the aligned address twice, but might involve more implementation effort. Regardless of what technique we use, this only matters for functions with locals that are aligned to a page size or greater.

RalfJung · 2020-07-17T09:49:04Z

Ah I see, I somehow assumed stack probing would still have access to the old top-of-stack and could thus probe all the way from there to the aligned stack with the new frame.

cuviper · 2020-07-17T14:11:31Z

The examples I gave do have the old pointer in rbp. However, once we've moved rsp, a signal could be delivered and run its handler from that stack location, before we've probed that it's safe. For that matter, __chkstk and __rust_probestack need to have a valid stack to be called at all.

…otection As reported in rust-lang/rust#70143 alignment is not taken into account when doing the probing. Fix that by adjusting the first probe if the stack align is small, or by extending the dynamic probing if the alignment is large. Differential Revision: https://reviews.llvm.org/D84419

…otection As reported in rust-lang/rust#70143 alignment is not taken into account when doing the probing. Fix that by adjusting the first probe if the stack align is small, or by extending the dynamic probing if the alignment is large. Differential Revision: https://reviews.llvm.org/D84419 (cherry picked from commit f2c6bfa)

…otection As reported in rust-lang/rust#70143 alignment is not taken into account when doing the probing. Fix that by adjusting the first probe if the stack align is small, or by extending the dynamic probing if the alignment is large. Differential Revision: https://reviews.llvm.org/D84419

…otection As reported in rust-lang/rust#70143 alignment is not taken into account when doing the probing. Fix that by adjusting the first probe if the stack align is small, or by extending the dynamic probing if the alignment is large. Differential Revision: https://reviews.llvm.org/D84419 (cherry picked from commit 387e2c2)

…otection As reported in rust-lang/rust#70143 alignment is not taken into account when doing the probing. Fix that by adjusting the first probe if the stack align is small, or by extending the dynamic probing if the alignment is large. Differential Revision: https://reviews.llvm.org/D84419

pnkfelix · 2022-12-09T15:40:47Z

Visiting for P-high review.

In the absence of a fix for this issue, I'd like to see a diagnostic warning people who have locals with alignment > page-size that they might hit this. rust-lang/rust-clippy#8593 seems relevant to that.

Beyond that, my main question on #70143 is whether we can find an owner to try to implement the fix that's described. It sounds like @cuviper has already done a fair amount of experimentation. I don't know if they have capacity to own moving further with it.

pnkfelix · 2022-12-09T15:43:23Z

@rustbot assign @cuviper

(to be clear, this is not putting cuviper on the hook for implementing a fix. I just want someone to own keeping track of the status and try to prod on things.)

cuviper · 2022-12-10T01:15:45Z

Updating my earlier godbolt to current nightly (https://rust.godbolt.org/z/hseenMTKP) still shows the same asm, still just as problematic. However, I also rechecked "probe-stack"="inline-asm" since #102503 turned that on for the upcoming LLVM 16, and it's better, but I found a bug: https://reviews.llvm.org/D139756

(NB: inline-asm won't apply for Windows and its __chkstk.)

Even /ALIGN:4096 will still lead to a crash in Debug mode. The following issues might be relevant: rust-lang/rust#70022 rust-lang/rust#70143

cuviper · 2024-03-05T01:32:13Z

I think we can now consider this a Windows-only bug, because the rest are now using "probe-stack"="inline-asm", except for arches that lack stack probes entirely. It's not really specific to MSVC either, as windows-gnu generates the same code calling __chkstk.

@rustbot label -O-linux -O-windows-msvc +O-windows

retep998 added T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. O-windows-msvc Toolchain: MSVC, Operating system: Windows I-unsound Issue: A soundness hole (worst kind of bug), see: https://en.wikipedia.org/wiki/Soundness labels Mar 19, 2020

retep998 changed the title ~~Alignment greater than page size locals can cause invalid access exceptions.~~ Locals aligned to greater than page size can cause unsound behavior Mar 19, 2020

retep998 added the A-LLVM Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues. label Mar 19, 2020

jonas-schievink added the C-bug Category: This is a bug. label Mar 19, 2020

retep998 mentioned this issue Mar 19, 2020

Statics don't support alignments larger than the page size #70022

Open

Amanieu mentioned this issue Mar 20, 2020

Limit maximum alignment in #[repr(align)] to 4K #70193

Closed

Elinvynia added the I-prioritize Issue: Indicates that prioritization has been requested for this issue. label Jun 9, 2020

spastorino added P-high High priority and removed I-prioritize Issue: Indicates that prioritization has been requested for this issue. labels Jun 17, 2020

rustbot added the ICEBreaker-LLVM Bugs identified for the LLVM ICE-breaker group label Jun 17, 2020

cuviper added the O-linux Operating system: Linux label Jul 13, 2020

RalfJung mentioned this issue Aug 15, 2020

.rodata bloat with aligned zero sized objects #75524

Open

Jaic1 mentioned this issue Mar 27, 2022

Add a lint for static items with large type alignment rust-lang/rust-clippy#8593

Closed

rustbot assigned cuviper Dec 9, 2022

gburgessiv mentioned this issue Feb 25, 2023

Add known-bug tests for all(?) I-unsound issues #105107

Open

byeongkeunahn added a commit to byeongkeunahn/basm-rs that referenced this issue Aug 2, 2023

Hotfix for unexpected crash

ddde92c

Even /ALIGN:4096 will still lead to a crash in Debug mode. The following issues might be relevant: rust-lang/rust#70022 rust-lang/rust#70143

rustbot added O-windows Operating system: Windows and removed O-linux Operating system: Linux O-windows-msvc Toolchain: MSVC, Operating system: Windows labels Mar 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Locals aligned to greater than page size can cause unsound behavior #70143

Locals aligned to greater than page size can cause unsound behavior #70143

retep998 commented Mar 19, 2020 •

edited by rustbot

Loading

Amanieu commented Mar 19, 2020

RalfJung commented Apr 22, 2020

LeSeulArtichaut commented Jun 17, 2020

rustbot commented Jun 17, 2020

ghost commented Jul 12, 2020 •

edited by ghost

Loading

RalfJung commented Jul 12, 2020 •

edited

Loading

retep998 commented Jul 12, 2020

cuviper commented Jul 13, 2020

Amanieu commented Jul 13, 2020

cuviper commented Jul 13, 2020

RalfJung commented Jul 14, 2020

retep998 commented Jul 14, 2020

cuviper commented Jul 14, 2020

RalfJung commented Jul 15, 2020

retep998 commented Jul 17, 2020 •

edited

Loading

RalfJung commented Jul 17, 2020

cuviper commented Jul 17, 2020 •

edited

Loading

pnkfelix commented Dec 9, 2022

pnkfelix commented Dec 9, 2022 •

edited

Loading

cuviper commented Dec 10, 2022 •

edited

Loading

cuviper commented Mar 5, 2024

Locals aligned to greater than page size can cause unsound behavior #70143

Locals aligned to greater than page size can cause unsound behavior #70143

Comments

retep998 commented Mar 19, 2020 • edited by rustbot Loading

Amanieu commented Mar 19, 2020

RalfJung commented Apr 22, 2020

LeSeulArtichaut commented Jun 17, 2020

rustbot commented Jun 17, 2020

ghost commented Jul 12, 2020 • edited by ghost Loading

RalfJung commented Jul 12, 2020 • edited Loading

retep998 commented Jul 12, 2020

cuviper commented Jul 13, 2020

Amanieu commented Jul 13, 2020

cuviper commented Jul 13, 2020

RalfJung commented Jul 14, 2020

retep998 commented Jul 14, 2020

cuviper commented Jul 14, 2020

RalfJung commented Jul 15, 2020

retep998 commented Jul 17, 2020 • edited Loading

RalfJung commented Jul 17, 2020

cuviper commented Jul 17, 2020 • edited Loading

pnkfelix commented Dec 9, 2022

pnkfelix commented Dec 9, 2022 • edited Loading

cuviper commented Dec 10, 2022 • edited Loading

cuviper commented Mar 5, 2024

retep998 commented Mar 19, 2020 •

edited by rustbot

Loading

ghost commented Jul 12, 2020 •

edited by ghost

Loading

RalfJung commented Jul 12, 2020 •

edited

Loading

retep998 commented Jul 17, 2020 •

edited

Loading

cuviper commented Jul 17, 2020 •

edited

Loading

pnkfelix commented Dec 9, 2022 •

edited

Loading

cuviper commented Dec 10, 2022 •

edited

Loading