
Try to fix macOS hang when building Python 2 #1343

Merged: 7 commits into main, Feb 15, 2024
Conversation

BillyONeal (Member):

  • Don't open multiple handles to the same file when looking for absolute paths.
  • Don't use std::future.

Rather than opening separate file handles, reading the first part with one and the rest with the other, introduce best_effort_read_contents_if_shebang, which performs the shebang check directly on the first read buffer.

This also fixes read_contents to avoid copying the entire file down by 3 bytes when the file starts with a BOM, using the same idea.
@BillyONeal BillyONeal marked this pull request as ready for review February 13, 2024 01:15
BillyONeal (Member, Author):

I kicked off 3 copies of the osx-x64 build to see if this helps:

Initial results are promising: all 3 of these got past Python 2 without issues. But without a full understanding of the cause of the problem, it is difficult to be sure.


std::vector<std::future<void>> workers;
workers.reserve(num_threads - 1);
WorkCallbackContext(F init_f, size_t work_count) : work(init_f), work_count(work_count), next_offset(0) { }
Contributor:
Suggested change:
- WorkCallbackContext(F init_f, size_t work_count) : work(init_f), work_count(work_count), next_offset(0) { }
+ WorkCallbackContext(F init_f, size_t work_count) : work(std::move(init_f)), work_count(work_count), next_offset(0) { }

BillyONeal (Member, Author):

This is a functor/callable, which is usually a pointer, and for which std::move is a pessimization. (This is also why these are passed by value.)

ras0219-msft (Contributor), Feb 14, 2024:

I understand why taking it by rvalue reference would be a pessimization, but how would std::move reduce performance? For simple pointers, isn't it the same as a copy?

BillyONeal (Member, Author):

std::move forces the value to be bound to an rvalue reference. This is why, for example, it breaks NRVO in return std::move(blah); cases. It likely wouldn't matter in this case; I'm just used to 'pass the functors by value' resulting from the mess that caused us to add _Pass_fn: passing functors by reference broke the ability to inline through function pointers. Maybe my paranoia here is out of date by now, but given that we control all the functors here and will never pass ones that are expensive to copy, I'm inclined to keep following the convention.

https://github.com/microsoft/STL/blob/bd3d740ae5de7255c720b8133c5d23aa131e0760/stl/inc/xutility#L558-L584

auto offset = next_offset.load(std::memory_order_relaxed);
while (offset < work_count)
{
    if (!next_offset.compare_exchange_weak(offset, offset + 1, std::memory_order_relaxed))
Contributor:

Could this use fetch_add instead, since we can assume that work_count + threads < SIZE_MAX?

BillyONeal (Member, Author):

I'm not sure we can make that assumption. (fetch_add was @Thomas1664's initial solution, and I requested this form to avoid overflow.)

I suppose we could assume this is unlikely and add edge-case handling for it, though I'm not sure that would be better... stand by.

Contributor:

Could this use fetch_add instead, since we can assume that work_count + threads < SIZE_MAX?

work_count could reach SIZE_MAX. I remember that some PRs that used smaller types in the context of the install plan were rejected.

Review threads:
include/vcpkg/base/parallel-algorithms.h (resolved)
include/vcpkg/base/parallel-algorithms.h (outdated, resolved)
src/vcpkg/base/files.cpp (resolved)
src/vcpkg/base/files.cpp (outdated, resolved)
src/vcpkg/base/files.cpp (outdated, resolved)
if (ptp_work)
{
    auto max_threads = (std::min)(work_count, static_cast<size_t>(get_concurrency()));
    max_threads = (std::min)(max_threads, SIZE_MAX - work_count); // to avoid overflow in fetch_add
Contributor:

Suggested change:
- max_threads = (std::min)(max_threads, SIZE_MAX - work_count); // to avoid overflow in fetch_add
+ if (SIZE_MAX - work_count > max_threads) Checks::unreachable(VCPKG_LINE_INFO);

I don't think trying to be fancy here to support more than 18,446,744,073,709,550,000-ish (or 4,294,966,000-ish) items is worth it.

Also, I think the current line is pessimistic by 1 item/thread -- if work_count == SIZE_MAX, 1 thread is permissible.

BillyONeal (Member, Author):

I don't think trying to be fancy here to support more than 18,446,744,073,709,550,000-ish (or 4,294,966,000-ish) items is worth it.

I think it is worth it, at least as long as 32-bit is not dead. (And arm-linux is still very much a thing, even if we won't get much parallelism there.)

Also, I think the current line is pessimistic by 1 item/thread -- if work_count == SIZE_MAX, 1 thread is permissible.

Sure. (Note that work_count == SIZE_MAX is impossible because the end() element can't exist in the address space. :) )

BillyONeal (Member, Author):

(Moreover, a dependency from this stuff on Checks:: makes this more complex, not less)

@BillyONeal BillyONeal merged commit 05320e4 into main Feb 15, 2024
5 checks passed
@BillyONeal BillyONeal deleted the try-to-fix-macos-hang branch February 15, 2024 18:56