gh-111962: Make dtoa thread-safe in `--disable-gil` builds. #112049

colesbury · 2023-11-13T20:39:07Z

This avoids using the Bigint free-list in --disable-gil builds and pre-computes the needed powers of 5 during interpreter initialization.

Issue: dtoa: thread safety in --disable-gil builds #111962

This avoids using the Bigint free-list in `--disable-gil` builds and pre-computes the needed powers of 5 during interpreter initialization.

We need the powers of 5 up to 5**512 because we only jump straight to underflow when the exponent is less than -512 (or larger than 308).

mdickinson

LGTM. A couple of naming and comment nitpicks, and a request for a comment justifying the assert(k < (1 << (Bigint_Pow5max))); assertion.

mdickinson · 2023-11-26T09:58:51Z

Include/internal/pycore_dtoa.h

@@ -35,16 +35,19 @@ struct _dtoa_state {
 /* The size of the Bigint freelist */
 #define Bigint_Kmax 7

+/* The size of the cached powers of 5 array */
+#define Bigint_Pow5max 8


Nit: I'm wondering whether there's a better name here; the "max" part seems potentially confusing (the max value stored here would be 5**2**9, right?). Bigint_Pow5count? Bigint_Pow5size?

I renamed it to Bigint_Pow5size

mdickinson · 2023-11-26T10:01:30Z

Include/internal/pycore_dtoa.h

 #ifndef PRIVATE_MEM
 #define PRIVATE_MEM 2304
 #endif
 #define Bigint_PREALLOC_SIZE \
    ((PRIVATE_MEM+sizeof(double)-1)/sizeof(double))

 struct _dtoa_state {
-    /* p5s is a linked list of powers of 5 of the form 5**(2**i), i >= 2 */
+    /* p5s is an array of powers of 5 of the form 5**(2**i), i >= 2 */


Nit: Could we adjust this comment to be more explicit about the indexing - i.e., make it clear that p5s[i] is 5**(2**(i+2)) for 0 <= i < Bigint_Pow5max?

mdickinson · 2023-11-26T10:04:35Z

Include/internal/pycore_dtoa.h

@@ -57,16 +60,18 @@ struct _dtoa_state {
 #endif  // !Py_USING_MEMORY_DEBUGGER


-/* These functions are used by modules compiled as C extension like math:


Deleted because the comment is out of date, presumably? 👍

Yeah, these comments were about why the below functions used PyAPI_FUNC (were "exported"), but the symbols are no longer exported.

mdickinson · 2023-11-26T11:18:39Z

Python/dtoa.c

@@ -685,19 +685,12 @@ pow5mult(Bigint *b, int k)

    if (!(k >>= 2))
        return b;
+    assert(k < (1 << (Bigint_Pow5max)));


This single line is by far the hardest to review, and reminds me yet again why I'd be delighted to see dtoa.c go away entirely. :-)

This says that pow5mult will never be called with k >= 1024, right?

I do believe that there's an upper limit on possible k values here, based on:

For double-to-string conversion, our input space is naturally bounded, and since we're assuming IEEE 754 binary64 format, we have at most 767 significant decimal digits in any output (an example worst case is something like (2**53 - 1) * 2**-1074), and dtoa.c is clever enough to pad with zeros when requested (e.g., with something crazy like format(math.pi, '.2000f')) rather than try to compute those zeros.

For string-to-double conversion, the input string has a potentially unbounded number of decimal digits, all of which may need to be taken into account for correct-rounding corner cases, but in those corner cases dtoa.c (in bigcomp) takes the approach of computing decimal digits of binary64 tie values and comparing with the decimal input, rather than trying to convert the decimal input to bigint-land (which would potentially require huge powers of 5).

And based on the above, 1023 certainly seems plausible as an upper bound for possible k values. I'd be hard pushed to give a proof that k can never exceed 1023, though. (At least, not without spending a lot more time than I currently have available.)

@colesbury Does the above roughly match your reasoning? Would it be possible to add a comment to the code justifying the assertion?

I've added a comment and moved the assertion up. The limits are related to the maximum base-10 exponent. For double-to-string, that's DBL_MAX_10_EXP (308). As you say, we set the limits ourselves for string-to-double, which are e=308 for overflow and e=-512 for underflow. But our exponent can be adjusted based on the number of digits, so we can see values as larges as k=535 float('1' +'0'*38 + '1E-535'), where the limits is from the combination of 512+STRTOD_DIGLIM-DBL_DIG-1.

mdickinson

Thanks for the updates! LGTM

colesbury · 2023-12-05T20:31:12Z

@mdickinson - would you please merge this?

mdickinson · 2023-12-07T13:46:54Z

@colesbury Apologies; I'd been assuming you'd merge it (and was beginning to wonder why you hadn't). :-)

Eclips4 · 2023-12-07T16:13:18Z

@colesbury Apologies; I'd been assuming you'd merge it (and was beginning to wonder why you hadn't). :-)

Probably Sam doesn't have rights to merge

…thon#112049) This updates `dtoa.c` to avoid using the Bigint free-list in --disable-gil builds and to pre-computes the needed powers of 5 during interpreter initialization. * pythongh-111962: Make dtoa thread-safe in `--disable-gil` builds. This avoids using the Bigint free-list in `--disable-gil` builds and pre-computes the needed powers of 5 during interpreter initialization. * Fix size of cached powers of 5 array. We need the powers of 5 up to 5**512 because we only jump straight to underflow when the exponent is less than -512 (or larger than 308). * Rename Py_NOGIL to Py_GIL_DISABLED * Changes from review * Fix assertion placement

colesbury added 3.13 bugs and security fixes topic-free-threading labels Nov 13, 2023

colesbury requested a review from mdickinson November 13, 2023 20:39

bedevere-app bot added the awaiting review label Nov 13, 2023

bedevere-app bot mentioned this pull request Nov 13, 2023

dtoa: thread safety in --disable-gil builds #111962

Closed

colesbury added the skip news label Nov 13, 2023

colesbury marked this pull request as draft November 13, 2023 21:26

bedevere-app bot removed the awaiting review label Nov 13, 2023

colesbury marked this pull request as ready for review November 13, 2023 22:39

bedevere-app bot added the awaiting review label Nov 13, 2023

colesbury added 2 commits November 20, 2023 10:43

pythongh-111962: Make dtoa thread-safe in --disable-gil builds.

d53c7f4

This avoids using the Bigint free-list in `--disable-gil` builds and pre-computes the needed powers of 5 during interpreter initialization.

Fix size of cached powers of 5 array.

4de2fb8

We need the powers of 5 up to 5**512 because we only jump straight to underflow when the exponent is less than -512 (or larger than 308).

colesbury force-pushed the dtoa-thread-safety branch from 4fd7747 to 4de2fb8 Compare November 20, 2023 15:43

Rename Py_NOGIL to Py_GIL_DISABLED

893f264

mdickinson approved these changes Nov 26, 2023

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Nov 26, 2023

colesbury added 2 commits November 27, 2023 10:49

Changes from review

4907b59

Fix assertion placement

d9536e5

mdickinson approved these changes Nov 27, 2023

View reviewed changes

mdickinson merged commit 2d76be2 into python:main Dec 7, 2023
30 checks passed

bedevere-app bot removed the awaiting merge label Dec 7, 2023

colesbury deleted the dtoa-thread-safety branch December 12, 2023 19:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-111962: Make dtoa thread-safe in `--disable-gil` builds. #112049

gh-111962: Make dtoa thread-safe in `--disable-gil` builds. #112049

colesbury commented Nov 13, 2023 •

edited by bedevere-app bot

Loading

mdickinson left a comment

mdickinson Nov 26, 2023

colesbury Nov 26, 2023

mdickinson Nov 26, 2023

mdickinson Nov 26, 2023

colesbury Nov 26, 2023

mdickinson Nov 26, 2023

colesbury Nov 27, 2023

mdickinson left a comment

colesbury commented Dec 5, 2023

mdickinson commented Dec 7, 2023

Eclips4 commented Dec 7, 2023

		@@ -57,16 +60,18 @@ struct _dtoa_state {
		#endif // !Py_USING_MEMORY_DEBUGGER


		/* These functions are used by modules compiled as C extension like math:

gh-111962: Make dtoa thread-safe in --disable-gil builds. #112049

gh-111962: Make dtoa thread-safe in --disable-gil builds. #112049

Conversation

colesbury commented Nov 13, 2023 • edited by bedevere-app bot Loading

mdickinson left a comment

Choose a reason for hiding this comment

mdickinson Nov 26, 2023

Choose a reason for hiding this comment

colesbury Nov 26, 2023

Choose a reason for hiding this comment

mdickinson Nov 26, 2023

Choose a reason for hiding this comment

mdickinson Nov 26, 2023

Choose a reason for hiding this comment

colesbury Nov 26, 2023

Choose a reason for hiding this comment

mdickinson Nov 26, 2023

Choose a reason for hiding this comment

colesbury Nov 27, 2023

Choose a reason for hiding this comment

mdickinson left a comment

Choose a reason for hiding this comment

colesbury commented Dec 5, 2023

mdickinson commented Dec 7, 2023

Eclips4 commented Dec 7, 2023

gh-111962: Make dtoa thread-safe in `--disable-gil` builds. #112049

gh-111962: Make dtoa thread-safe in `--disable-gil` builds. #112049

colesbury commented Nov 13, 2023 •

edited by bedevere-app bot

Loading