PERF: pass through to numpy validation for iloc setitem #32257

jbrockmendel · 2020-02-26T02:00:45Z

We lose a little ground on the range and slice cases (not really sure why) and pick up a larger amount on the list and ndarray cases.

In [3]: ser = pd.Series(range(10**5))
In [4]: key = range(100, 200)
In [5]: key2 = list(key)
In [6]: key3 = slice(100, 200)
In [7]: key4 = np.array(key2)

In [16]: %timeit ser.iloc[key] = 1
56.3 µs ± 1.19 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- master
62.9 µs ± 1.77 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- PR

In [17]: %timeit ser.iloc[key2] = 1
95.6 µs ± 2.69 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- master
55.6 µs ± 922 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- PR

In [20]: %timeit ser.iloc[key3] = 1
49 µs ± 756 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- master
50.6 µs ± 1.01 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- PR

In [21]: %timeit ser.iloc[key4] = 1
71.6 µs ± 1.98 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- master
45.7 µs ± 427 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)  # <-- PR
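
For anyone wanting to rerun the comparison outside IPython, here is a minimal sketch using the standard-library `timeit` module. The helper name `time_iloc_setitem` and the loop counts are assumptions made for illustration, not part of the PR. Absolute numbers will differ by machine and pandas version.

```python
import timeit

import numpy as np
import pandas as pd


def time_iloc_setitem(key, n=10**5, number=200, repeat=3):
    """Return the best per-call time, in microseconds, of ser.iloc[key] = 1.

    Hypothetical helper for illustration; not part of pandas.
    """
    ser = pd.Series(range(n))

    def stmt():
        ser.iloc[key] = 1

    # Take the minimum over repeats to reduce noise from other processes.
    best = min(timeit.repeat(stmt, number=number, repeat=repeat))
    return best / number * 1e6


# The same four key types as in the description above.
keys = {
    "range": range(100, 200),
    "list": list(range(100, 200)),
    "slice": slice(100, 200),
    "ndarray": np.array(list(range(100, 200))),
}

results = {name: time_iloc_setitem(key) for name, key in keys.items()}

for name, micros in results.items():
    print(f"{name:>8}: {micros:.1f} µs per loop")
```

Running this once on master and once on the PR branch gives the two columns of the comparison.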


jreback

lgtm. minor nit

jreback · 2020-02-26T12:41:58Z

pandas/core/indexing.py

+        if isinstance(key, tuple):
+            if len(key) > self.ndim:
+                # TODO: we could let this fall through if we are OK
+                #  with having numpy raise IndexError("too many indices for array")


would be ok i think
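
For reference, a small sketch of the numpy behavior the TODO refers to: indexing a 1-D array with a 2-tuple makes numpy itself raise `IndexError`. This shows plain numpy only; how pandas surfaces the error after the change is not shown here, and the exact wording of the message varies across numpy versions.

```python
import numpy as np

arr = np.arange(10)  # 1-D array, so a 2-tuple key has too many indices

try:
    arr[0, 0] = 1
except IndexError as err:
    # Newer numpy versions append detail after "too many indices for array".
    message = str(err)
    print(message)
```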

jreback · 2020-02-26T12:42:54Z

pandas/tests/frame/indexing/test_indexing.py

@@ -27,6 +27,9 @@

 from pandas.tseries.offsets import BDay

+# We pass through a TypeError raised by numpy


nit: you used iloc_msg in the other function (non-private); I would make these names more verbose, e.g.

slice_exception_msg
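
As an illustration of the kind of `TypeError` that numpy raises on its own, here is a sketch assuming the key is a slice with a float bound. The diff excerpt above does not show which case the test comment covers, so treat the choice of key as an assumption.

```python
import numpy as np

arr = np.arange(10)

try:
    # Float slice bounds are rejected by numpy before any assignment happens.
    arr[1.5:3] = 0
except TypeError as err:
    slice_error = str(err)
    print(slice_error)
```

Because the error is raised before anything is written, `arr` is left unchanged.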


jbrockmendel · 2020-02-26T19:35:23Z

Removed the tuple length validation; this picks up on the order of a microsecond relative to the timings posted in the OP.

jreback · 2020-02-27T12:52:38Z

thanks


jbrockmendel added 7 commits February 18, 2020 18:17

CLN: simplify iloc._get_setitem_indexer

85858d6

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

988cc83

…ks-_get_setitem_indexer

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

9448375

…ks-_get_setitem_indexer

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

b0f3261

…ks-_get_setitem_indexer

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

b1ddd6a

…ks-_get_setitem_indexer

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

8eb118b

…ks-_get_setitem_indexer

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

d31be38

…ks-_get_setitem_indexer

jreback added the Indexing (related to indexing on series/frames, not to indexes themselves) and Performance (memory or execution speed performance) labels Feb 26, 2020

jreback added this to the 1.1 milestone Feb 26, 2020

jreback requested changes Feb 26, 2020


jbrockmendel added 2 commits February 26, 2020 10:31

Merge branch 'master' of https://github.com/pandas-dev/pandas into ya…

c5a75f7

…ks-_get_setitem_indexer

Let numpy validate tuple length

8a0e78f

jreback approved these changes Feb 27, 2020


jreback merged commit 922f932 into pandas-dev:master Feb 27, 2020

jbrockmendel deleted the yaks-_get_setitem_indexer branch February 27, 2020 14:55

roberthdevries pushed a commit to roberthdevries/pandas that referenced this pull request Mar 2, 2020

PERF: pass through to numpy validation for iloc setitem (pandas-dev#3…

e4f37e0

…2257)

ellequelle mentioned this pull request Dec 31, 2021

BUG: iloc(axis=1).__setitem__ ignores axis #45032

Closed
