ExtensionArray.fillna should not use lib.is_scalar #20411

TomAugspurger · 2018-03-19T15:12:09Z

lib.is_scalar is a bit too strict for the check we do in

Line 272 in 7273ea0

if not is_scalar(value):

. For example, it doesn't condisder ipaddress.IPv4Address as a scalar.

In [1]: from pandas._libs import lib

In [2]: import ipaddress

In [3]: lib.is_scalar(ipaddress.IPv4Address(1))
Out[3]: False

Replacing that check with a hasattr(value, '__len__') should be sufficient.

The text was updated successfully, but these errors were encountered:

jorisvandenbossche · 2018-03-19T17:00:53Z

The same for geopandas.

However, in general, also the len check might not always be good, because eg for geopandas a MultiLineString has a length .. . Or for the json-array, a dict also has a length.

This might not be easy to solve in general, it's mainly due to our flexible API where we need to interpret a lot about what is passed in ..

jorisvandenbossche · 2018-03-19T17:02:26Z

Actually, even simpler example where a length check will not work: strings.
(but of course, for only this one it would be easy to add to an additional check)

TomAugspurger · 2018-03-19T17:03:33Z

Hmm hadn't thought that through.

Perhaps we require that ExtensionArray.fillna(array) requires array to be an instance of the ExtensionArray? That should remove all ambiguity, at the cost of requiring IPArray([]).fillna(['192.168.1.1']) to be written as IPArray([]).fillna(IPArray(['192.168.1.1'])).

jorisvandenbossche · 2018-03-19T17:13:18Z

at the cost of requiring IPArray([]).fillna(['192.168.1.1']) to be written as IPArray([]).fillna(IPArray(['192.168.1.1'])).

But in this case you can still fill with IPArray([]).fillna('192.168.1.1') (filling with a scalar) ?

But such a requirement might be a good idea. Or we could also check it being "array-like" (check if is has a shape and if that shape matches).
In the current implementation, I don't think that a list (instead of array) would work anyhow, because we apply a boolean mask on it?

TomAugspurger · 2018-03-19T17:58:58Z

is_array_like seems to be what we want, since we do use boolean masking on it.

…

On Mon, Mar 19, 2018 at 12:13 PM, Joris Van den Bossche < ***@***.***> wrote: at the cost of requiring IPArray([]).fillna(['192.168.1.1']) to be written as IPArray([]).fillna(IPArray(['192.168.1.1'])). But in this case you can still fill with IPArray([]).fillna('192.168.1.1') (filling with a scalar) ? But such a requirement might be a good idea. Or we could also check it being "array-like" (check if is has a shape and if that shape matches). In the current implementation, I don't think that a list (instead of array) would work anyhow, because we apply a boolean mask on it? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#20411 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABQHIrUbFUoz-ormBr7bN5A8PzEJNsdVks5tf-c3gaJpZM4SwWBZ> .

Closes pandas-dev#20411

Closes #20411

Closes pandas-dev#20411

TomAugspurger added Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate ExtensionArray Extending pandas with custom dtypes or arrays. labels Mar 19, 2018

TomAugspurger added this to the 0.23.0 milestone Mar 19, 2018

TomAugspurger added the Effort Low label Mar 19, 2018

TomAugspurger added a commit to TomAugspurger/pandas that referenced this issue Mar 19, 2018

BUG: ExtensionArray.fillna for scalar values

6e51710

Closes pandas-dev#20411

TomAugspurger mentioned this issue Mar 19, 2018

BUG: ExtensionArray.fillna for scalar values #20412

Merged

jreback closed this as completed in #20412 Mar 19, 2018

jreback pushed a commit that referenced this issue Mar 19, 2018

BUG: ExtensionArray.fillna for scalar values (#20412)

217174b

Closes #20411

dworvos pushed a commit to dworvos/pandas that referenced this issue Apr 2, 2018

BUG: ExtensionArray.fillna for scalar values (pandas-dev#20412)

bff8193

Closes pandas-dev#20411

jorisvandenbossche mentioned this issue Jan 28, 2019

Regression in DataFrame.set_index with class instance column keys #24969

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ExtensionArray.fillna should not use lib.is_scalar #20411

ExtensionArray.fillna should not use lib.is_scalar #20411

TomAugspurger commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

TomAugspurger commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

TomAugspurger commented Mar 19, 2018 via email

ExtensionArray.__fillna__ should not use lib.is_scalar #20411

ExtensionArray.__fillna__ should not use lib.is_scalar #20411

Comments

TomAugspurger commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

TomAugspurger commented Mar 19, 2018

jorisvandenbossche commented Mar 19, 2018

TomAugspurger commented Mar 19, 2018 via email

ExtensionArray.fillna should not use lib.is_scalar #20411

ExtensionArray.fillna should not use lib.is_scalar #20411