Python Numpy mask NaN not working

Question

I'm simply trying to use a masked array to filter out some nanentries.

import numpy as np
# x = [nan, -0.35, nan]
x = np.ma.masked_equal(x, np.nan)
print x

This outputs the following:

masked_array(data = [        nan -0.33557216         nan],
         mask = False,
   fill_value = nan)

Calling np.isnan() on x returns the correct boolean array, but the mask just doesn't seem to work. Why would my mask not be working as I expect?

that works, thanks. if you post an answer I can close this question — chris
– chris, Commented Jan 26, 2015 at 23:27

Community · Accepted Answer · 2017-05-23 11:53:24Z

18

You can use np.ma.masked_invalid:

import numpy as np

x = [np.nan, 3.14, np.nan]
mx = np.ma.masked_invalid(x)

print(repr(mx))
# masked_array(data = [-- 3.14 --],
#              mask = [ True False  True],
#        fill_value = 1e+20)

Alternatively, use np.isnan(x) as the mask= parameter to np.ma.masked_array:

print(repr(np.ma.masked_array(x, np.isnan(x))))
# masked_array(data = [-- 3.14 --],
#              mask = [ True False  True],
#        fill_value = 1e+20)

Why doesn't your original approach work? Because, rather counterintuitively, NaN is not equal to NaN!

print(np.nan == np.nan)
# False

This is actually part of the IEEE-754 definition of NaN

edited May 23, 2017 at 11:53

CommunityBot

11 silver badge

answered Jan 26, 2015 at 23:28

ali_m

74.6k28 gold badges230 silver badges314 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

chuseuiti · Accepted Answer · 2015-02-26 04:59:23Z

7

Here is another alternative without using mask:

import numpy as np
#x = [nan, -0.35, nan]
xmask=x[np.logical_not(np.isnan(x))]
print(xmask)

Result:

array([-0.35])

edited Feb 26, 2015 at 4:59

answered Feb 26, 2015 at 4:54

chuseuiti

8131 gold badge10 silver badges32 bronze badges

2 Comments

Guimoute Over a year ago

And for anyone wondering, np.logical_not is needed here instead of the built-in not because the latter does not broadcast.

ryanjdillon Over a year ago

np.isnan() returns a boolean array, and as such using x[~np.isnan(x)] would suffice. np.logical_not() is useful for non-boolean array use-cases.

Collectives™ on Stack Overflow

Python Numpy mask NaN not working

2 Answers 2

Comments

2 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

2 Comments

Linked

Related