Add a column to numpy 2d array

Question

I have a 60000 by 200 numpy array. I want to make it 60000 by 201 by adding a column of 1's to the right (so every row is [prev, 1]).

Concatenate with axis = 1 doesn't work because it seems like concatenate requires all input arrays to have the same dimension.

How should I do this?

Hun · Accepted Answer · 2016-04-27 00:27:03Z

63

Let me just throw in a very simple example with much smaller size. The principle should be the same.

a = np.zeros((6,2))
    array([[ 0.,  0.],
           [ 0.,  0.],
           [ 0.,  0.],
           [ 0.,  0.],
           [ 0.,  0.],
           [ 0.,  0.]])
b = np.ones((6,1))
    array([[ 1.],
           [ 1.],
           [ 1.],
           [ 1.],
           [ 1.],
           [ 1.]])

np.hstack((a,b))
array([[ 0.,  0.,  1.],
       [ 0.,  0.,  1.],
       [ 0.,  0.,  1.],
       [ 0.,  0.,  1.],
       [ 0.,  0.,  1.],
       [ 0.,  0.,  1.]])

answered Apr 27, 2016 at 0:27

Hun

3,8872 gold badges17 silver badges16 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

Jobs Over a year ago

This works! My array is 60000 by 201 now. Thanks. the trick here is to define np.ones() as (6, 1) instead of (6,), which is what I've been doing.

Sandu Ursu Over a year ago

Interestingly if you define b=np.ones(6) the procedure won't work. Sometimes Numpy makes me vomit.

mtruong1999 Over a year ago

@SanduUrsu You would need to do b = np.ones((6,1)) for that to work

P i Over a year ago

@SanduUrsu examine .shape for np.ones(6) and np.ones((6,1))

Stone · Accepted Answer · 2020-07-28 23:41:12Z

27

Using numpy index trick to append a 1D vector to a 2D array

a = np.zeros((6,2))
# array([[ 0.,  0.],
#        [ 0.,  0.],
#        [ 0.,  0.],
#        [ 0.,  0.],
#        [ 0.,  0.],
#        [ 0.,  0.]])
b = np.ones(6) # or np.ones((6,1))
#array([1., 1., 1., 1., 1., 1.])
np.c_[a,b]
# array([[0., 0., 1.],
#        [0., 0., 1.],
#        [0., 0., 1.],
#        [0., 0., 1.],
#        [0., 0., 1.],
#        [0., 0., 1.]])

answered Jul 28, 2020 at 23:41

Stone

1,17910 silver badges17 bronze badges

Comments

hpaulj · Accepted Answer · 2016-04-27 01:38:33Z

Under cover all the stack variants (including append and insert) end up doing a concatenate. They just precede it with some sort of array reshape.

In [60]: A = np.arange(12).reshape(3,4)

In [61]: np.concatenate([A, np.ones((A.shape[0],1),dtype=A.dtype)], axis=1)
Out[61]: 
array([[ 0,  1,  2,  3,  1],
       [ 4,  5,  6,  7,  1],
       [ 8,  9, 10, 11,  1]])

Here I made a (3,1) array of 1s, to match the (3,4) array. If I wanted to add a new row, I'd make a (1,4) array.

While the variations are handy, if you are learning, you should become familiar with concatenate and the various ways of constructing arrays that match in number of dimensions and necessary shapes.

Community · Accepted Answer · 2017-05-23 12:31:18Z

4

The first thing to think about is that numpy arrays are really not meant to change size. So you should ask yourself, can you create your original matrix as 60k x 201 and then fill the last column afterwards. This is usually best.

If you really must do this, see How to add column to numpy array

edited May 23, 2017 at 12:31

CommunityBot

11 silver badge

answered Apr 27, 2016 at 0:29

Alan

9,65020 silver badges26 bronze badges

Comments

Randerson · Accepted Answer · 2016-04-27 00:50:13Z

1

I think the numpy method column_stack is more interesting because you do not need to create a column numpy array to stack it in the matrix of interest. With the column_stack you just need to create a normal numpy array.

answered Apr 27, 2016 at 0:50

Randerson

7851 gold badge5 silver badges21 bronze badges

4 Comments

hpaulj Over a year ago

What does column_stack do (under the cover) that is different from hstack?

Randerson Over a year ago

Does the same, but do do not need to create a column array was you did. You can use a line array and the column_stack will stack it as a new column of the matrix you want to be stacked with a new column. With the hstack you must use a column array otherwise it will return a error.

Stone Over a year ago

@hpaulj: the first answer just shows an example of column_stack stackoverflow.com/a/63144310/3854750. As Randerson mentioned, the second array being added can be either column array of shape (N,1) or just a simple linear array of shape (N,)

hpaulj Over a year ago

column_stack just makes sure the array(s) is 2d, changing the (N,) to (N,1) if necessary. All these 'stack' functions end up using np.concatenate, with varying degrees of frontend tweaking.

Collectives™ on Stack Overflow

Add a column to numpy 2d array

5 Answers 5

4 Comments

Comments

Comments

Comments

4 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

4 Comments

Comments

Comments

Comments

4 Comments

Linked

Related