unable to understand this image read python code [duplicate]

Question

import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import numpy as np

# Read the PNG file into a NumPy array
image = mpimg.imread('mountain.png')
print('The image is:', type(image),
      'dimensions are:', image.shape)

# Print each colour channel separately
print(image[:,:,0])
print(image[:,:,1])
print(image[:,:,2])

I don't understand what image[:,:,0] and image[:,:,1] mean.

It fetches the red, green and blue channels of the image separately.
– willeM_ Van Onsem, Commented Jan 7, 2018 at 14:43
The canonical 'understanding slice notation' post has several answers extending this to numpy arrays, so I duped it there.
– Martijn Pieters, Commented Jan 7, 2018 at 14:53
This question shouldn't be marked as a duplicate since this involves images and color dimensions, which is more specific and has a different "meaning" than regular multidimensional arrays.
– Stev, Commented Feb 14 at 2:43

D Greenwood · Accepted Answer · 2018-01-07 14:48:31Z

2

A colour RGB image is read as a three-dimensional array. The first two dimensions are the row (y) and the column (x), and the third dimension is the colour channel, in the order red, green, blue.

The bracket notation is used to refer to subsets of this three-dimensional array in the form [row, column, channel]. A colon indicates that all values in that dimension should be selected.

Therefore image[:,:,0] refers to the red channel, image[:,:,1] to the green channel and image[:,:,2] to the blue channel.
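As a minimal sketch of this indexing, the snippet below builds a tiny synthetic array instead of reading 'mountain.png', so it runs without any image file. The array contents and the names red, green and blue are assumptions made for illustration only.

```python
import numpy as np

# Hypothetical 2x3 RGB image (2 rows, 3 columns) with float values in [0, 1].
image = np.zeros((2, 3, 3), dtype=np.float32)
image[:, :, 0] = 1.0   # full red everywhere
image[0, :, 1] = 0.5   # half-intensity green in the top row only
# The blue channel (index 2) is left at 0.

red = image[:, :, 0]    # all rows, all columns, channel 0
green = image[:, :, 1]  # all rows, all columns, channel 1
blue = image[:, :, 2]   # all rows, all columns, channel 2

print(red.shape)  # each channel is a 2-D array of shape (rows, columns)
print(green)
```

Each slice drops the last axis, so the result is one intensity value per pixel for that channel.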

willeM_ Van Onsem · Accepted Answer · 2018-01-07 14:48:33Z

Most image representations work with a bitmap in an RGB color space. An image is seen as a rectangle of pixels, and we assign a specific color to every pixel. A color is then represented as a 3-tuple, where the first item represents the intensity of red, the second the intensity of green, and the last the intensity of blue. Note that this is only one representation of an image: there are others, for instance vector graphics. There are other color spaces as well.

This means that if we load an RGB image into memory, we obtain an array with shape (h, w, 3), with h the height of the image (in pixels) and w the width of the image (again in pixels). A PNG with an alpha channel has a fourth value per pixel, giving shape (h, w, 4).
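The shape can be unpacked directly into height, width and channel count. The helper below is a hypothetical sketch (the name image_dimensions is not part of matplotlib or numpy), and it also handles a 2-D grayscale array as an assumed extra case.

```python
import numpy as np

def image_dimensions(image):
    """Return (height, width, channels) for a 2-D or 3-D image array."""
    if image.ndim == 2:
        # Grayscale: one value per pixel, no channel axis.
        h, w = image.shape
        return h, w, 1
    h, w, c = image.shape
    return h, w, c

# Synthetic stand-ins for loaded images: 4 pixels high, 6 pixels wide.
rgb = np.zeros((4, 6, 3), dtype=np.float32)
gray = np.zeros((4, 6), dtype=np.float32)

print(image_dimensions(rgb))
print(image_dimensions(gray))
```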

Now numpy allows indexing along several axes at once: we can construct a view by using image[:,:,0]. (Combining slices with an integer index like this is what numpy calls basic indexing, and it returns a view rather than a copy.) This means that we construct a (h, w)-shaped matrix, where for an item at index [i, j], we obtain the value that is placed at [i, j, 0] in the original image. We thus obtain an image that only takes the intensity of the red channel into account.
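To illustrate what "view" means here, the following sketch uses a small synthetic array (an assumption, standing in for a loaded image) and shows that writing through the slice changes the original array, while an explicit copy does not.

```python
import numpy as np

# Hypothetical 2x2 RGB image, all black.
image = np.zeros((2, 2, 3), dtype=np.float32)

red = image[:, :, 0]                 # a view onto channel 0
is_view = np.shares_memory(red, image)

red[0, 0] = 1.0                      # writes through to image[0, 0, 0]

red_copy = image[:, :, 0].copy()     # independent copy of the channel
red_copy[1, 1] = 1.0                 # does not touch the original image

print(is_view)
print(image[0, 0, 0], image[1, 1, 0])
```

If the channel needs to be modified without affecting the image, calling .copy() on the slice is the usual approach.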

The same holds for image[:,:,1] and image[:,:,2], where we take respectively the green and blue channel into account. For PNG files, matplotlib's imread returns floats, where 1.0 means maximum intensity and 0.0 means lowest intensity. For instance, if (red, green, blue) = (1.0, 0.5, 0.0), this is a color that most people see as orange.
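Many tools express the same intensities as integers from 0 to 255 instead of floats from 0.0 to 1.0. As a rough sketch of that relationship, the snippet below scales the example pixel by 255 and rounds; the variable names are assumptions made for illustration.

```python
import numpy as np

# The example pixel from the text, as (red, green, blue) floats in [0, 1].
pixel = np.array([1.0, 0.5, 0.0], dtype=np.float32)

# Scale to the 0..255 range and round to the nearest integer.
pixel_uint8 = np.round(pixel * 255).astype(np.uint8)

# Going back: divide by 255 to return to the [0, 1] range.
pixel_float = pixel_uint8.astype(np.float32) / 255

print(pixel_uint8)
print(pixel_float)
```

The round trip is only approximate for values such as 0.5, because 255 levels cannot represent every float exactly.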
