如何在 Python 中读取给定像素的 RGB 值？

Posted 2023-02-19

技术标签:

【中文标题】如何在 Python 中读取给定像素的 RGB 值？【英文标题】：How to read the RGB value of a given pixel in Python? 【发布时间】：2010-09-13 09:18:03 【问题描述】：

如果我使用open("image.jpg") 打开图像，假设我有像素的坐标，我如何获得像素的 RGB 值？

那么，我该如何做相反的事情呢？从一个空白图形开始，‘写’一个具有特定 RGB 值的像素？

如果我不需要下载任何额外的库，我会更喜欢。

【问题讨论】：

【参考方案1】：

使用一个名为 Pillow 的库，您可以将其变成一个函数，以便在您的程序中稍后使用，如果您必须多次使用它。该函数只接受图像的路径和要“抓取”的像素的坐标。它打开图像，将其转换为 RGB 颜色空间，并返回请求像素的 R、G 和 B。

from PIL import Image
def rgb_of_pixel(img_path, x, y):
    im = Image.open(img_path).convert('RGB')
    r, g, b = im.getpixel((x, y))
    a = (r, g, b)
    return a

*注意：我不是这段代码的原作者；它没有任何解释。由于它相当容易解释，我只是提供上述解释，以防万一有人不理解它。

【讨论】：

虽然此代码 sn-p 可能是解决方案，但 including an explanation 确实有助于提高您的帖子质量。请记住，您是在为将来的读者回答问题，而这些人可能不知道您提出代码建议的原因。【参考方案2】：

使用Pillow（适用于 Python 3.X 和 Python 2.7+），您可以执行以下操作：

from PIL import Image
im = Image.open('image.jpg', 'r')
width, height = im.size
pixel_values = list(im.getdata())

现在你有了所有的像素值。如果是RGB或者其他模式可以im.mode读取。然后可以通过以下方式获取像素(x, y)：

pixel_values[width*y+x]

或者，您可以使用 Numpy 并重塑数组：

>>> pixel_values = numpy.array(pixel_values).reshape((width, height, 3))
>>> x, y = 0, 1
>>> pixel_values[x][y]
[ 18  18  12]

一个完整、简单易用的解决方案是

# Third party modules
import numpy
from PIL import Image


def get_image(image_path):
    """Get a numpy array of an image so that one can access values[x][y]."""
    image = Image.open(image_path, "r")
    width, height = image.size
    pixel_values = list(image.getdata())
    if image.mode == "RGB":
        channels = 3
    elif image.mode == "L":
        channels = 1
    else:
        print("Unknown mode: %s" % image.mode)
        return None
    pixel_values = numpy.array(pixel_values).reshape((width, height, channels))
    return pixel_values


image = get_image("gradient.png")

print(image[0])
print(image.shape)

冒烟测试代码

您可能不确定宽度/高度/通道的顺序。出于这个原因，我创建了这个渐变：

图片的宽度为 100 像素，高度为 26 像素。它的颜色渐变从#ffaa00（黄色）到#ffffff（白色）。输出是：

[[255 172   5]
 [255 172   5]
 [255 172   5]
 [255 171   5]
 [255 172   5]
 [255 172   5]
 [255 171   5]
 [255 171   5]
 [255 171   5]
 [255 172   5]
 [255 172   5]
 [255 171   5]
 [255 171   5]
 [255 172   5]
 [255 172   5]
 [255 172   5]
 [255 171   5]
 [255 172   5]
 [255 172   5]
 [255 171   5]
 [255 171   5]
 [255 172   4]
 [255 172   5]
 [255 171   5]
 [255 171   5]
 [255 172   5]]
(100, 26, 3)

注意事项：

形状为（宽度、高度、通道） image[0]，也就是第一行，有 26 个相同颜色的三元组

【讨论】：

Pillow 在 macosx 上支持 python 2.7，而我只在 PIL 上找到 python 2.5 支持。谢谢！小心，'reshape' 参数列表应该是（高度、宽度、通道）。对于 rgba 图像，您可以包含 image.mode = RGBA 与 channels = 4 @gmarsi 的点在宽度和高度上是否正确？真的是两者都有效吗？您需要了解数据的输出方式，以便了解输出数组的形状以及图像的行和列像素数据的位置。 @Kioshiki 我在答案中添加了“烟雾测试”部分，因此更容易分辨。【参考方案3】：

正如戴夫·韦伯所说：

这是我的工作代码 sn-p 从图片：
import os, sys
import Image

im = Image.open("image.jpg")
x = 3
y = 4

pix = im.load()
print pix[x,y]

【讨论】：

为什么我在运行 Lachlan Phillips 的代码时会得到四个值？我给这个： print(pix[10,200]) 我得到这个： (156, 158, 157, 255) 为什么？【参考方案4】：

如果您正在寻找 RGB 颜色代码形式的三位数字，那么下面的代码应该可以做到这一点。

i = Image.open(path)
pixels = i.load() # this is not a list, nor is it list()'able
width, height = i.size

all_pixels = []
for x in range(width):
    for y in range(height):
        cpixel = pixels[x, y]
        all_pixels.append(cpixel)

这可能对你有用。

【讨论】：

【参考方案5】：

最好使用Python Image Library 来执行此操作，恐怕需要单独下载。

做你想做的最简单的方法是通过load() method on the Image object，它返回一个像素访问对象，你可以像数组一样操作它：

from PIL import Image

im = Image.open('dead_parrot.jpg') # Can be many different formats.
pix = im.load()
print im.size  # Get the width and hight of the image for iterating over
print pix[x,y]  # Get the RGBA Value of the a pixel of an image
pix[x,y] = value  # Set the RGBA Value of the image (tuple)
im.save('alive_parrot.png')  # Save the modified pixels as .png

或者，查看ImageDraw，它为创建图像提供了更丰富的 API。

【讨论】：

幸运的是，在 Linux 和 Windows 中安装 PIL 非常简单（不了解 Mac） @ArturSapek，我通过pip 安装了 PIL，这相当容易。我在我的 Mac (Pypi) 上使用了这个：easy_install --find-links http://www.pythonware.com/products/pil/ Imaging 对于未来的读者：pip install pillow 将成功且相当快速地安装 PIL（如果不在 virtualenv 中，可能需要sudo）。 pillow.readthedocs.io/en/latest/… 在 Windows 安装步骤中显示 bash 命令。不确定如何继续。【参考方案6】：

import matplotlib.pyplot as plt
import matplotlib.image as mpimg

img=mpimg.imread('Cricket_ACT_official_logo.png')
imgplot = plt.imshow(img)

【讨论】：

【参考方案7】：

您可以使用 Tkinter 模块，它是 Tk GUI 工具包的标准 Python 接口，您不需要额外下载。见https://docs.python.org/2/library/tkinter.html。

（对于 Python 3，Tkinter 被重命名为 tkinter）

这里是如何设置 RGB 值：

#from http://tkinter.unpythonic.net/wiki/PhotoImage
from Tkinter import *

root = Tk()

def pixel(image, pos, color):
    """Place pixel at pos=(x,y) on image, with color=(r,g,b)."""
    r,g,b = color
    x,y = pos
    image.put("#%02x%02x%02x" % (r,g,b), (y, x))

photo = PhotoImage(width=32, height=32)

pixel(photo, (16,16), (255,0,0))  # One lone pixel in the middle...

label = Label(root, image=photo)
label.grid()
root.mainloop()

并获得 RGB：

#from http://www.kosbie.net/cmu/spring-14/15-112/handouts/steganographyEncoder.py
def getRGB(image, x, y):
    value = image.get(x, y)
    return tuple(map(int, value.split(" ")))

【讨论】：

【参考方案8】：

PyPNG - 轻量级 PNG 解码器/编码器

虽然问题暗示了JPG，但我希望我的回答对某些人有用。

下面是使用PyPNG module读写PNG像素的方法：

import png, array

point = (2, 10) # coordinates of pixel to be painted red

reader = png.Reader(filename='image.png')
w, h, pixels, metadata = reader.read_flat()
pixel_byte_width = 4 if metadata['alpha'] else 3
pixel_position = point[0] + point[1] * w
new_pixel_value = (255, 0, 0, 0) if metadata['alpha'] else (255, 0, 0)
pixels[
  pixel_position * pixel_byte_width :
  (pixel_position + 1) * pixel_byte_width] = array.array('B', new_pixel_value)

output = open('image-with-red-dot.png', 'wb')
writer = png.Writer(w, h, **metadata)
writer.write_array(output, pixels)
output.close()

PyPNG 是一个长度不到 4000 行的纯 Python 模块，包括测试和 cmets。

PIL 是一个更全面的图像库，但它也明显更重。

【讨论】：

【参考方案9】：

photo = Image.open('IN.jpg') #your image
photo = photo.convert('RGB')

width = photo.size[0] #define W and H
height = photo.size[1]

for y in range(0, height): #each pixel has coordinates
    row = ""
    for x in range(0, width):

        RGB = photo.getpixel((x,y))
        R,G,B = RGB  #now you can use the RGB value

【讨论】：

【参考方案10】：

使用命令“sudo apt-get install python-imaging”安装 PIL 并运行以下程序。它将打印图像的 RGB 值。如果图像很大，则使用“>”将输出重定向到文件，稍后打开文件以查看 RGB 值

import PIL
import Image
FILENAME='fn.gif' #image can be in gif jpeg or png format 
im=Image.open(FILENAME).convert('RGB')
pix=im.load()
w=im.size[0]
h=im.size[1]
for i in range(w):
  for j in range(h):
    print pix[i,j]

【讨论】：

【参考方案11】：

您可以使用pygame 的 surfarray 模块。该模块有一个 3d 像素数组返回方法，称为 pixel3d(surface)。我在下面展示了用法：

from pygame import surfarray, image, display
import pygame
import numpy #important to import

pygame.init()
image = image.load("myimagefile.jpg") #surface to render
resolution = (image.get_width(),image.get_height())
screen = display.set_mode(resolution) #create space for display
screen.blit(image, (0,0)) #superpose image on screen
display.flip()
surfarray.use_arraytype("numpy") #important!
screenpix = surfarray.pixels3d(image) #pixels in 3d array:
#[x][y][rgb]
for y in range(resolution[1]):
    for x in range(resolution[0]):
        for color in range(3):
            screenpix[x][y][color] += 128
            #reverting colors
screen.blit(surfarray.make_surface(screenpix), (0,0)) #superpose on screen
display.flip() #update display
while 1:
    print finished

希望对您有所帮助。最后一句话：屏幕在 screenpix 的生命周期内被锁定。

【讨论】：

【参考方案12】：

wiki.wxpython.org 上有一篇非常好的文章，标题为Working With Images。文章提到了使用 wxWidgets (wxImage)、PIL 或 PythonMagick 的可能性。就个人而言，我使用过 PIL 和 wxWidgets，它们都使图像处理变得相当容易。

【讨论】：

【参考方案13】：

图像处理是一个复杂的话题，最好使用一个库。我可以推荐gdmodule，它提供了从 Python 中轻松访问许多不同图像格式的方法。

【讨论】：

有人知道为什么这被否决了吗？ libgd 是否存在已知问题？（我从未看过它，但很高兴知道有 PiL 的替代品）

以上是关于如何在 Python 中读取给定像素的 RGB 值？的主要内容，如果未能解决你的问题，请参考以下文章

如何获得具有 GD 的像素的正确 rgb 值？

matlab与opencv读取同一帧视频时会得到不同的像素值