在进行计算机视觉模型训练前，我们经常会用到图像增强的技巧来获取更多的样本，但是有些深度学习框架中的方法对图像的变化方式可能并不能满足我们的需求，所以掌握OpenCV中一些常用的图像处理技巧对我们还是有不少帮助的。主要内容如下：

图片读取（单通道、三通道）
图片修改（剪切、旋转、通道分离）
常用的变换（伽马修正、仿射变换）
总结

开始之前先安装好 Opencv，并导入我们要用到的包：

import cv2
import random
import numpy as np
from matplotlib import pyplot as plt

1. 图片读取

首先使用OpenCV中的 imread 函数来进行单通道图像的读取，使用 imshow 显示读取的图像：

img_gray = cv2.imread('img.jpg', 0)
cv2.imshow("jrs",img_gray)

key = cv2.waitKey()
if key == 27:
 cv2.destroyAllWindows()

显示的结果如下，是一个灰度图：

还可以查看我们图像的一些基本属性：

print(img_gray)
print(img_gray.dtype)
print(img_gray.shape)

我们在来看下三通道的图像读取方式，只需要将 imread 中第二个参数去掉：

img = cv2.imread('img.jpg') 
cv2.imshow("img",img) 

key = cv2.waitKey() 
if key == 27:
 cv2.destroyAllWindows()

结果如下：

同样，我们可以再次查看图像的基本属性，会发现如下两个已经发生了变化。

print(img_gray)
print(img_gray.shape)

2. 图片修改

2.1 图像的裁剪

裁剪最简单的方式是获取图像数组的切片，如下：

img_crop = img[100:300,100:300]
cv2.imshow("img_crop", img_crop)

key = cv2.waitKey()
if key == 27:
 cv2.destroyAllWindows()

得到的结果如下：

2.2 图像通道分离

我们知道每个图像是由 RGB 三个颜色通道构成，所以我们可以使用 split 函数对原图像的三个通道进行分离：

B, G, R = cv2.split(img)

进行通道分离的后，我们就可以在每个通道上独立的进行数值变换，变换完成后再组合来生成新的图像。比如提升图像的亮度：

B,G,R = cv2.split(img)
for i in (B,G,R):
 randint = random.randint(50,100)
 limit = 255-randint
 i[i>limit]=255
 i[i<=limit]=randint+i[i<=limit]
 img_merge = cv2.merge((B,G,R))
cv2.imshow("img_merge",img_merge)

key = cv2.waitKey()
if key==27:
 cv2.destroyAllWindows()

得到的效果如下：

2.3 图像旋转

还可以使用 warpAffine 函数根据我们的设定的角度完成图像的旋转：

M = cv2.getRotationMatrix2D((img.shape[1] / 2, img.shape[0] / 2), 30, 1)
img_rotate = cv2.warpAffine(img, M, (img.shape[1], img.shape[0]))
cv2.imshow('img_rotate', img_rotate)

key = cv2.waitKey(0)
if key == 27:
 cv2.destroyAllWindows()

这里我们不对图像进行缩放，旋转角度为30度，最后得到的效果如下：

3. 常用的变换

3.1 仿射变换

仿射变换允许图像倾斜并且可以在任意两个方向上发生伸缩。代码如下：

def random_warp(img, row, col):
 height, width, channels = img.shape
 random_margin = 100
 x1 = random.randint(-random_margin, random_margin)
 y1 = random.randint(-random_margin, random_margin)
 x2 = random.randint(width - random_margin - 1, width - 1)
 y2 = random.randint(-random_margin, random_margin)
 x3 = random.randint(width - random_margin - 1, width - 1)
 y3 = random.randint(height - random_margin - 1, height - 1)
 x4 = random.randint(-random_margin, random_margin)
 y4 = random.randint(height - random_margin - 1, height - 1)
 dx1 = random.randint(-random_margin, random_margin)
 dy1 = random.randint(-random_margin, random_margin)
 dx2 = random.randint(width - random_margin - 1, width - 1)
 dy2 = random.randint(-random_margin, random_margin)
 dx3 = random.randint(width - random_margin - 1, width - 1)
 dy3 = random.randint(height - random_margin - 1, height - 1)
 dx4 = random.randint(-random_margin, random_margin)
 dy4 = random.randint(height - random_margin - 1, height - 1)
 pts1 = np.float32([[x1, y1], [x2, y2], [x3, y3], [x4, y4]])
 pts2 = np.float32([[dx1, dy1], [dx2, dy2], [dx3, dy3], [dx4, dy4]])
 
 
 M_warp = cv2.getPerspectiveTransform(pts1, pts2)
 img_warp = cv2.warpPerspective(img, M_warp, (width, height))
 return img_warp

img_warp = random_warp(img, img.shape[0], img.shape[1])
cv2.imshow('img_warp', img_warp)

key = cv2.waitKey(0)
if key == 27:
 cv2.destroyAllWindows()

运行后的效果如下：

3.2 伽马修正

伽马修正提升图像的对比度，让图像看起来更加的“明亮”。代码如下：

def adjust_gamma(image, gamma=1.0):
 invGamma = 1.0/gamma
 table = []
 for i in range(256):
 table.append(((i / 255.0) ** invGamma) * 255)
 table = np.array(table).astype("uint8")
 return cv2.LUT(image, table)

img_gamma = adjust_gamma(img, 2)
cv2.imshow("img",img)
cv2.imshow("img_gamma",img_gamma)

key = cv2.waitKey()
if key == 27:
 cv2.destroyAllWindows()

运行后的效果如下：

4. 总结

通过上面的方法已经能够满足我们绝大多数的图像增强需求了，当然和可以组合使用。

如果您觉得有所收获，请点个赞吧。

OpenCV常用图片转换技巧（opencv jpg转bmp）

1. 图片读取

2. 图片修改

3. 常用的变换

4. 总结

相关推荐

取消回复欢迎你发表评论:

Google 黑客常用搜索语句一览原力计划

npx简介（npxvip是哪国的）

在 Android 模拟器上运行 ARM 应用（android模拟器原理）

GB28181,B接口协议之SIPRTSPRTPRTMP协议从入门到精通

手机实时提取SIM卡打电话的信令和声音-辅助外设与商用通话方案

轻松转换!AppleNumbers到Excel的快捷教程

安装使用Hoppscotch构建API请求访问与测试

电脑端腾讯文档如何导出excel

Python自动化办公——后台截图（python 自动截图）

网络流媒体经典开源软件宝典webRTC, FFMpeg, SIP_流媒体开发教程

OpenCV常用图片转换技巧（opencv jpg转bmp）

1. 图片读取

2. 图片修改

3. 常用的变换

4. 总结

相关推荐

取消回复欢迎 你 发表评论:

Google 黑客常用搜索语句一览 原力计划

npx简介（npxvip是哪国的）

在 Android 模拟器上运行 ARM 应用（android模拟器原理）

GB28181,B接口协议之SIPRTSPRTPRTMP协议从入门到精通

手机实时提取SIM卡打电话的信令和声音-辅助外设与商用通话方案

轻松转换!AppleNumbers到Excel的快捷教程

安装使用Hoppscotch构建API请求访问与测试

电脑端腾讯文档如何导出excel

Python自动化办公——后台截图（python 自动截图）

网络流媒体经典开源软件宝典webRTC, FFMpeg, SIP_流媒体开发教程

取消回复欢迎你发表评论:

Google 黑客常用搜索语句一览原力计划