温馨提示:本文翻译自stackoverflow.com，查看原文请点击：opencv - Detect and extract images surrounded by a frame

imagemagick opencv

opencv - 检测并提取框架包围的图像

发布于 2020-04-04 00:40:50

我想从输入图像中获取以下结果图像。生成的图像被具有相同边框大小和类型但边框矩形大小不同的框架包围。有没有办法做到这一点？我认为我需要首先检测边界所包围的区域。但是不知道。我正在尝试在ImageMagick中找到它。

输入图片（input.png）

结果图像（output1.png）

结果图像（output2.png）

边界

更新1

这不是完美的方法，但它可用于OpenCV，如下所示。

import cv2 as cv

def main():
    image_file = '/path/to/your/input/image.png'
    src = cv.imread(image_file, cv.IMREAD_COLOR)
    height, width, channels = src.shape
    image_size = height * width
    img_gray = cv.cvtColor(src, cv.COLOR_RGB2GRAY)
    retval, dst = cv.threshold(img_gray, 1000, 255, cv.THRESH_TOZERO_INV)
    dst = cv.bitwise_not(dst)
    retval, dst = cv.threshold(dst, 0, 255, cv.THRESH_BINARY | cv.THRESH_OTSU)
    dst, contours, hierarchy = cv.findContours(
        dst, cv.RETR_TREE, cv.CHAIN_APPROX_SIMPLE)

    xxx = 0
    for i, contour in enumerate(contours):
        area = cv.contourArea(contour)
        if area < 50000:
            continue
        if image_size * 0.99 < area:
            continue
        if abs(i - xxx) < 10:
            continue
        xxx = i
        x, y, w, h = cv.boundingRect(contour)
        cut = src[y:y+h, x:x+w]
        detector = cv.FastFeatureDetector_create()
        detector.setNonmaxSuppression(False)
        keypoints = detector.detect(cut)
        cv.imwrite('debug_%d.png' % i, cut)

if __name__ == '__main__':
    main()

从此站点引用：https : //angular.io/guide/providers

更新2

fmw42的方法不错，但不足以满足以下要求。（我没有在第一篇文章中提到）唯一的蓝色矩形被提取。背景颜色可能是白色。

输入图片（input2.png）

实际结果图像（output.png）

提问者

zono

被浏览

116

查看英文版

查看原文

fmw42 2020-02-01 14:18

这可以在ImageMagick（6）中使用-connected-components完成。

在这里，我将转换为HSV色彩空间并提取饱和度通道。白色和黑色没有饱和度，但是粉红色和蓝色有饱和度。然后，我将阈值设置为使粉红色和蓝色在黑色背景上变为白色。然后，我使用形态学腐蚀来消除边界的影响。然后，我使用连接的组件填充白色区域中的所有孔，然后获取其边界框并存储在数组中。然后，我遍历每个边界框并裁剪原始图像。

参见https://imagemagick.org/script/connected-components.php

输入：

Unix语法：

bboxArr=(`convert wikipedia.png \
-colorspace HSV -channel 1 -separate +channel \
-threshold 0 -type bilevel \
-morphology erode square:3 \
-define connected-components:verbose=true \
-define connected-components:mean-color=true \
-define connected-components:area-threshold=1000 \
-connected-components 4 null: | grep "gray(255)" | awk '{print $2}'`)

num=${#bboxArr[*]}

for ((i=0; i<num; i++)); do
convert wikipedia.png -crop ${bboxArr[$i]} +repage wikipedia_$i.png
done

结果：

如果使用ImageMagick 7，则将convert转换为magick。

Windows语法需要删除\之前的（和）。并将\的结尾更改为^。grep和awk是Unix工具。因此，您可能需要为Windows安装此类程序或找到其他方法来执行此操作。

zono 2020-02-01 16:57:42

效果很好。但请给我更多时间。正如您所提到的，我已经确认如果它是白色后端颜色，则它不起作用。颜色也可以按我的要求进行（是的，我没有在问题中提及。很抱歉..）。我正在尝试寻找解决方案。

fmw42 2020-02-02 01:51:43

您是否需要所有文本段落？如果是这样，则需要稍微不同的方法。首先，将所有文本在白色背景上设为黑色。然后模糊文本或打开我们的形态以连接每个段落中的文本。门槛。然后使用连接的组件查找文本区域边界框。然后使用边界框裁剪输入。

zono 2020-02-02 15:18:56

你好不，我不需要文本段落。我需要提取output1.png和output2.png。我在问题中添加了细节。（更新2）

相关问题

1

使用python在图块中转换和裁剪图像

2

Wand / ImageMagick比较方法始终返回相同的浮点数

3

如何在docker项目下将图像magick添加到我的laravel中？

4

ImageMagic注释字符代码，而不是注释图像中的字符

5

ImageMagick：粗体和斜体字体？

6

尝试添加文本剪辑时，使用Python / MoviePy获得有关ImageMagick的错误

7

如何确定动画GIF帧中透明像素的数量？

8

使用Imagemagick将图像转换为pdf，并保持图像分辨率并将其放在左上角

9

ImageMagick将灰度图像转换为彩色图像

10

将白板清洁器脚本转换为php

热门github

1

A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET, TFTP, WS and WSS. libcurl offers a myriad of powerful features (翻译：Curl 是一个命令行工具，用于传输使用 URL 语法指定的数据。)

2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

3

Flutter makes it easy and fast to build beautiful apps for mobile and beyond (翻译：Flutter 可以轻松快速地为移动设备及其他应用构建漂亮的应用程序)

4

Powerful menu bar manager for macOS

5

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

6

AI coding agent, built for the terminal.

7

Tongyi DeepResearch, the Leading Open-source DeepResearch Agent

8

An AI Hedge Fund Team

9

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

10

基于大模型和 RAG 的智能问数系统。Text-to-SQL Generation via LLMs using RAG.

11

🔥 🔥 🔥 Open Source Airtable Alternative (翻译：将任何 MySQL、PostgreSQL、SQL Server、SQLite 和 MariaDB 转换为智能电子表格。)

12

Lightweight coding agent that runs in your terminal

13

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

14

Home of the WebKit project, the browser engine used by Safari, Mail, App Store and many other applications on macOS, iOS and Linux. (翻译：WebKit 项目的主页，Safari、Mail、App Store 和 macOS、iOS 和 Linux 上的许多其他应用程序使用的浏览器引擎。)

15