123,123,123

一种基于点云实例分割的六维位姿估计方法

网络安全与数据治理

周剑

苏州深浅优视智能科技有限公司

摘要： 提出了一种基于SoftGroup实例分割模型和PCA主成分分析算法来估计物体位姿的方法。在工业自动化领域，通常会为诸如机器人、机械臂配备视觉系统并利用二维图像估算目标物体位置，但当目标物体出现堆叠、遮挡等复杂场景时，对二维图形的识别精度往往有所下降。为准确、高效地获取物体位置，充分利用三维点云数据的高分辨率、高精度的优势：首先将深度相机采集到的RGB-D图像转为点云图，接着利用SoftGroup模型分割出点云图中的目标对象，最后用PCA算法得到目标的六维位姿。在自制工件数据集上进行验证，结果表明对三种工件识别的平均AP高达97.5%，单张点云图识别用时仅0.73 ms，证明所提出的方法具有高效性和实时性，为诸如机器人定位、机械臂自主抓取场景带来了全新的视角和解决方案，具有显著的工程应用潜力。

關(guān)鍵詞： 点云数据 SoftGroup实例分割六维位姿估计

中圖分類(lèi)號(hào)：TP391文獻(xiàn)標(biāo)識(shí)碼：ADOI:10-19358/j-issn-2097-1788-2024-05-006
引用格式：周劍.一種基于點(diǎn)云實(shí)例分割的六維位姿估計(jì)方法［J］.網(wǎng)絡(luò)安全與數(shù)據(jù)治理，2024，43（5）：42-45，60.

6D pose estimation based on point cloud instance segmentation

Zhou Jian

DEEPerceptron Tech

Abstract： This paper proposes a method based on the SoftGroup instance segmentation model and Principal Component Analysis (PCA) algorithm for estimating object poses. In the field of industrial automation, visual systems are often equipped on robots and robotic arms to estimate the position of target objects using 2D images. However, in complex scenarios such as stacking and occlusion, the recognition accuracy of 2D images tends to decrease. To accurately and efficiently obtain object positions, this paper fully leverages the high-resolution and high-precision advantages of 3D point cloud data. Firstly, RGB-D images captured by a depth camera are converted into point cloud images. Then, the SoftGroup model is employed to segment the target objects in the point cloud image, and finally, the PCA algorithm is used to obtain the six-dimensional pose of the target. Validation on a self-made dataset shows an average AP of 97.5% for the recognition of three types of objects. The recognition time for a single point cloud image is only 0.73 ms, demonstrating the efficiency and real-time capability of the proposed method. This approach provides a new perspective and solution for scenarios such as robot localization and autonomous grasping of robotic arms, with significant potential for practical engineering applications.

Key words : point cloud data; SoftGroup instance segmentation; 6D pose estimation

引言

近年，隨著激光掃描儀、相機(jī)、三維掃描儀等硬件設(shè)備的發(fā)展與普及，點(diǎn)云數(shù)據(jù)的獲取途徑變得更加多樣，數(shù)據(jù)獲取的難度不斷降低。相較于二維圖像，三維點(diǎn)云數(shù)據(jù)具備無(wú)可比擬的優(yōu)勢(shì)。其高分辨率、高精度、高緯度的特性賦予點(diǎn)云數(shù)據(jù)更為豐富的空間幾何信息，能夠直觀地表達(dá)物體的形狀特征。近年來(lái)，點(diǎn)云數(shù)據(jù)在工業(yè)測(cè)量、機(jī)械臂抓取、目標(biāo)檢測(cè)、機(jī)器人視覺(jué)等領(lǐng)域得到了廣泛應(yīng)用［1–3］。

在工業(yè)自動(dòng)化領(lǐng)域，通常需要先獲得物體的位姿信息再進(jìn)行后續(xù)抓取動(dòng)作。自動(dòng)抓取物體可分為結(jié)構(gòu)化場(chǎng)景和非結(jié)構(gòu)化場(chǎng)景。在結(jié)構(gòu)化工作場(chǎng)景中，機(jī)械臂抓取固定位置的物體，該模式需要進(jìn)行大量調(diào)試和示教工作，機(jī)械臂只能按照預(yù)設(shè)程序進(jìn)行工作，缺乏自主識(shí)別和決策能力，一旦目標(biāo)物體發(fā)生形變或位置偏移，可能導(dǎo)致抓取失?。辉诜墙Y(jié)構(gòu)化場(chǎng)景中，通常為機(jī)械臂配備視覺(jué)感知硬件和目標(biāo)檢測(cè)算法，以使機(jī)械臂能夠感知并理解相對(duì)復(fù)雜的抓取環(huán)境。然而，在實(shí)際復(fù)雜的抓取場(chǎng)景下（如散亂、堆疊、遮擋），常見(jiàn)的目標(biāo)檢測(cè)方法如點(diǎn)云配準(zhǔn)［4］、二維圖像實(shí)例分割［5］的精度有所下降，從而影響抓取效率［6］。

本文詳細(xì)內(nèi)容請(qǐng)下載：

http://www.ihrv.cn/resource/share/2000006014

作者信息：

周劍

（蘇州深淺優(yōu)視智能科技有限公司，江蘇蘇州215124）

Magazine.Subscription.jpg

原創(chuàng)聲明：此內(nèi)容為AET網(wǎng)站原創(chuàng)，未經(jīng)授權(quán)禁止轉(zhuǎn)載。

相關(guān)內(nèi)容