浙江农业学报 ›› 2026, Vol. 38 ›› Issue (3): 588-599.DOI: 10.3969/j.issn.1004-1524.20250029

• 生物系统工程 • 上一篇    下一篇

基于SOLOv2改进的苹果采摘机器人实例分割算法

张郭1(), 周庆辉2,*(), 何胜喜3   

  1. 1. 重庆科创职业学院 智能制造与机器人学院, 重庆 402160
    2. 重庆大学 机械传动国家重点实验室, 重庆 400044
    3. 重庆长安汽车股份有限公司, 重庆 400023
  • 收稿日期:2025-01-06 出版日期:2026-03-25 发布日期:2026-04-17
  • 作者简介:*周庆辉,E-mail:zhouqinghui1985666@163.com
    张郭,研究方向为自动化控制。E-mail:zhangguo1982@yeah.net
  • 通讯作者: 周庆辉
  • 基金资助:
    国家自然科学基金(52005057)

An improved instance segmentation algorithm for apple picking robots based on SOLOv2

ZHANG Guo1(), ZHOU Qinghui2,*(), HE Shengxi3   

  1. 1. College of Intelligent Manufacturing and Robotics, Chongqing College of Science and Creation, Chongqing 402160, China
    2. State Key Laboratory of Mechanical Transmission, Chongqing University, Chongqing 400044, China
    3. Chongqing Chang’an Automobile Co., Ltd., Chongqing 400023, China
  • Received:2025-01-06 Published:2026-03-25 Online:2026-04-17
  • Contact: ZHOU Qinghui

摘要:

为解决现有苹果实例分割算法中存在的目标尺度多变、易受背景噪声干扰影响分割精度,以及推理速度过慢的问题,提出一种基于SOLOv2改进的苹果实例分割算法AE-SOLOv2。首先,采用StarBlock(星形卷积)优化特征提取网络,提升推理速度;然后,设计一种增强型双向特征金字塔网络(E-BiFPN),以额外的跨层连接促进高层语义信息与低层几何细节信息的充分融合,解决因分割目标尺度不一导致的精度下降问题;此外,引入注意力特征融合模块,通过综合多层次与多分支的特征信息,并结合自适应分组策略及全局与局部特征的协同处理,显著增强算法抗背景噪声干扰的能力。结果表明,改进后的算法能够有效提高苹果的实例分割精确率,在验证集上交并比为0.5时的平均精确率均值(mAP50)达到91.2%,单张图像推理耗时12.3 ms,可满足苹果实例分割的实时性需求。此外,采摘实验表明,即便在计算资源受限的苹果采摘机器人场景下,AE-SOLOv2的各项性能仍表现出色,可为农业智能化发展提供一种新的解决思路。

关键词: SOLOv2网络, 实例分割, 采摘机器人, 注意力机制

Abstract:

To address the issues prevalent in existing apple instance segmentation algorithms, including variable object scales, susceptibility to background noise which degrades segmentation accuracy, and excessively slow inference speed, an improved SOLOv2-based apple instance segmentation algorithm, termed AE-SOLOv2, is proposed in this paper. First, the StarBlock structure is adopted to optimize the feature extraction network, thereby improving inference efficiency. Subsequently, an enhanced bidirectional feature pyramid network (E-BiFPN) is designed, where additional cross-layer connections facilitate adequate fusion of high-level semantic information and low-level geometric detail information, mitigating the accuracy degradation caused by inconsistent scales of segmentation targets. Furthermore, an attentive feature fusion module is introduced to integrate multi-level and multi-branch feature information. Combined with an adaptive grouping strategy and collaborative processing of global and local features, this module significantly enhances the anti-interference capability of the algorithm against background noise. Experimental results demonstrate that the improved algorithm effectively enhances the instance segmentation precision for apples. On the validation dataset, the mean average precision at an intersection-over-union threshold of 0.5 (mAP50) reaches 91.2%, and the inference time per image is 12.3 ms, which satisfies the real-time requirements for apple instance segmentation. In addition, picking experiments verify that AE-SOLOv2 exhibits superior overall performance even in apple-picking robot scenarios with constrained computational resources, providing a novel solution for the development of agricultural intelligence.

Key words: SOLOv2 network, instance segmentation, picking robot, attention mechanism

中图分类号: