浙江农业学报 ›› 2022, Vol. 34 ›› Issue (11): 2522-2532.DOI: 10.3969/j.issn.1004-1524.2022.11.21

• 生物系统工程 • 上一篇    下一篇

基于YOLOV4模型的果园樱桃实时检测研究

周品志1,2(), 裴悦琨1,2,*(), 魏冉1,2, 张永飞1,2, 谷宇1,2   

  1. 1.大连大学 辽宁省北斗高精度位置服务技术工程实验室,辽宁 大连 116622
    2.大连大学 大连市环境感知与智能控制重点实验室,辽宁 大连 116622
  • 收稿日期:2021-05-07 出版日期:2022-11-25 发布日期:2022-11-29
  • 通讯作者: 裴悦琨
  • 作者简介:*裴悦琨,E-mail: peiyuekun@dlu.edu.cn
    周品志(1996—), 男,贵州毕节人,硕士研究生,研究方向为计算机图像处理。E-mail: 1978868229@qq.com
  • 基金资助:
    国家自然科学基金(61601076);大连市高层次人才创新计划(2019RQ070)

Real-time detection of orchard cherry based on YOLOV4 model

ZHOU Pinzhi1,2(), PEI Yuekun1,2,*(), WEI Ran1,2, ZHANG Yongfei1,2, GU Yu1,2   

  1. 1. Beidou High Precision Positioning Service Technology Engineering Laboratory of Liaoning Province, Dalian Unviersity, Dalian 116622, Liaoning, China
    2. Environment Sensing and Intelligent Control Key Laboratory of Dalian,Dalian University, Dalian 116622, Liaoning, China
  • Received:2021-05-07 Online:2022-11-25 Published:2022-11-29
  • Contact: PEI Yuekun

摘要:

为解决在自然环境下对樱桃不同生长时期的状态监测受环境影响存在目标识别困难、检测准确率低的问题,提出了一种基于CSPDarknet53改进的卷积神经网络樱桃分类检测模型。经典YOLOV4所使用的特征提取网络层数较深,能够提取更高级的抽象特征,但是对目标局部感知能力较弱,通过在CSPDarknet53网络结构上融合CBAM注意力机制,增强了目标局部特征感知能力,进一步提升目标检测精度,其特征提取和目标检测能力优于原算法,调整特征提取网络的特征层输出,将第三层输出变为第二层输出以增加小目标语义信息的获取,利用k-means算法优化先验框尺寸以适应樱桃目标大小,并进行了消融实验分析。结果表明,改进的YOLOV4樱桃检测模型模型的平均精度达到了92.31%,F1分数达到了87.3%,优于Faster RCNN、YOLOV3和原来的YOLOV4算法,检测速度为40.23幅·s-1,适用于自然环境下的樱桃监测,为实现果园水果生长状态自动监测提供了理论和技术基础。

关键词: 樱桃, 成熟度, 注意力机制

Abstract:

In order to solve the problem of difficulty in target recognition and low detection accuracy in the state monitoring of cherries in different growth periods under natural environment, this paper proposed an improved convolutional neural network cherry classification and detection model based on CSPDarknet53. The feature extraction network used in the classic YOLOV4 had a deeper number of layers and could extract more advanced abstract features, but had a weak local perception of the target. By integrating the CBAM attention mechanism on the CSPDarknet53 network structure, the perception of local features of the target was enhanced. Furthermore, the accuracy of target detection was improved, its feature extraction and target detection capabilities were better than the original algorithm, the feature layer output of the feature extraction network was adjusted, and the third layer output to the second layer output was changed to increase the acquisition of small target semantic information, k-means algorithm was used to optimize the size of the prior frame to adapt to the size of the cherry target, and ablation experiment analysis was conducted. The results showed that the improved YOLOV4 cherry detection model had an average accuracy of 92.31% and an F1 score of 87.3%, which was better than Faster RCNN, YOLOV3 and the original YOLOV4 algorithm. The detection speed was 40.23 frames·s-1, which was suitable for natural environments. It provided a theoretical and technical basis for realizing automatic monitoring of fruit growth status in orchards.

Key words: cherry, maturity, attention mechanism

中图分类号: