TY - JOUR PY - 2023// TI - YOLOv8 with bi-level routing attention for road scene object detection JO - Journal of graphics A1 - Chen-hao, W. E. I. A1 - Rui, Yang A1 - Zhen-bing, L. I. U. A1 - Ru-shi, L. a. N. A1 - Xi-yan, S. U. N. A1 - Xiao-nan, L. U. O. SP - 1104 EP - 1111 VL - 44 IS - 6 N2 - With the continuous increase of motor vehicles, the road traffic environment has become increasingly complex, particularly due to changes in light conditions and complex backgrounds that can interfere with the accuracy and precision of target detection algorithms. Meanwhile, the diverse shapes of targets in road scenes can pose challenges to the detection task. In response to these challenges, a method named YOLOv8n_T was proposed. Building on the YOLOv8 skeleton network, it incorporated a D_C2f block utilizing deformable convolution to enhance feature learning for targets under complex backgrounds, making it more adaptable to the diverse and complex scenarios of road targets. Furthermore, the model incorporated a dual routing attention module to query adaptively and remove irrelevant regions, retaining only the most relevant regions. For small targets such as pedestrians and traffic lights on the road, a small target detection layer was added. Experimental results demonstrated that the proposed YOLOv8n_T could significantly enhance the precision of target detection in road scenarios, with an average precision increase of 6.8 percentage points compared to the original YOLOv8n and 11.2 percentage points compared to YOLOv5n on the BDD100K dataset. Key words: deformable convolution, road scene, object detection, YOLO, attention mechanism
Language: en
LA - en SN - 2095-302X UR - http://dx.doi.org/10.11996/JG.j.2095-302X.2023061104 ID - ref1 ER -