基于Grounding DINO与YOLOv8伪标签监督的高效车牌识别方法 (Efficient License Plate Recognition via Pseudo-Labeled Supervision with Grounding DINO and YOLOv8)

from arxiv, 6 pages, 8 figures. Presented at 2025 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), August 31 - September 3, 2025, Istanbul, Turkey

Developing a highly accurate automatic license plate recognition system (ALPR) is challenging due to environmental factors such as lighting, rain, and dust. Additional difficulties include high vehicle speeds, varying camera angles, and low-quality or low-resolution images. ALPR is vital in traffic control, parking, vehicle tracking, toll collection, and law enforcement applications. This paper proposes a deep learning strategy using YOLOv8 for license plate detection and recognition tasks. This method seeks to enhance the performance of the model using datasets from Ontario, Quebec, California, and New York State. It achieved an impressive recall rate of 94% on the dataset from the Center for Pattern Recognition and Machine Intelligence (CENPARMI) and 91% on the UFPR-ALPR dataset. In addition, our method follows a semi-supervised learning framework, combining a small set of manually labeled data with pseudo-labels generated by Grounding DINO to train our detection model. Grounding DINO, a powerful vision-language model, automatically annotates many images with bounding boxes for license plates, thereby minimizing the reliance on labor-intensive manual labeling. By integrating human-verified and model-generated annotations, we can scale our dataset efficiently while maintaining label quality, which significantly enhances the training process and overall model performance. Furthermore, it reports character error rates for both datasets, providing additional insight into system performance.

翻译：由于光照、雨雾、灰尘等环境因素，开发高精度的自动车牌识别系统（ALPR）具有挑战性。其他困难还包括车辆高速行驶、摄像头角度多变以及图像质量或分辨率较低。ALPR在交通管控、停车管理、车辆追踪、收费系统及执法应用中至关重要。本文提出一种基于YOLOv8的深度学习策略，用于车牌检测与识别任务。该方法利用来自安大略省、魁北克省、加利福尼亚州和纽约州的数据集以提升模型性能。在模式识别与机器智能中心（CENPARMI）数据集上实现了94%的召回率，在UFPR-ALPR数据集上达到91%。此外，本方法采用半监督学习框架，将少量人工标注数据与Grounding DINO生成的伪标签相结合来训练检测模型。Grounding DINO作为一种强大的视觉-语言模型，能自动为大量图像生成车牌边界框标注，从而显著减少对人工标注的依赖。通过整合人工验证与模型生成的标注，我们能在保持标签质量的同时高效扩展数据集，这显著优化了训练过程并提升了整体模型性能。研究还报告了两个数据集的字符错误率，为系统性能评估提供了更深入的视角。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日