AITopics | license plate

license plate

OV-PARTS: Towards Open-Vocabulary Part Segmentation (Supplementary Material)

Neural Information Processing Systems | Apr-30-2026, 00:36:49 GMT

The supplementary material is organized as follows: Implementation Details (Sec. ...) ... Except for the Object Mask Prompt and Compositional Prompt Tuning designs, we follow the default architecture in the original ZSseg paper. The number of part queries is set to 50. All the two-stage baselines are trained with the AdamW optimizer with an initial learning rate of 1e-4 and a weight decay of 1e-4. A poly learning rate policy with a power of 0.9 is adopted.
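As a rough illustration of the poly learning rate policy mentioned in the excerpt above, the sketch below uses the common polynomial-decay form, base_lr * (1 - step / max_steps) ** power. The base rate of 1e-4 and the power of 0.9 come from the excerpt; the function name `poly_lr` and the schedule length of 1000 steps are assumptions made only for this example, since the excerpt does not state the iteration count for these baselines.

```python
def poly_lr(base_lr: float, step: int, max_steps: int, power: float = 0.9) -> float:
    """Polynomial ("poly") decay: base_lr * (1 - step / max_steps) ** power."""
    if max_steps <= 0:
        raise ValueError("max_steps must be positive")
    # Clamp so the rate never goes negative or complex outside the schedule.
    step = min(max(step, 0), max_steps)
    return base_lr * (1.0 - step / max_steps) ** power


if __name__ == "__main__":
    # max_steps=1000 is a hypothetical schedule length, not a value from the paper.
    for s in (0, 250, 500, 750, 1000):
        print(s, poly_lr(1e-4, s, max_steps=1000))
```

The rate starts at the base value and decays smoothly to zero at the final step; a power below 1 keeps the rate higher for longer than a linear decay would.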

artificial intelligence, machine learning, pascal-part-116, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

OV-PARTS: Towards Open-Vocabulary Part Segmentation (Supplementary Material)

Neural Information Processing Systems | Feb-17-2026, 12:48:09 GMT

The number of part queries is set to 50. An SGD optimizer with an initial learning rate of 2e-2 and a weight decay of 5e-4 is used. We sample 128 training samples for each object part class. The initial value of the learnable fusion weight is 0.5. The total batch size is 8, and training runs for 40k iterations.

artificial intelligence, machine learning, pascal-part-116, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

3D-Aware Visual Question Answering about Parts, Poses and Occlusions

Neural Information Processing Systems | Feb-16-2026, 17:58:48 GMT

Visual question answering (VQA) is a challenging task that requires an in-depth understanding of vision and language, as well as multi-modal reasoning.

machine learning, natural language, question answering, (18 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Asia > Middle East > Israel (0.04)

Industry: Automobiles & Trucks (0.97)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.72)
(2 more...)

Flock Uses Overseas Gig Workers to Build Its Surveillance AI

WIRED | Dec-1-2025, 14:00:00 GMT

An accidental leak revealed that Flock, which has cameras in thousands of US communities, is using workers in the Philippines to review and classify footage. Flock, the automatic license plate reader and AI-powered camera company, uses overseas workers from Upwork to train its machine learning algorithms, with training material telling workers how to review and categorize footage, including images of people and vehicles in the United States, according to material reviewed by 404 Media that was accidentally exposed by the company. The findings raise questions about who exactly has access to footage collected by Flock surveillance cameras and where the people reviewing the footage may be based. Flock has become a pervasive technology in the US, with its cameras present in thousands of communities, and cops use them every day to investigate things like carjackings. Local police have also performed numerous lookups for ICE in the system.

artificial intelligence, footage, machine learning, (15 more...)

WIRED

Country:

Asia > Philippines (0.25)
North America > United States > New York (0.05)
North America > United States > California (0.05)
(6 more...)

Genre: Instructional Material (0.36)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance

Dimidov, Valeriu, Hawlader, Faisal, Jafarnejad, Sasan, Frank, Raphaël

arXiv.org Artificial Intelligence | Nov-10-2025

Economic constraints, limited availability of datasets for reproducibility and shortages of specialized expertise have long been recognized as key challenges to the adoption and advancement of predictive maintenance (PdM) in the automotive sector. Recent progress in large language models (LLMs) presents an opportunity to overcome these barriers and speed up the transition of PdM from research to industrial practice. Under these conditions, we explore the potential of LLM-based agents to support PdM cleaning pipelines. Specifically, we focus on maintenance logs, a critical data source for training well-performing machine learning (ML) models, but one often affected by errors such as typos, missing fields, near-duplicate entries, and incorrect dates. We evaluate LLM agents on cleaning tasks involving six distinct types of noise. Our findings show that LLMs are effective at handling generic cleaning tasks and offer a promising foundation for future industrial applications. While domain-specific errors remain challenging, these results highlight the potential for further improvements through specialized training and enhanced agentic capabilities.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.05311

Genre: Research Report > New Finding (1.00)

Industry:

Automobiles & Trucks (1.00)
Transportation > Ground > Road (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Efficient License Plate Recognition via Pseudo-Labeled Supervision with Grounding DINO and YOLOv8

Vargoorani, Zahra Ebrahimi, Ghoreyshi, Amir Mohammad, Suen, Ching Yee

arXiv.org Artificial Intelligence | Oct-30-2025

Developing a highly accurate automatic license plate recognition (ALPR) system is challenging due to environmental factors such as lighting, rain, and dust. Additional difficulties include high vehicle speeds, varying camera angles, and low-quality or low-resolution images. ALPR is vital in traffic control, parking, vehicle tracking, toll collection, and law enforcement applications. This paper proposes a deep learning strategy using YOLOv8 for license plate detection and recognition tasks. The method seeks to enhance the performance of the model using datasets from Ontario, Quebec, California, and New York State. It achieved a recall rate of 94% on the dataset from the Center for Pattern Recognition and Machine Intelligence (CENPARMI) and 91% on the UFPR-ALPR dataset. In addition, our method follows a semi-supervised learning framework, combining a small set of manually labeled data with pseudo-labels generated by Grounding DINO to train our detection model. Grounding DINO, a powerful vision-language model, automatically annotates many images with bounding boxes for license plates, thereby minimizing the reliance on labor-intensive manual labeling. By integrating human-verified and model-generated annotations, we can scale our dataset efficiently while maintaining label quality, which significantly enhances the training process and overall model performance. Furthermore, the paper reports character error rates for both datasets, providing additional insight into system performance.
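The abstract above mentions character error rates without defining them. A minimal sketch of the conventional definition follows: total Levenshtein edit distance between predicted and reference plate strings, divided by the total number of reference characters. The paper may compute it differently, and the function names and plate strings here are hypothetical examples, not data from the paper.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference character
                            curr[j - 1] + 1,      # insert a predicted character
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def character_error_rate(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Total edit distance divided by total reference length."""
    if len(refs) != len(hyps):
        raise ValueError("refs and hyps must have the same length")
    total_chars = sum(len(r) for r in refs)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    return total_edits / total_chars


if __name__ == "__main__":
    # Hypothetical plate strings for illustration only.
    print(character_error_rate(["ABC123", "XYZ789"], ["ABC128", "XYZ789"]))
```

Aggregating edits over the whole set, rather than averaging per-plate rates, keeps short plates from being weighted more heavily than long ones.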

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/MLSP62443.2025.11204315

2510.25032

Country:

North America > Canada > Quebec (0.34)
North America > United States > California (0.24)
North America > Canada > Ontario (0.24)
North America > United States > New York (0.24)

Genre: Research Report (1.00)

Industry: Transportation (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Inter-Class Relational Loss for Small Object Detection: A Case Study on License Plates

Ning, Dian, Han, Dong Seog

arXiv.org Artificial Intelligence | Aug-21-2025

In one-stage multi-object detection tasks, various intersection over union (IoU)-based solutions aim at smooth and stable convergence near the targets during training. However, IoU-based losses fail to correctly update the gradient of small objects due to an extremely flat gradient. During the update of multiple objects, the learning of small objects' gradients suffers more because of insufficient gradient updates. Therefore, we propose an inter-class relational loss to efficiently update the gradient of small objects while not sacrificing the learning efficiency of other objects, based on the simple fact that an object has a spatial relationship to another object (e.g., a car plate is attached to a car in a similar position). When the predicted car plate's bounding box is not within its car, a loss punishment is added to guide the learning, which is inversely proportional to the overlapped area of the car's and predicted car plate's bounding boxes. By leveraging the spatial relationship at the inter-class level, the loss guides small object predictions using larger objects and enhances latent information in deeper feature maps. In this paper, we present twofold contributions using license plate detection as a case study: (1) a new small vehicle multi-license plate dataset (SVMLP), featuring diverse real-world scenarios with high-quality annotations; and (2) a novel inter-class relational loss function designed to promote effective detection performance. We highlight that the proposed ICR loss penalty can be easily added to existing IoU-based losses and enhances performance. These contributions improve the standard mean Average Precision (mAP) metric, achieving gains of 10.3% and 1.6% in mAP$^{\text{test}}_{50}$ for YOLOv12-T and UAV-DETR, respectively, without any additional hyperparameter tuning. Code and dataset will be available soon.
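To make the spatial idea in the abstract above concrete, the sketch below computes a penalty that is zero when a predicted plate box lies fully inside its car box and grows as the overlap shrinks. This is not the paper's ICR loss: the abstract does not give the formula, so the linear form 1 - overlap / plate_area, the name `containment_penalty`, and the box coordinates are all assumptions chosen for illustration.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) with x2 >= x1, y2 >= y1


def area(box: Box) -> float:
    """Area of an axis-aligned box; degenerate boxes have area 0."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def intersection_area(a: Box, b: Box) -> float:
    """Area of the overlap between two axis-aligned boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def containment_penalty(plate: Box, car: Box) -> float:
    """Assumed stand-in penalty: 0 if the plate is inside the car, up to 1 if disjoint."""
    plate_area = area(plate)
    if plate_area == 0.0:
        return 1.0  # a degenerate prediction gets the maximum penalty
    return 1.0 - intersection_area(plate, car) / plate_area


if __name__ == "__main__":
    car = (0.0, 0.0, 100.0, 50.0)  # hypothetical car box
    print(containment_penalty((40.0, 30.0, 60.0, 40.0), car))       # plate inside the car
    print(containment_penalty((90.0, 10.0, 110.0, 20.0), car))      # plate half outside
    print(containment_penalty((200.0, 200.0, 220.0, 210.0), car))   # plate disjoint
```

In a training setup such a term would be added to an existing IoU-based box loss, which matches the abstract's claim that the penalty can be attached to existing losses; the weighting between the two terms is left open here.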

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2508.14343

Country:

Asia > China (0.46)
South America > Brazil (0.28)
Europe > Switzerland (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)

Video-based Vehicle Surveillance in the Wild: License Plate, Make, and Model Recognition with Self Reflective Vision-Language Models

Parsa, Pouya, Li, Keya, Kockelman, Kara M., Choi, Seongjin

arXiv.org Artificial Intelligence | Aug-5-2025

Automatic license plate recognition (ALPR) and vehicle make and model recognition underpin intelligent transportation systems, supporting law enforcement, toll collection, and post-incident investigation. Applying these methods to videos captured by handheld smartphones or non-static vehicle-mounted cameras presents unique challenges compared to fixed installations, including frequent camera motion, varying viewpoints, occlusions, and unknown road geometry. Traditional ALPR solutions, dependent on specialized hardware and handcrafted OCR pipelines, often degrade under these conditions. Recent advances in large vision-language models (VLMs) enable direct recognition of textual and semantic attributes from arbitrary imagery. This study evaluates the potential of VLMs for ALPR and make and model recognition using monocular videos captured with handheld smartphones and non-static mounted cameras. The proposed license plate recognition pipeline filters to sharp frames, then sends a multimodal prompt to a VLM using several prompt strategies. The make and model recognition pipeline runs the same VLM with a revised prompt and an optional self-reflection module. In the self-reflection module, the model contrasts the query image with a reference from a 134-class dataset, correcting mismatches. Experiments on a smartphone dataset collected on the campus of the University of Texas at Austin achieve top-1 accuracies of 91.67% for ALPR and 66.67% for make and model recognition. On the public UFPR-ALPR dataset, the approach attains 83.05% and 61.07%, respectively. The self-reflection module further improves results by 5.72% on average for make and model recognition. These findings demonstrate that VLMs provide a cost-effective solution for scalable, in-motion traffic video analysis.

large language model, machine learning, recognition, (19 more...)

arXiv.org Artificial Intelligence

2508.01387

Country: North America > United States > Texas > Travis County > Austin (0.34)

Genre: Research Report > New Finding (1.00)

Industry: