AITopics | object identification

Collaborating Authors

object identification

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

RoboEye: Enhancing 2D Robotic Object Identification with Selective 3D Geometric Keypoint Matching

Zhang, Xingwu, Li, Guanxuan, Zhang, Zhuocheng, Long, Zijun

arXiv.org Artificial IntelligenceSep-19-2025

The rapidly growing number of product categories in large-scale e-commerce makes accurate object identification for automated packing in warehouses substantially more difficult. As the catalog grows, intra-class variability and a long tail of rare or visually similar items increase, and when combined with diverse packaging, cluttered containers, frequent occlusion, and large viewpoint changes-these factors amplify discrepancies between query and reference images, causing sharp performance drops for methods that rely solely on 2D appearance features. Thus, we propose RoboEye, a two-stage identification framework that dynamically augments 2D semantic features with domain-adapted 3D reasoning and lightweight adapters to bridge training deployment gaps. In the first stage, we train a large vision model to extract 2D features for generating candidate rankings. A lightweight 3D-feature-awareness module then estimates 3D feature quality and predicts whether 3D re-ranking is necessary, preventing performance degradation and avoiding unnecessary computation. When invoked, the second stage uses our robot 3D retrieval transformer, comprising a 3D feature extractor that produces geometry-aware dense features and a keypoint-based matcher that computes keypoint-correspondence confidences between query and reference images instead of conventional cosine-similarity scoring. Experiments show that RoboEye improves Recall@1 by 7.1% over the prior state of the art (RoboLLM). Moreover, RoboEye operates using only RGB images, avoiding reliance on explicit 3D inputs and reducing deployment costs. The code used in this paper is publicly available at: https://github.com/longkukuhi/RoboEye.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2509.14966

Genre: Research Report (0.82)

Industry: Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Beyond Object Identification: A Giant-Leap into Pattern Discovery in Imagery Data

#artificialintelligenceSep-3-2022, 05:33:14 GMT

A critical question that arises after identifying the objects (or class labels) in an imagery database is: "How are the various objects discovered in an imagery database correlated with one another?" This article tries to answer this question by providing a generic framework that can facilitate the readers to discover hidden correlations between objects in the imagery database. The portion of this article is drawn from our work published in IEEE BIGDATA 2021 [1].) The framework to discover the correlation between the objects in an imagery database is shown in Figure 1. Demonstration: In this demo, we first pass the image data into a trained model (e.g., resnet50) and extract objects and their scores.

class label, database, transactional database, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.69)
Information Technology > Artificial Intelligence > Vision (0.51)

Add feedback

30X Optical Zoom 1080P with Object Identification and Tracking Gimbal Camera for Drone UAV

#artificialintelligenceDec-27-2021, 09:30:08 GMT

SEEKER-30 AI-TIR supports dual sensors object identification and tracking based on deep learning algorithm and ECO tracking algorithm. It has an AI object identification and tracking module, with which SEEKER-30 AI-TIR can realize car, human automatic recognition and tracking by choosing the corresponding tracking mode. SEEKER-30 AI-TIR can be controlled via sbus, serial port. Functions like target tracking or pseudo-color pattern switching can be realized via sbus control. SEEKER-30 AI-TIR supports Max.128G storage.

drone uav, identification and tracking gimbal camera, object identification, (1 more...)

#artificialintelligence

Industry: Media > Photography (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.77)

Add feedback