AITopics | display image

Collaborating Authors

display image

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

UnifiedOptimalTransportFrameworkforUniversal DomainAdaptation (SupplementaryMaterial)

Neural Information Processing SystemsFeb-11-2026, 16:31:07 GMT

Recall measures the fraction ofcommon samples that are retrievedascorrect common class, while specificity measures thefraction ofprivatesamples thatarenotretrieved. Fig. S1(b) shows the sensitivity ofγ, where γ is the rough boundary for splitting positive and negative in adaptive filling. For the cosine similarity of two ℓ2-normalized features, the similarity value is limited from 1to1, where higher value indicates higher similarity. Suchself-supervisedlearning methods encourage the consistency between two augmentations of one image. The display images for source prototypes are chosen by finding the nearest source instance of the prototype.

artificial intelligence, prototype, supplementarymaterial, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.68)

Add feedback

CAD2DMD-SET: Synthetic Generation Tool of Digital Measurement Device CAD Model Datasets for fine-tuning Large Vision-Language Models

Valente, João, Dehban, Atabak, Ventura, Rodrigo

arXiv.org Artificial IntelligenceSep-1-2025

Recent advancements in Large Vision-Language Models (LVLMs) have demonstrated impressive capabilities across various multimodal tasks. They continue, however, to struggle with trivial scenarios such as reading values from Digital Measurement Devices (DMDs), particularly in real-world conditions involving clutter, occlusions, extreme viewpoints, and motion blur; common in head-mounted cameras and Augmented Reality (AR) applications. Motivated by these limitations, this work introduces CAD2DMD-SET, a synthetic data generation tool designed to support visual question answering (VQA) tasks involving DMDs. By leveraging 3D CAD models, advanced rendering, and high-fidelity image composition, our tool produces diverse, VQA-labelled synthetic DMD datasets suitable for fine-tuning LVLMs. Additionally, we present DMDBench, a curated validation set of 1,000 annotated real-world images designed to evaluate model performance under practical constraints. Benchmarking three state-of-the-art LVLMs using Average Normalised Levenshtein Similarity (ANLS) and further fine-tuning LoRA's of these models with CAD2DMD-SET's generated dataset yielded substantial improvements, with InternVL showcasing a score increase of 200% without degrading on other tasks. This demonstrates that the CAD2DMD-SET training dataset substantially improves the robustness and performance of LVLMs when operating under the previously stated challenging conditions. The CAD2DMD-SET tool is expected to be released as open-source once the final version of this manuscript is prepared, allowing the community to add different measurement devices and generate their own datasets.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.21732

Country: Europe > Portugal (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.95)
Semiconductors & Electronics (0.60)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.34)

Add feedback

MToFNet: Object Anti-Spoofing with Mobile Time-of-Flight Data

Jeong, Yonghyun, Kim, Doyeon, Lee, Jaehyeon, Hong, Minki, Hwang, Solbi, Choi, Jongwon

arXiv.org Artificial IntelligenceOct-6-2021

In online markets, sellers can maliciously recapture others' images on display screens to utilize as spoof images, which can be challenging to distinguish in human eyes. To prevent such harm, we propose an anti-spoofing method using the paired rgb images and depth maps provided by the mobile camera with a Time-of-Fight sensor. When images are recaptured on display screens, various patterns differing by the screens as known as the moir\'e patterns can be also captured in spoof images. These patterns lead the anti-spoofing model to be overfitted and unable to detect spoof images recaptured on unseen media. To avoid the issue, we build a novel representation model composed of two embedding models, which can be trained without considering the recaptured images. Also, we newly introduce mToF dataset, the largest and most diverse object anti-spoofing dataset, and the first to utilize ToF data. Experimental results confirm that our model achieves robust generalization even across unseen domains.

dataset, display image, tof map, (16 more...)

arXiv.org Artificial Intelligence

2110.04066

Country:

Asia > South Korea > Seoul > Seoul (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(2 more...)

Add feedback

Facebook wants to send 'emotional' robots to explore and scan faces to 'help users make friends'

Daily Mail - Science & techMay-30-2019, 14:04:30 GMT

Facebook is considering building'emotionally sensitive' robots that can explore the world, identify objects and people and enable users to make friends remotely. On-board sensors would allow the robots to spot people to engage with, judge their emotional state and listen to what they are saying, a patent filing revealed. At the same time, the robot could display images, video and speak with people -- potentially letting users meet people and make new friends remotely. However, it is not known whether Facebook will follow through on the patent filing and make the rough robot designs a reality. Facebook is considering building'emotionally sensitive' robots (pictured, in this rough sketch from the patent that the social media firm filed) that can explore the world, identify objects and people and enable users to make friends remotely Cameras to detect faces and interpret emotional states.

artificial intelligence, facebook, social media, (16 more...)

Daily Mail - Science & tech

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback