AITopics | cad

Collaborating Authors

cad

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings

Neural Information Processing SystemsJun-12-2026, 00:46:38 GMT

We study the task of panoptic symbol spotting, which involves identifying both individual instances of countable \textit{things} and the semantic regions of uncountable \textit{stuff} in computer-aided design (CAD) drawings composed of vector graphical primitives. Existing methods typically rely on image rasterization, graph construction, or point-based representation, but these approaches often suffer from high computational costs, limited generality, and loss of geometric structural information. In this paper, we propose \textit{VecFormer}, a novel method that addresses these challenges through \textit{line-based representation} of primitives. This design preserves the geometric continuity of the original primitive, enabling more accurate shape representation while maintaining a computation-friendly structure, making it well-suited for vector graphic understanding tasks. To further enhance prediction reliability, we introduce a \textit{Branch Fusion Refinement} module that effectively integrates instance and semantic predictions, resolving their inconsistencies for more coherent panoptic outputs. Extensive experiments demonstrate that our method establishes a new state-of-the-art, achieving 91.1 PQ, with Stuff-PQ improved by 9.6 and 21.2 points over the second-best results under settings with and without prior information, respectively--highlighting the strong potential of line-based representation as a foundation for vector graphic understanding.

artificial intelligence, line-based representation, neurips proceedings point, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.39)

Add feedback

Suggesting Variable Order for Cylindrical Algebraic Decomposition via Reinforcement Learning

Neural Information Processing SystemsFeb-17-2026, 21:22:11 GMT

The other is GRL-SVO(NUP), a fast heuristic providing a total order directly.

machine learning, polynomial, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Alaska > Anchorage Municipality > Anchorage (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.82)

Add feedback

Computational Budget Should Be Considered in Data Selection

Wan, Weilin, Zhang, Weizhong, Jin, Cheng

arXiv.org Artificial IntelligenceNov-4-2025

Data selection improves computational efficiency by choosing informative subsets of training samples. However, existing methods ignore the compute budget, treating data selection and importance evaluation independently of compute budget constraints. Yet empirical studies show no algorithm can consistently outperform others (or even random selection) across varying budgets. We therefore argue that compute budget must be integral to data-selection strategies, since different budgets impose distinct requirements on data quantity, quality, and distribution for effective training. To this end, we propose a novel Computational budget-Aware Data Selection (CADS) method and naturally formulate it into a bilevel optimization framework, where the inner loop trains the model within the constraints of the computational budget on some selected subset of training data, while the outer loop optimizes data selection based on model evaluation. Our technical contributions lie in addressing two main challenges in solving this bilevel optimization problem: the expensive Hessian matrix estimation for outer-loop gradients and the computational burden of achieving inner-loop optimality during iterations. To solve the first issue, we propose a probabilistic reparameterization strategy and compute the gradient using a Hessian-free policy gradient estimator. To address the second challenge, we transform the inner optimization problem into a penalty term in the outer objective, further discovering that we only need to estimate the minimum of a one-dimensional loss to calculate the gradient, significantly improving efficiency. Extensive experiments show that our method achieves performance gains of up to 14.42% over baselines in vision and language benchmarks.

machine learning, natural language, selection, (20 more...)

arXiv.org Artificial Intelligence

2510.16806

Country: Asia (0.28)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Text-Enhanced Panoptic Symbol Spotting in CAD Drawings

Liu, Xianlin, Gong, Yan, Li, Bohao, Huang, Jiajing, Du, Bowen, Ye, Junchen, Xu, Liyan

arXiv.org Artificial IntelligenceOct-14-2025

With the widespread adoption of Computer-Aided Design(CAD) drawings in engineering, architecture, and industrial design, the ability to accurately interpret and analyze these drawings has become increasingly critical. Among various subtasks, panoptic symbol spotting plays a vital role in enabling downstream applications such as CAD automation and design retrieval. Existing methods primarily focus on geometric primitives within the CAD drawings to address this task, but they face following major problems: they usually overlook the rich textual annotations present in CAD drawings and they lack explicit modeling of relationships among primitives, resulting in incomprehensive understanding of the holistic drawings. To fill this gap, we propose a panoptic symbol spotting framework that incorporates textual annotations. The framework constructs unified representations by jointly modeling geometric and textual primitives. Then, using visual features extract by pretrained CNN as the initial representations, a Transformer-based backbone is employed, enhanced with a type-aware attention mechanism to explicitly model the different types of spatial dependencies between various primitives. Extensive experiments on the real-world dataset demonstrate that the proposed method outperforms existing approaches on symbol spotting tasks involving textual annotations, and exhibits superior robustness when applied to complex CAD drawings.

annotation, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.11091

Country: Asia > China (0.28)

Genre: Research Report (0.50)

Industry: Information Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

Add feedback

Suggesting Variable Order for Cylindrical Algebraic Decomposition via Reinforcement Learning

Neural Information Processing SystemsOct-9-2025, 11:21:25 GMT

The other is GRL-SVO(NUP), a fast heuristic providing a total order directly.

machine learning, polynomial, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Alaska > Anchorage Municipality > Anchorage (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.82)

Add feedback

CoCoA: Confidence and Context-Aware Adaptive Decoding for Resolving Knowledge Conflicts in Large Language Models

Khandelwal, Anant, Gupta, Manish, Agrawal, Puneet

arXiv.org Artificial IntelligenceAug-28-2025

Faithful generation in large language models (LLMs) is challenged by knowledge conflicts between parametric memory and external context. Existing contrastive decoding methods tuned specifically to handle conflict often lack adaptability and can degrade performance in low conflict settings. We introduce CoCoA (Confidence- and Context-Aware Adaptive Decoding), a novel token-level algorithm for principled conflict resolution and enhanced faithfulness. CoCoA resolves conflict by utilizing confidence-aware measures (entropy gap and contextual peakedness) and the generalized divergence between the parametric and contextual distributions. Crucially, CoCoA maintains strong performance even in low conflict settings. Extensive experiments across multiple LLMs on diverse Question Answering (QA), Summarization, and Long-Form Question Answering (LFQA) benchmarks demonstrate CoCoA's state-of-the-art performance over strong baselines like AdaCAD. It yields significant gains in QA accuracy, up to 9.2 points on average compared to the strong baseline AdaCAD, and improves factuality in summarization and LFQA by up to 2.5 points on average across key benchmarks. Additionally, it demonstrates superior sensitivity to conflict variations. CoCoA enables more informed, context-aware, and ultimately more faithful token generation.

computational linguistic, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2508.1767

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)
North America > Mexico > Mexico City (0.14)

Genre: Research Report > New Finding (0.92)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)
Government > Regional Government > North America Government > United States Government (0.46)
Government > Military (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Facial Foundational Model Advances Early Warning of Coronary Artery Disease from Live Videos with DigitalShadow

Zhou, Juexiao, Han, Zhongyi, Xin, Mankun, He, Xingwei, Wang, Guotao, Song, Jiaoyan, Luo, Gongning, He, Wenjia, Li, Xintong, Chu, Yuetan, Chen, Juanwen, Wang, Bo, Wu, Xia, Duan, Wenwen, Guo, Zhixia, Bai, Liyan, Pan, Yilin, Bi, Xuefei, Liu, Lu, Feng, Long, He, Xiaonan, Gao, Xin

arXiv.org Artificial IntelligenceJun-10-2025

Abstract--Global population aging presents increasing challenges to healthcare systems, with coronary artery disease (CAD) responsible for approximately 17.8 million deaths annually, making it a leading cause of global mortality . As CAD is largely preventable, early detection and proactive management are essential. In this work, we introduce DigitalShadow, an advanced early warning system for CAD, powered by a fine-tuned facial foundation model. The system is pre-trained on 21 million facial images and subsequently fine-tuned into LiveCAD, a specialized CAD risk assessment model trained on 7,004 facial images from 1,751 subjects across four hospitals in China. DigitalShadow functions passively and contactlessly, extracting facial features from live video streams without requiring active user engagement. Integrated with a personalized database, it generates natural language risk reports and individualized health recommendations. With privacy as a core design principle, DigitalShadow supports local deployment to ensure secure handling of user data. The world's population is rapidly ageing [1], with significant implications for the prevalence of chronic diseases such as Coronary Artery Disease (CAD) [2], affecting not only individuals but also families and societies at large [3]. The number of older people is increasing at an unprecedented rate, projected to grow from approximately 761 million in 2021 to 1.6 billion by 2050, which would represent nearly 16% of the global population, according to the UN's W orld Social Report 2023 [4]. The aging population presents numerous challenges, including increased pressure on healthcare systems, pension schemes, and long-term care facilities, alongside potential economic consequences, which together fuel growing demand for healthcare services [5], [6]. With advancing age, people become more vulnerable to various critical diseases [7], such as CAD [8], stroke [9], cancer [10], and Parkinson's disease (PD) [11], [12], [13], leading to considerable morbidity and mortality [14].

digitalshadow, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.06283

Country: Asia > China (0.90)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (0.48)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
(3 more...)

Add feedback

M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark

Zhu, Morui, Zhu, Yongqi, Zhu, Yihao, Chen, Qi, Qu, Deyuan, Fu, Song, Yang, Qing

arXiv.org Artificial IntelligenceMay-13-2025

We introduce M$^3$CAD, a novel benchmark designed to advance research in generic cooperative autonomous driving. M$^3$CAD comprises 204 sequences with 30k frames, spanning a diverse range of cooperative driving scenarios. Each sequence includes multiple vehicles and sensing modalities, e.g., LiDAR point clouds, RGB images, and GPS/IMU, supporting a variety of autonomous driving tasks, including object detection and tracking, mapping, motion forecasting, occupancy prediction, and path planning. This rich multimodal setup enables M$^3$CAD to support both single-vehicle and multi-vehicle autonomous driving research, significantly broadening the scope of research in the field. To our knowledge, M$^3$CAD is the most comprehensive benchmark specifically tailored for cooperative multi-task autonomous driving research. We evaluate the state-of-the-art end-to-end solution on M$^3$CAD to establish baseline performance. To foster cooperative autonomous driving research, we also propose E2EC, a simple yet effective framework for cooperative driving solution that leverages inter-vehicle shared information for improved path planning. We release M$^3$CAD, along with our baseline models and evaluation results, to support the development of robust cooperative autonomous driving systems. All resources will be made publicly available on https://github.com/zhumorui/M3CAD

artificial intelligence, ego vehicle, vehicle, (16 more...)

arXiv.org Artificial Intelligence

2505.06746

Genre: Research Report (0.82)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
Automobiles & Trucks (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

mrCAD: Multimodal Refinement of Computer-aided Designs

McCarthy, William P., Vaduguru, Saujas, Willis, Karl D. D., Matejka, Justin, Fan, Judith E., Fried, Daniel, Pu, Yewen

arXiv.org Artificial IntelligenceApr-30-2025

A key feature of human collaboration is the ability to iteratively refine the concepts we have communicated. In contrast, while generative AI excels at the \textit{generation} of content, it often struggles to make specific language-guided \textit{modifications} of its prior outputs. To bridge the gap between how humans and machines perform edits, we present mrCAD, a dataset of multimodal instructions in a communication game. In each game, players created computer aided designs (CADs) and refined them over several rounds to match specific target designs. Only one player, the Designer, could see the target, and they must instruct the other player, the Maker, using text, drawing, or a combination of modalities. mrCAD consists of 6,082 communication games, 15,163 instruction-execution rounds, played between 1,092 pairs of human players. We analyze the dataset and find that generation and refinement instructions differ in their composition of drawing and text. Using the mrCAD task as a benchmark, we find that state-of-the-art VLMs are better at following generation instructions than refinement instructions. These results lay a foundation for analyzing and modeling a multimodal language of refinement that is not represented in previous datasets.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2504.20294

Genre: Research Report > Experimental Study (0.47)

Industry:

Leisure & Entertainment > Games (0.86)
Information Technology > Software (0.61)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design

Ocker, Felix, Menzel, Stefan, Sadik, Ahmed, Rios, Thiago

arXiv.org Artificial IntelligenceMar-6-2025

In modern product development, Computer Aided Design and Engineering (CAD/E) plays a key role to turn innovative ideas and visions into tangible and manufacturable designs. Digital 2D and 3D geometry representations of objects on different levels of granularity are required in various intermediate development steps, for example aesthetic discussions, design quality evaluations based on simulation tools, and design feasibility checks. For these steps, development teams include various roles such as requirement engineers, style designers, Computer-Aided Design (CAD) experts, simulation domain experts, and quality assurance teams who create a product cooperatively. Stakeholders in these roles utilize software tools to implement digital representations of products, also referred to as digital twins. This process receives an increasing amount of support in the form of Artificial Intelligence (AI) methods. For example, data science methods provide efficient ways to improve the problem understanding, e.g., by calculating design sensitivities towards a certain performance aspect [Gräning and Sendhoff, 2014], or displaying the distribution of design variations in the solution space using clustering [Lanfermann et al., 2020].

agent, specification, vlm, (15 more...)

arXiv.org Artificial Intelligence

2503.04417

Country:

North America > Canada > Manitoba > Westman Region > Brandon (0.04)
Europe > Germany (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Software (0.55)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback