AITopics

Country: Asia (0.28)

Genre: Research Report (0.67)

Industry: Health & Medicine > Health Care Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Neural Information Processing SystemsFeb-11-2026, 13:15:43 GMT

CASA: Category-agnostic Skeletal Animal Reconstruction Y uefan Wu1 Zeyuan Chen

Recovering the skeletal shape of an animal from a monocular video is a longstanding challenge.

artificial intelligence, machine learning, reconstruction, (19 more...)

Country:

Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.05)
North America > United States > Illinois > Champaign County > Urbana (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Health Care Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

The New YorkerDec-25-2025, 11:00:00 GMT

What Kind of New World Is Being Born?

What Kind of New World Is Being Born? According to the Gospel of Luke, the Virgin Mary first learns that she'll soon give birth to Christ when she gets an unsolicited visit from an angel. Nice messenger service if you can get it. But before trusty Gabriel can dispense the good news upon which Christmas depends he has to calm the girl down. "Fear not," he says, and, in a way, this sombre reassurance is the Yuletide message in drastic miniature.

album, angelo, current cinema, (12 more...)

The New Yorker

Country:

North America > United States > New York (0.05)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.05)
South America > Peru (0.04)
(4 more...)

Genre: Personal (0.47)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Media > Music (0.94)

Technology: Information Technology > Artificial Intelligence (0.95)

Neural Information Processing SystemsDec-25-2025, 02:10:30 GMT

CASA: Category-agnostic Skeletal Animal Reconstruction

Recovering a skeletal shape from a monocular video is a longstanding challenge. Prevailing nonrigid animal reconstruction methods often adopt a control-point driven animation model and optimize bone transforms individually without considering skeletal topology, yielding unsatisfactory shape and articulation. In contrast, humans can easily infer the articulation structure of an unknown character by associating it with a seen articulated object in their memory. Inspired by this fact, we present CASA, a novel category-agnostic articulated animal reconstruction method. Our method consists of two components, a video-to-shape retrieval process and a neural inverse graphics framework. During inference, CASA first finds a matched articulated shape from a 3D character assets bank so that the input video scores highly with the rendered image, according to a pretrained image-language model. It then integrates the retrieved character into an inverse graphics framework and jointly infers the shape deformation, skeleton structure, and skinning weights through optimization.

casa, category-agnostic skeletal animal reconstruction, name change, (4 more...)

Technology: Information Technology > Artificial Intelligence (0.77)

Lee, Minhyuk, Yoon, HyeKyung, Kang, MyungJoo

CASA: CNN Autoencoder-based Score Attention for Efficient Multivariate Long-term Time-series Forecasting

arXiv.org Artificial IntelligenceMay-6-2025

Multivariate long-term time series forecasting is critical for applications such as weather prediction, and traffic analysis. In addition, the implementation of Transformer variants has improved prediction accuracy. Following these variants, different input data process approaches also enhanced the field, such as tokenization techniques including point-wise, channel-wise, and patch-wise tokenization. However, previous studies still have limitations in time complexity, computational resources, and cross-dimensional interactions. To address these limitations, we introduce a novel CNN Autoencoder-based Score Attention mechanism (CASA), which can be introduced in diverse Transformers model-agnosticically by reducing memory and leading to improvement in model performance. Experiments on eight real-world datasets validate that CASA decreases computational resources by up to 77.7%, accelerates inference by 44.0%, and achieves state-of-the-art performance, ranking first in 87.5% of evaluated metrics.

artificial intelligence, machine learning, natural language, (17 more...)

2505.02011

Genre: Research Report (0.82)

Industry:

Energy > Renewable > Solar (0.46)
Energy > Power Industry (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsJan-18-2025, 16:08:45 GMT

CASA: Category-agnostic Skeletal Animal Reconstruction

casa, category-agnostic skeletal animal reconstruction, inverse graphic framework, (2 more...)

Technology: Information Technology > Artificial Intelligence (0.62)

Daniel, Rui, Verdelho, M. Rita, Barata, Catarina, Santiago, Carlos

Continual Deep Active Learning for Medical Imaging: Replay-Base Architecture for Context Adaptation

arXiv.org Artificial IntelligenceJan-14-2025

Deep Learning for medical imaging faces challenges in adapting and generalizing to new contexts. Additionally, it often lacks sufficient labeled data for specific tasks requiring significant annotation effort. Continual Learning (CL) tackles adaptability and generalizability by enabling lifelong learning from a data stream while mitigating forgetting of previously learned knowledge. Active Learning (AL) reduces the number of required annotations for effective training. This work explores both approaches (CAL) to develop a novel framework for robust medical image analysis. Based on the automatic recognition of shifts in image characteristics, Replay-Base Architecture for Context Adaptation (RBACA) employs a CL rehearsal method to continually learn from diverse contexts, and an AL component to select the most informative instances for annotation. A novel approach to evaluate CAL methods is established using a defined metric denominated IL-Score, which allows for the simultaneous assessment of transfer learning, forgetting, and final model performance. We show that RBACA works in domain and class-incremental learning scenarios, by assessing its IL-Score on the segmentation and diagnosis of cardiac images. The results show that RBACA outperforms a baseline framework without CAL, and a state-of-the-art CAL method across various memory sizes and annotation budgets. Our code is available in https://github.com/RuiDaniel/RBACA .

artificial intelligence, deep learning, machine learning, (16 more...)

2501.08245

Country: Europe (0.46)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

arXiv.org Artificial IntelligenceJun-13-2024

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Casas, Sergio, Agro, Ben, Mao, Jiageng, Gilles, Thomas, Cui, Alexander, Li, Thomas, Urtasun, Raquel

The tasks of object detection and trajectory forecasting play a crucial role in understanding the scene for autonomous driving. These tasks are typically executed in a cascading manner, making them prone to compounding errors. Furthermore, there is usually a very thin interface between the two tasks, creating a lossy information bottleneck. To address these challenges, our approach formulates the union of the two tasks as a trajectory refinement problem, where the first pose is the detection (current time), and the subsequent poses are the waypoints of the multiple forecasts (future time). To tackle this unified task, we design a refinement transformer that infers the presence, pose, and multi-modal future behaviors of objects directly from LiDAR point clouds and high-definition maps. We call this model DeTra, short for object Detection and Trajectory forecasting. In our experiments, we observe that \ourmodel{} outperforms the state-of-the-art on Argoverse 2 Sensor and Waymo Open Dataset by a large margin, across a broad range of metrics. Last but not least, we perform extensive ablation studies that show the value of refinement for this task, that every proposed component contributes positively to its performance, and that key design choices were made.

detection, forecasting, trajectory, (15 more...)

2406.04426

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Transportation (0.48)
Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Artificial IntelligenceMay-29-2023

Conditional Support Alignment for Domain Adaptation with Label Shift

Nguyen, Anh T, Tran, Lam, Tong, Anh, Nguyen, Tuan-Duy H., Tran, Toan

Unsupervised domain adaptation (UDA) refers to a domain adaptation framework in which a learning model is trained based on the labeled samples on the source domain and unlabelled ones in the target domain. The dominant existing methods in the field that rely on the classical covariate shift assumption to learn domain-invariant feature representation have yielded suboptimal performance under the label distribution shift between source and target domains. In this paper, we propose a novel conditional adversarial support alignment (CASA) whose aim is to minimize the conditional symmetric support divergence between the source's and target domain's feature representation distributions, aiming at a more helpful representation for the classification task. We also introduce a novel theoretical target risk bound, which justifies the merits of aligning the supports of conditional feature distributions compared to the existing marginal support alignment approach in the UDA settings. We then provide a complete training process for learning in which the objective optimization functions are precisely based on the proposed target risk bound. Our empirical results demonstrate that CASA outperforms other state-of-the-art methods on different UDA benchmark tasks under label shift conditions.

adaptation, domain adaptation, target domain, (16 more...)

2305.18458

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Middle East > Jordan (0.05)
North America > Canada > Quebec > Montreal (0.04)
Asia > Vietnam (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

arXiv.org Artificial IntelligenceMay-22-2023

Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale

Costa-jussà, Marta R., Andrews, Pierre, Smith, Eric, Hansanti, Prangthip, Ropers, Christophe, Kalbassi, Elahe, Gao, Cynthia, Licht, Daniel, Wood, Carleigh

We introduce a multilingual extension of the HOLISTICBIAS dataset, the largest English template-based taxonomy of textual people references: MULTILINGUALHOLISTICBIAS. This extension consists of 20,459 sentences in 50 languages distributed across all 13 demographic axes. Source sentences are built from combinations of 118 demographic descriptors and three patterns, excluding nonsensical combinations. Multilingual translations include alternatives for gendered languages that cover gendered translations when there is ambiguity in English. Our benchmark is intended to uncover demographic imbalances and be the tool to quantify mitigations towards them. Our initial findings show that translation quality for EN-to-XX translations is an average of 8 spBLEU better when evaluating with the masculine human reference compared to feminine. In the opposite direction, XX-to-EN, we compare the robustness of the model when the source input only differs in gender (masculine or feminine) and masculine translations are an average of almost 4 spBLEU better than feminine. When embedding sentences to a joint multilingual sentence representations space, we find that for most languages masculine translations are significantly closer to the English neutral sentences when embedded.

artificial intelligence, natural language, translation, (20 more...)

2305.13198

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
North America > Dominican Republic (0.04)
(3 more...)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)