AITopics

Dual-Curriculum Contrastive Multi-Instance Learning for Cancer Prognosis Analysis with Whole Slide Images

Neural Information Processing SystemsMar-27-2025, 14:52:48 GMT

The multi-instance learning (MIL) has advanced cancer prognosis analysis with whole slide images (WSIs). However, current MIL methods for WSI analysis still confront unique challenges. Previous methods typically generate instance representations via a pre-trained model or a model trained by the instances with bag-level annotations, which, however, may not generalize well to the downstream task due to the introduction of excessive label noises and the lack of fine-grained information across multi-magnification WSIs. Additionally, existing methods generally aggregate instance representations as bag ones for prognosis prediction and have no consideration of intra-bag redundancy and inter-bag discrimination. To address these issues, we propose a dual-curriculum contrastive MIL method for cancer prognosis analysis with WSIs. The proposed method consists of two curriculums, i.e., saliency-guided weakly-supervised instance encoding with cross-scale tiles and contrastive-enhanced soft-bag prognosis inference. Extensive experiments on three public datasets demonstrate that our method outperforms state-of-the-art methods in this field.

artificial intelligence, machine learning, representation, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Genre:

Research Report > New Finding (0.94)
Research Report > Experimental Study (0.94)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks

Neural Information Processing SystemsMar-27-2025, 14:52:37 GMT

Softmax attention is the principle backbone of foundation models for various artificial intelligence applications, yet its quadratic complexity in sequence length can limit its inference throughput in long-context settings. To address this challenge, alternative architectures such as linear attention, State Space Models (SSMs), and Recurrent Neural Networks (RNNs) have been considered as more efficient alternatives. While connections between these approaches exist, such models are commonly developed in isolation and there is a lack of theoretical understanding of the shared principles underpinning these architectures and their subtle differences, greatly influencing performance and scalability. In this paper, we introduce the Dynamical Systems Framework (DSF), which allows a principled investigation of all these architectures in a common representation.

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country: Europe > Switzerland > Zürich > Zürich (0.14)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Neural Information Processing SystemsMar-27-2025, 14:52:29 GMT

We propose A-Crab (Actor-Critic Regularized by Average Bellman error), a new practical algorithm for offline reinforcement learning (RL) in complex environments with insufficient data coverage. Our algorithm combines the marginalized importance sampling framework with the actor-critic paradigm, where the critic returns evaluations of the actor (policy) that are pessimistic relative to the offline data and have a small average (importance-weighted) Bellman error. Compared to existing methods, our algorithm simultaneously offers a number of advantages: (1) It achieves the optimal statistical rate of 1/ N--where N is the size of offline dataset--in converging to the best policy covered in the offline dataset, even when combined with general function approximators.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Neural Information Processing SystemsMar-27-2025, 14:52:26 GMT

We propose A-Crab (Actor-Critic Regularized by Average Bellman error), a new practical algorithm for offline reinforcement learning (RL) in complex environments with insufficient data coverage. Our algorithm combines the marginalized importance sampling framework with the actor-critic paradigm, where the critic returns evaluations of the actor (policy) that are pessimistic relative to the offline data and have a small average (importance-weighted) Bellman error. Compared to existing methods, our algorithm simultaneously offers a number of advantages: (1) It achieves the optimal statistical rate of 1/ N--where N is the size of offline dataset--in converging to the best policy covered in the offline dataset, even when combined with general function approximators.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space

Neural Information Processing SystemsMar-27-2025, 14:52:15 GMT

With the widespread application of Large Language Models (LLMs) to various domains, concerns regarding the trustworthiness of LLMs in safety-critical scenarios have been raised, due to their unpredictable tendency to hallucinate and generate misinformation. Existing LLMs do not have an inherent functionality to provide the users with an uncertainty/confidence metric for each response it generates, making it difficult to evaluate trustworthiness. Although several studies aim to develop uncertainty quantification methods for LLMs, they have fundamental limitations, such as being restricted to classification tasks, requiring additional training and data, considering only lexical instead of semantic information, and being prompt-wise but not response-wise. A new framework is proposed in this paper to address these issues.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Europe (0.67)
North America > Mexico > Mexico City (0.14)
North America > United States > Texas (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Education (0.67)
Media > News (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Add feedback

bd6bb13e78da078d8adcabbe6d9ca737-Supplemental-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 14:52:08 GMT

large language model, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

bd6bb13e78da078d8adcabbe6d9ca737-Paper-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 14:52:05 GMT

large language model, machine learning, student model, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science (0.93)

Add feedback

f26b29298ae8acd94bd7e839688e329b-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsMar-27-2025, 14:51:56 GMT

For what purpose was the dataset created? Was there a specific task in mind? This work is partially funded by Cisco and Amazon. Are there multiple types of instances (e.g. Each scenario essentially contains a user's LLM's generated program conforms to the user's intent.

artificial intelligence, dataset, natural language, (18 more...)

Neural Information Processing Systems

Industry:

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs, Xinyu Wang University of Michigan

Neural Information Processing SystemsMar-27-2025, 14:51:54 GMT

Infrastructure-as-Code (IaC), an important component of cloud computing, allows the definition of cloud infrastructure in high-level programs. However, developing IaC programs is challenging, complicated by factors that include the burgeoning complexity of the cloud ecosystem (e.g., diversity of cloud services and workloads), and the relative scarcity of IaC-specific code examples and public repositories. While large language models (LLMs) have shown promise in general code generation and could potentially aid in IaC development, no benchmarks currently exist for evaluating their ability to generate IaC code.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States > Michigan (0.40)

Genre: Research Report > New Finding (0.68)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

Add feedback

9bae70d354793a95fa18751888cea07d-Paper-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 14:51:48 GMT

machine learning, natural language, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: Asia > Singapore (0.28)

Genre: Research Report (0.46)

Industry: Transportation (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)
Information Technology > Artificial Intelligence > Natural Language (0.92)

Add feedback

Filters

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Dual-Curriculum Contrastive Multi-Instance Learning for Cancer Prognosis Analysis with Whole Slide Images

Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space

bd6bb13e78da078d8adcabbe6d9ca737-Supplemental-Conference.pdf

bd6bb13e78da078d8adcabbe6d9ca737-Paper-Conference.pdf

f26b29298ae8acd94bd7e839688e329b-Supplemental-Datasets_and_Benchmarks_Track.pdf

IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs, Xinyu Wang University of Michigan

9bae70d354793a95fa18751888cea07d-Paper-Conference.pdf