AITopics | Instructional Material

Collaborating Authors

Instructional Material

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Neural Information Processing SystemsMay-28-2025, 18:02:16 GMT

Existing Multimodal Large Language Models (MLLMs) increasingly emphasize complex understanding of various visual elements, including multiple objects, text information, and spatial relations.

information, large language model, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > China (0.46)
Europe > Switzerland > Zürich > Zürich (0.14)

Genre:

Research Report (0.46)
Instructional Material (0.46)

Industry:

Leisure & Entertainment (0.67)
Education (0.67)
Social Sector (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

Fairness and Efficiency in Online Class Matching MohammadTaghi Hajiaghayi Shayan Chashm Jahan Mohammad Sharifi University of Maryland University of Maryland Sharif University of Technology Suho Shin

Neural Information Processing SystemsMay-28-2025, 17:58:09 GMT

The online bipartite matching problem, extensively studied in the literature, deals with the allocation of online arriving vertices (items) to a predetermined set of offline vertices (agents). However, little attention has been given to the concept of class fairness, where agents are categorized into different classes, and the matching algorithm must ensure equitable distribution across these classes. We here focus on randomized algorithms for the fair matching of indivisible items, subject to various definitions of fairness. Our main contribution is the first (randomized) non-wasteful algorithm that simultaneously achieves a 1/2 approximation to class envy-freeness (CEF) while simultaneously ensuring an equivalent approximation to the class proportionality (CPROP) and utilitarian social welfare (USW) objectives. We supplement this result by demonstrating that no non-wasteful algorithm can achieve an α-CEF guarantee for α > 0.761. In a similar vein, we provide a novel input instance for deterministic divisible matching that demonstrates a nearly tight CEF approximation. Lastly, we define the "price of fairness," which represents the trade-off between optimal and fair matching. We demonstrate that increasing the level of fairness in the approximation of the solution leads to a decrease in the objective of maximizing USW, following an inverse proportionality relationship.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland (0.76)

Genre:

Research Report > Experimental Study (0.93)
Instructional Material (0.76)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise

Neural Information Processing SystemsMay-28-2025, 17:51:51 GMT

Deep neural networks have demonstrated remarkable performance in various vision tasks, but their success heavily depends on the quality of the training data. Noisy labels are a critical issue in medical datasets and can significantly degrade model performance. Previous clean sample selection methods have not utilized the well pre-trained features of vision foundation models (VFMs) and assumed that training begins from scratch. In this paper, we propose CUFIT, a curriculum fine-tuning paradigm of VFMs for medical image classification under label noise. Our method is motivated by the fact that linear probing of VFMs is relatively unaffected by noisy samples, as it does not update the feature extractor of the VFM, thus robustly classifying the training samples. Subsequently, curriculum fine-tuning of two adapters is conducted, starting with clean sample selection from the linear probing phase. Our experimental results demonstrate that CUFIT outperforms previous methods across various medical image benchmarks.

artificial intelligence, machine learning, noisy label, (18 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands (0.14)
Asia > China (0.14)

Genre:

Research Report > Experimental Study (1.00)
Instructional Material > Course Syllabus & Notes (0.71)
Research Report > New Finding (0.66)
Instructional Material > Online (0.62)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Power of Resets in Online Reinforcement Learning

Neural Information Processing SystemsMay-28-2025, 14:08:52 GMT

Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access--particularly in high-dimensional domains that require general function approximation. We explore the power of simulators through online reinforcement learning with local simulator access (or, local planning), an RL protocol where the agent is allowed to reset to previously observed states and follow their dynamics during training. We use local simulator access to unlock new statistical guarantees that were previously out of reach: 1.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.13)
Europe > United Kingdom (0.13)

Genre:

Research Report > Experimental Study (0.92)
Instructional Material > Online (0.60)

Industry: Leisure & Entertainment > Games (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Develop valuable data visualization skills and learn to code for only 50

If you feel like tech advances have passed you by because you've never learned to code or use AI, you could not be more wrong. Thank goodness it's no longer necessary to return to school to develop new skills. You can now learn valuable data wrangling skills and learn how to code with the Microsoft Visual Studio Professional 2022 The Premium Learn to Code Certification Bundle. It should be no surprise that Microsoft Visual Studio Professional 2022 has a perfect 5-star rating on Microsoft Choice Software. The Live Share feature makes collaboration seamless, CodeLens provides deep insights from your code, and Intellicode tops it all off by allowing you to type less while coding more.

artificial intelligence, machine learning, natural language, (11 more...)

Popular Science

Genre: Instructional Material > Course Syllabus & Notes (0.37)

Industry: Education > Educational Setting (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Neural Information Processing SystemsMay-28-2025, 09:11:08 GMT

The efficacy of large language models (LLMs) on downstream tasks usually hinges on instruction tuning, which relies critically on the quality of training data. Unfortunately, collecting high-quality and diverse data is both expensive and timeconsuming. To mitigate this issue, we propose a novel Star-Agents framework, which automates the enhancement of data quality across datasets through multiagent collaboration and assessment. The framework adopts a three-pronged strategy. It initially generates diverse instruction data with multiple LLM agents through a bespoke sampling method. Subsequently, the generated data undergo a rigorous evaluation using a dual-model method that assesses both difficulty and quality. Finaly, the above process evolves in a dynamic refinement phase, where more effective LLMs are prioritized, enhancing the overall data quality. Our empirical studies, including instruction tuning experiments with models such as Pythia and LLaMA, demonstrate the effectiveness of the proposed framework. Optimized datasets have achieved substantial improvements, with an average increase of 12% and notable gains in specific metrics, such as a 40% improvement in Fermi, as evidenced by benchmarks like MT-bench, Vicuna bench, and WizardLM testset.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Asia > China (0.14)
North America > Canada (0.14)
Asia > Middle East > UAE (0.14)

Genre:

Research Report > Experimental Study (1.00)
Instructional Material (0.67)

Industry: Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Subset Selection and Summarization in Sequential Data

Ehsan Elhamifar, M. Clara De Paolis Kaluza

Neural Information Processing SystemsMay-28-2025, 03:02:00 GMT

Subset selection, which is the task of finding a small subset of representative items from a large ground set, finds numerous applications in different areas. Sequential data, including time-series and ordered data, contain important structural relationships among items, imposed by underlying dynamic models of data, that should play a vital role in the selection of representatives. However, nearly all existing subset selection techniques ignore underlying dynamics of data and treat items independently, leading to incompatible sets of representatives. In this paper, we develop a new framework for sequential subset selection that finds a set of representatives compatible with the dynamic models of data. To do so, we equip items with transition dynamic models and pose the problem as an integer binary optimization over assignments of sequential items to representatives, that leads to high encoding, diversity and transition potentials. Our formulation generalizes the well-known facility location objective to deal with sequential data, incorporating transition dynamics among facilities. As the proposed formulation is non-convex, we derive a max-sum message passing algorithm to solve the problem efficiently. Experiments on synthetic and real data, including instructional video summarization, show that our sequential subset selection framework not only achieves better encoding and diversity than the state of the art, but also successfully incorporates dynamics of data, leading to compatible representatives.

data mining, machine learning, selection, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre: Instructional Material > Course Syllabus & Notes (0.67)

Industry: Education > Educational Technology > Audio & Video (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Information Management (0.77)
(2 more...)

Add feedback

The Numerics of GANs

Neural Information Processing SystemsMay-28-2025, 00:54:41 GMT

In this paper, we analyze the numerics of common algorithms for training Generative Adversarial Networks (GANs). Using the formalism of smooth two-player games we analyze the associated gradient vector field of GAN training objectives. Our findings suggest that the convergence of current algorithms suffers due to two factors: i) presence of eigenvalues of the Jacobian of the gradient vector field with zero real-part, and ii) eigenvalues with big imaginary part. Using these findings, we design a new algorithm that overcomes some of these limitations and has better convergence properties. Experimentally, we demonstrate its superiority on training common GAN architectures and show convergence on GAN architectures that are known to be notoriously hard to train.

artificial intelligence, eigenvalue, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > Canada > Quebec (0.14)

Genre:

Research Report > New Finding (0.54)
Instructional Material (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Online Reinforcement Learning in Stochastic Games

Chen-Yu Wei, Yi-Te Hong, Chi-Jen Lu

Neural Information Processing SystemsMay-28-2025, 00:25:26 GMT

We study online reinforcement learning in average-reward stochastic games (SGs). An SG models a two-player zero-sum game in a Markov environment, where state transitions and one-step payoffs are determined simultaneously by a learner and an adversary. We propose the UCSG algorithm that achieves a sublinear regret compared to the game value when competing with an arbitrary opponent. This result improves previous ones under the same setting. The regret bound has a dependency on the diameter, which is an intrinsic value related to the mixing property of SGs. If we let the opponent play an optimistic best response to the learner, UCSG finds an ε-maximin stationary policy with a sample complexity of Õ (poly(1/ε)), where ε is the gap to the best policy.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Taiwan (0.40)
North America > United States (0.28)
Europe > United Kingdom > England (0.14)

Genre: Instructional Material > Online (0.60)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Neural Information Processing SystemsMay-28-2025, 00:18:14 GMT

Online kernel learning (OKL) is a flexible framework for prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces often contains an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only first order methods accumulate O( T) more loss than the optimal function, but the curse of kernelization results in a O(t) per-step complexity.

artificial intelligence, machine learning, pro-n-kon, (13 more...)

Neural Information Processing Systems

Country: Europe > France (0.28)

Genre: Instructional Material > Online (0.61)

Industry: