AITopics | search cost

Collaborating Authors

search cost

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Q-VLM: Post-training Quantization for Large Vision-Language Models

Neural Information Processing SystemsMar-22-2026, 13:58:45 GMT

In this paper, we propose a post-training quantization framework of large vision-language models (LVLMs) for efficient multi-modal inference. Conventional quantization methods sequentially search the layer-wise rounding functions by minimizing activation discretization errors, which fails to acquire optimal quantization strategy without considering cross-layer dependency. On the contrary, we mine the cross-layer dependency that significantly influences discretization errors of the entire vision-language model, and embed this dependency into optimal quantization strategy searching with low search cost. Specifically, we observe the strong correlation between the activation entropy and the cross-layer dependency concerning output discretization errors. Therefore, we employ the entropy as the proxy to partition blocks optimally, which aims to achieve satisfying trade-offs between discretization errors and the search cost. Moreover, we optimize the visual encoder to disentangle the cross-layer dependency for fine-grained decomposition of search space, so that the search cost is further reduced without harming the quantization accuracy. Experimental results demonstrate that our method compresses the memory by 2.78x and increase generate speed by 1.44x about 13B LLaVA model without performance degradation on diverse multi-modal reasoning tasks.

artificial intelligence, cross-layer dependency, proceedings, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Q-VLM: Post-training Quantization for Large Vision-Language Models

Neural Information Processing SystemsFeb-18-2026, 05:41:32 GMT

large language model, machine learning, quantization, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Bridge the Gap Between Architecture Spaces via A Cross-Domain Predictor

Neural Information Processing SystemsDec-24-2025, 05:53:47 GMT

Neural Architecture Search (NAS) can automatically design promising neural architectures without artificial experience. Though it achieves great success, prohibitively high search cost is required to find a high-performance architecture, which blocks its practical implementation. Neural predictor can directly evaluate the performance of neural networks based on their architectures and thereby save much budget. However, existing neural predictors require substantial annotated architectures trained from scratch, which still consume many computational resources. To solve this issue, we propose a Cross-Domain Predictor (CDP), which is trained based on the existing NAS benchmark datasets (e.g., NAS-Bench-101), but can be used to find high-performance architectures in large-scale search spaces. Particularly, we propose a progressive subspace adaptation strategy to address the domain discrepancy between the source architecture space and the target space. Considering the large difference between two architecture spaces, an assistant space is developed to smooth the transfer process. Compared with existing NAS methods, the proposed CDP is much more efficient. For example, CDP only requires the search cost of 0.1 GPU Days to find architectures with 76.9% top-1 accuracy on ImageNet and 97.51% on CIFAR-10.

architecture, architecture space, cross-domain predictor, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.83)
Information Technology > Artificial Intelligence > Cognitive Science (0.83)

Add feedback

Algorithmic Collusion of Pricing and Advertising on E-commerce Platforms

Zhao, Hangcheng, Berman, Ron

arXiv.org Artificial IntelligenceNov-3-2025

When online sellers use AI learning algorithms to automatically compete on e-commerce platforms, there is concern that they will learn to coordinate on higher than competitive prices. However, this concern was primarily raised in single-dimension price competition. We investigate whether this prediction holds when sellers make pricing and advertising decisions together, i.e., two-dimensional decisions. We analyze competition in multi-agent reinforcement learning, and use a large-scale dataset from Amazon.com to provide empirical evidence. We show that when consumers have high search costs, learning algorithms can coordinate on prices lower than competitive prices, facilitating a win-win-win for consumers, sellers, and platforms. This occurs because algorithms learn to coordinate on lower advertising bids, which lower advertising costs, leading to lower prices and enlarging demand on the platform. We also show that our results generalize to any learning algorithm that uses exploration of price and advertising bids. Consistent with our predictions, an empirical analysis shows that price levels exhibit a negative interaction between estimated consumer search costs and algorithm usage index. We analyze the platform's strategic response and find that reserve price adjustments will not increase platform profits, but commission adjustments will, while maintaining the beneficial outcomes for both sellers and consumers.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2508.08325

Country: North America > United States (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Marketing (1.00)
Law (1.00)
Banking & Finance (1.00)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

cffbaf4f47546ece96bb42c0edda40ee-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 17:09:30 GMT

cross-layer dependency, discretization error, quantization, (13 more...)

Neural Information Processing Systems

Country:

Southern Ocean (0.04)
Indian Ocean (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Communications (0.93)
(2 more...)

Add feedback

BiSLS/SPS: Auto-tune Step Sizes for Stable Bi-level Optimization Chen Fan

Neural Information Processing SystemsOct-9-2025, 02:44:26 GMT

This unified envelope strategy allows for the extension of the algorithms and their convergence guarantees to BO settings.

artificial intelligence, machine learning, optimization problem, (18 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

76cf99d3614e23eabab16fb27e944bf9-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 07:03:40 GMT

accuracy, architecture, imagenet, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.36)

Add feedback

Composer: A Search Framework for Hybrid Neural Architecture Design

Acun, Bilge, Sinha, Prasoon, Ardalani, Newsha, Bae, Sangmin, Golden, Alicia, Lin, Chien-Yu, Madhyastha, Meghana, Sun, Fei, Yadwadkar, Neeraja J., Wu, Carole-Jean

arXiv.org Artificial IntelligenceOct-2-2025

Hybrid model architectures that combine computational primitives (e.g., Attention, MLP) in different ratios have shown promising performance beyond Transformers. Some studies have shown that different interleavings of primitives can affect model quality as well. However, prior works explore the hybrid model architecture design space manually. Due to the large design space and training costs, discovering hybrid models that combine key computational primitives for pre-training is challenging. In this work, we take a principled approach in designing a modular hybrid model architecture search framework -- Composer. Composer explores model architectures at a small scale and extrapolates the top-performing model architectures to a larger scale using our proposed scaling strategies. Using Composer, we discover new hybrid LLM architectures that outperform Llama 3.2. Compared to Llama 3.2 and previous state-of-the-art baselines, the new model architectures consistently reduce validation loss at parameter scales of 350M-3B and improve evaluation accuracy on the downstream tasks by up to 2.8-8.3% (1.1-3.1% on average) while improving both training and inference efficiency.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2510.00379

Country: North America > United States (0.68)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Adapting Neural Architectures Between Domains Y anxi Li1, Zhaohui Y ang 2,3, Yunhe Wang

Neural Information Processing SystemsOct-1-2025, 23:42:00 GMT

Neural architecture search (NAS) has demonstrated impressive performance in automatically designing high-performance neural networks. The power of deep neural networks is to be unleashed for analyzing a large volume of data (e.g.

artificial intelligence, deep learning, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Filters

Collaborating Authors

search cost

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Q-VLM: Post-training Quantization for Large Vision-Language Models

Q-VLM: Post-training Quantization for Large Vision-Language Models

9cf5fff2f85310e6ece5bc3a8489b6fa-Paper-Conference.pdf

Bridge the Gap Between Architecture Spaces via A Cross-Domain Predictor

Algorithmic Collusion of Pricing and Advertising on E-commerce Platforms

cffbaf4f47546ece96bb42c0edda40ee-Paper-Conference.pdf

BiSLS/SPS: Auto-tune Step Sizes for Stable Bi-level Optimization Chen Fan

76cf99d3614e23eabab16fb27e944bf9-AuthorFeedback.pdf

Composer: A Search Framework for Hybrid Neural Architecture Design

Adapting Neural Architectures Between Domains Y anxi Li1, Zhaohui Y ang 2,3, Yunhe Wang