 argsort



Hierarchical classification at multiple operating points

Neural Information Processing Systems

Figure 4: Impact of loss hyper-parameters on the trade-off with iNat21-Mini (correct vs. recall); Table 3 outlines the parametrisation corresponding to each loss function.
Table 3: Definition and properties of the parametrisations used by each loss function (flat softmax, HXE [2], ...).
Algorithm 1: Procedure for finding the ordered Pareto set; square brackets denote array elements (subscripts were used in the main text).


A Details on the Weighting Function

Neural Information Processing Systems

Only when kN < 1 does this fail to hold. Finally, we discuss some potential questions about the rank-based weighting. Why do the weights need to be normalized? By normalizing the weights, it is easier to identify hyperparameter settings that work robustly across different problems, thereby allowing weighted retraining to be applied with minimal tuning. Why not use a weight function directly based on the objective function value?
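The excerpt above refers to normalized rank-based weights and a kN term. As a minimal sketch, assuming a weight of the form w_i proportional to 1/(kN + rank_i) (a common rank-based parametrisation consistent with the kN condition above; the function name and exact form here are illustrative, not taken from the paper):

```python
import numpy as np

def rank_weights(objectives, k):
    """Illustrative rank-based weighting: w_i proportional to 1/(k*N + rank_i),
    normalized to sum to 1. The best (largest) objective gets rank 0 and
    therefore the largest weight."""
    objectives = np.asarray(objectives, dtype=float)
    N = len(objectives)
    order = np.argsort(-objectives)          # indices sorted by decreasing objective
    ranks = np.empty(N, dtype=int)
    ranks[order] = np.arange(N)              # rank 0 = best item
    w = 1.0 / (k * N + ranks)
    return w / w.sum()                       # normalize so weights sum to 1

w = rank_weights([0.2, 0.9, 0.5], k=1e-3)
```

Because the weights depend only on ranks, the same hyperparameter k behaves comparably across objective functions with very different scales, which is the robustness argument the excerpt makes for normalization.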




On ranking via sorting by estimated expected utility

Neural Information Processing Systems

This paper addresses the question of which of these tasks are asymptotically solved by sorting in decreasing order of expected utility, for some suitable notion of utility; equivalently, when is square-loss regression consistent for ranking via score-and-sort?
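The score-and-sort procedure the abstract analyzes is mechanically simple; a minimal sketch (the function name is illustrative, and the scores stand in for a regressor's estimates of expected utility):

```python
import numpy as np

def rank_by_expected_utility(scores):
    """Score-and-sort: return item indices in decreasing order of estimated
    expected utility. A stable sort keeps ties in input order."""
    return np.argsort(-np.asarray(scores, dtype=float), kind="stable")

order = rank_by_expected_utility([0.1, 0.7, 0.4])
# order ranks item 1 first, then item 2, then item 0
```

The paper's question is whether this ranking is asymptotically correct when the scores come from square-loss regression, not how to compute it.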




XicorAttention: Time Series Transformer Using Attention with Nonlinear Correlation

Kimura, Daichi, Izumitani, Tomonori, Kashima, Hisashi

arXiv.org Artificial Intelligence

Various Transformer-based models have been proposed for time series forecasting. These models leverage the self-attention mechanism to capture long-term temporal or variate dependencies in sequences. Existing methods can be divided into two approaches: (1) reducing the computational cost of attention by making the calculations sparse, and (2) reshaping the input data to aggregate temporal features. However, existing attention mechanisms may not adequately capture inherent nonlinear dependencies present in time series data, leaving room for improvement. In this study, we propose a novel attention mechanism based on Chatterjee's rank correlation coefficient, which measures nonlinear dependencies between variables. Specifically, we replace the matrix multiplication in standard attention mechanisms with this rank coefficient to measure the query-key relationship. Since computing Chatterjee's correlation coefficient involves sorting and ranking operations, we introduce a differentiable approximation employing SoftSort and SoftRank. We integrate our proposed mechanism, "XicorAttention," into several state-of-the-art Transformer models. Experimental results on real-world datasets demonstrate that incorporating nonlinear correlation into the attention improves forecasting accuracy by up to approximately 9.1% compared to existing models.
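The hard (non-differentiable) coefficient that SoftSort and SoftRank relax has a simple closed form in the no-ties case: sort the pairs by x, take the ranks r_i of the corresponding y values, and compute xi = 1 - 3 * sum|r_{i+1} - r_i| / (n^2 - 1). A minimal NumPy sketch (the function name is illustrative, not from the paper):

```python
import numpy as np

def chatterjee_xi(x, y):
    """Chatterjee's rank correlation coefficient (no-ties form).
    Sorts the sample by x, ranks the reordered y values, and measures how
    smoothly those ranks vary; values near 1 indicate y is close to a
    (possibly nonlinear) function of x, values near 0 indicate independence."""
    x, y = np.asarray(x), np.asarray(y)
    n = len(x)
    y_sorted = y[np.argsort(x)]                      # reorder y by increasing x
    r = np.argsort(np.argsort(y_sorted)) + 1         # ranks 1..n of reordered y
    return 1.0 - 3.0 * np.abs(np.diff(r)).sum() / (n**2 - 1)
```

Note that for a perfectly monotone sample of size n the coefficient is (n - 2)/(n + 1), approaching 1 only as n grows, and that it measures functional dependence rather than its sign: an exactly decreasing relationship scores the same as an increasing one. The sorting and double-argsort ranking are the operations the paper replaces with SoftSort/SoftRank to make the attention scores differentiable.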


Deep greedy unfolding: Sorting out argsorting in greedy sparse recovery algorithms

Mohammad-Taheri, Sina, Colbrook, Matthew J., Brugiapaglia, Simone

arXiv.org Artificial Intelligence

Gradient-based learning requires (deep) neural networks to be differentiable at every step. This includes model-based architectures constructed by unrolling iterations of an iterative algorithm onto layers of a neural network, known as algorithm unrolling. However, greedy sparse recovery algorithms depend on the non-differentiable argsort operator, which hinders their integration into neural networks. In this paper, we address this challenge in Orthogonal Matching Pursuit (OMP) and Iterative Hard Thresholding (IHT), two popular representative algorithms in this class. We propose permutation-based variants of these algorithms and approximate permutation matrices using "soft" permutation matrices derived from softsort, a continuous relaxation of argsort. We demonstrate, both theoretically and numerically, that Soft-OMP and Soft-IHT, as differentiable counterparts of OMP and IHT that are fully compatible with neural network training, effectively approximate these algorithms with a controllable degree of accuracy. This leads to the development of OMP- and IHT-Net, fully trainable network architectures based on Soft-OMP and Soft-IHT, respectively. Finally, by choosing weights as "structure-aware" trainable parameters, we connect our approach to structured sparse recovery and demonstrate its ability to extract latent sparsity patterns from data.
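The softsort relaxation the abstract relies on replaces the hard sorting permutation with a row-stochastic matrix obtained by a temperature-controlled softmax over pairwise distances (following SoftSort by Prillo and Eisenschlos, 2020; this standalone sketch is not the paper's implementation):

```python
import numpy as np

def softsort(s, tau=1.0):
    """SoftSort: continuous relaxation of the argsort/sorting permutation.
    Returns a row-stochastic matrix P whose rows are softmaxes of the
    negative absolute distances between sorted and unsorted entries;
    as tau -> 0, P approaches the permutation matrix that sorts s in
    decreasing order, so P @ s approaches sort(s) descending."""
    s = np.asarray(s, dtype=float)
    sorted_s = np.sort(s)[::-1]                          # target: decreasing order
    logits = -np.abs(sorted_s[:, None] - s[None, :]) / tau
    logits -= logits.max(axis=1, keepdims=True)          # numerical stability
    P = np.exp(logits)
    return P / P.sum(axis=1, keepdims=True)              # row-wise softmax
```

Every operation here is differentiable in s, so gradients can flow through the (soft) support selection inside unrolled OMP/IHT iterations; the temperature tau controls the trade-off between smoothness and fidelity to the hard argsort.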