AITopics | logarithm

Collaborating Authors

logarithm

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Neural Information Processing SystemsJun-20-2026, 11:47:39 GMT

artificial intelligence, budget, experiment, (18 more...)

Neural Information Processing Systems

Country: Asia (0.46)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

6e016d123b093571bfd086f51d209b8a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 16:32:37 GMT

artificial intelligence, machine learning, mlr, (18 more...)

Neural Information Processing Systems

Country:

Europe > France (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Lebanon (0.04)
Asia > China > Jiangsu Province (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.67)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
(3 more...)

Add feedback

5b4a2146246bc3a3a941f32225bbb792-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 07:27:38 GMT

linearization, posterior, transformation, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback

Robust Bi-Tempered Logistic Loss Based on Bregman Divergences

Neural Information Processing SystemsDec-25-2025, 16:58:34 GMT

We introduce a temperature into the exponential function and replace the softmax output layer of the neural networks by a high-temperature generalization. Similarly, the logarithm in the loss we use for training is replaced by a low-temperature logarithm. By tuning the two temperatures, we create loss functions that are non-convex already in the single layer case. When replacing the last layer of the neural networks by our bi-temperature generalization of the logistic loss, the training becomes more robust to noise. We visualize the effect of tuning the two temperatures in a simple setting and show the efficacy of our method on large datasets. Our methodology is based on Bregman divergences and is superior to a related two-temperature method that uses the Tsallis divergence.

bregman divergence, name change, robust bi-tempered logistic loss, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)

Add feedback

Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares

Rong Zhu

Neural Information Processing SystemsNov-21-2025, 08:02:32 GMT

We draw the data points by random sampling from the full data according to their gradient values.

algorithm, ls solution, probability, (17 more...)

Neural Information Processing Systems

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback

Deep Mean-Shift Priors for Image Restoration

Siavash Arjomand Bigdeli, Matthias Zwicker, Paolo Favaro, Meiguang Jin

Neural Information Processing SystemsNov-21-2025, 07:19:23 GMT

In this paper we introduce a natural image prior that directly represents a Gaussian-smoothed version of the natural image distribution.

artificial intelligence, image restoration, machine learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > United Kingdom > England > Surrey (0.04)

Genre: Research Report (0.47)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Are Foundation Models Useful for Bankruptcy Prediction?

Kostrzewa, Marcin, Furman, Oleksii, Furman, Roman, Tomczak, Sebastian, Zięba, Maciej

arXiv.org Artificial IntelligenceNov-21-2025

Foundation models have shown promise across various financial applications, yet their effectiveness for corporate bankruptcy prediction remains systematically unevaluated against established methods. We study bankruptcy forecasting using Llama-3.3-70B-Instruct and TabPFN, evaluated on large, highly imbalanced datasets of over one million company records from the Visegrád Group. We provide the first systematic comparison of foundation models against classical machine learning baselines for this task. Our results show that models such as XGBoost and CatBoost consistently outperform foundation models across all prediction horizons. LLM-based approaches suffer from unreliable probability estimates, undermining their use in risk-sensitive financial settings. TabPFN, while competitive with simpler baselines, requires substantial computational resources with costs not justified by performance gains. These findings suggest that, despite their generality, current foundation models remain less effective than specialized methods for bankruptcy forecasting.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2511.16375

Country: Europe > Poland (0.47)

Genre: Research Report > New Finding (1.00)

Industry: Banking & Finance > Trading (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.92)

Add feedback

A Implementation Details

Neural Information Processing SystemsNov-15-2025, 05:41:55 GMT

With tangent space optimization, we can use standard Euclidean optimization techniques, and respect the geometry of the manifold. All experiments were run on Intel Cascade Lake CPUs, with microprocessors Intel Xeon Gold 6230 (20 Cores, 40 Threads, 2.1 GHz, 28MB Cache, 125W TDP). The red dot corresponds to the relation addition R . Datasets: Stats about the datasets used in Knowledge graph experiments can be found in Table 4. Results: In addition to the results provided in 6.1, in Table 5 we provide a comparison with other We include ComplEx [77], Tucker [9], and Quaternion [92]. In Figure 6 we add equivalent plots to the ones explained in 6.4 for other relations from Same grid search is applied to baselines.

matrix, opération, vector-valued distance, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Association via Entropy Reduction

Gamst, Anthony, Wilson, Lawrence

arXiv.org Artificial IntelligenceNov-10-2025

Prior to recent successes using neural networks, term frequency-inverse document frequency (tf-idf) was clearly regarded as the best choice for identifying documents related to a query. We provide a different score, aver, and observe, on a dataset with ground truth marking for association, that aver does do better at finding assciated pairs than tf-idf. This example involves finding associated vertices in a large graph and that may be an area where neural networks are not currently an obvious best choice. Beyond this one anecdote, we observe that (1) aver has a natural threshold for declaring pairs as unassociated while tf-idf does not, (2) aver can distinguish between pairs of documents for which tf-idf gives a score of 1.0, (3) aver can be applied to larger collections of documents than pairs while tf-idf cannot, and (4) that aver is derived from entropy under a simple statistical model while tf-idf is a construction designed to achieve a certain goal and hence aver may be more "natural." To be fair, we also observe that (1) writing down and computing the aver score for a pair is more complex than for tf-idf and (2) that the fact that the aver score is naturally scale-free makes it more complicated to interpret aver scores.

artificial intelligence, aver, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2511.04901

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.44)

Add feedback

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Xu, Xiaoang, Wang, Shuo, Han, Xu, Liu, Zhenghao, Wu, Huijia, Li, Peipei, Liu, Zhiyuan, Sun, Maosong, He, Zhaofeng

arXiv.org Artificial IntelligenceOct-21-2025

Large Reasoning Models (LRMs) achieve superior performance by extending the thought length. However, a lengthy thinking trajectory leads to reduced efficiency. Most of the existing methods are stuck in the assumption of overthinking and attempt to reason efficiently by compressing the Chain-of-Thought, but this often leads to performance degradation. To address this problem, we introduce A*-Thought, an efficient tree search-based unified framework designed to identify and isolate the most essential thoughts from the extensive reasoning chains produced by these models. It formulates the reasoning process of LRMs as a search tree, where each node represents a reasoning span in the giant reasoning space. By combining the A* search algorithm with a cost function specific to the reasoning path, it can efficiently compress the chain of thought and determine a reasoning path with high information density and low cost. In addition, we also propose a bidirectional importance estimation mechanism, which further refines this search process and enhances its efficiency beyond uniform sampling. Extensive experiments on several advanced math tasks show that A*-Thought effectively balances performance and efficiency over a huge search space. Specifically, A*-Thought can improve the performance of QwQ-32B by 2.39$\times$ with low-budget and reduce the length of the output token by nearly 50% with high-budget. The proposed method is also compatible with several other LRMs, demonstrating its generalization capability. The code can be accessed at: https://github.com/AI9Stars/AStar-Thought.

artificial intelligence, budget, log 2, (17 more...)

arXiv.org Artificial Intelligence

2505.2455

Country: Asia (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback