overestimate
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > Illinois > Champaign County > Champaign (0.04)
Do Large Language Models Walk Their Talk? Measuring the Gap Between Implicit Associations, Self-Report, and Behavioral Altruism
We investigate whether Large Language Models (LLMs) exhibit altruistic tendencies, and critically, whether their implicit associations and self-reports predict actual altruistic behavior. Using a multi-method approach inspired by human social psychology, we tested 24 frontier LLMs across three paradigms: (1) an Implicit Association Test (IAT) measuring implicit altruism bias, (2) a forced binary choice task measuring behavioral altruism, and (3) a self-assessment scale measuring explicit altruism beliefs. Our key findings are: (1) All models show strong implicit pro-altruism bias (mean IAT = 0.87, p < .0001), confirming models "know" altruism is good. (2) Models behave more altruistically than chance (65.6% vs. 50%, p < .0001), but with substantial variation (48-85%). (3) Implicit associations do not predict behavior (r = .22, p = .29). (4) Most critically, models systematically overestimate their own altruism, claiming 77.5% altruism while acting at 65.6% (p < .0001, Cohen's d = 1.08). This "virtue signaling gap" affects 75% of models tested. Based on these findings, we recommend the Calibration Gap (the discrepancy between self-reported and behavioral values) as a standardized alignment metric. Well-calibrated models are more predictable and behaviorally consistent; only 12.5% of models achieve the ideal combination of high prosocial behavior and accurate self-knowledge.
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.95)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > Illinois > Champaign County > Champaign (0.04)
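The Calibration Gap recommended in the abstract reduces to simple paired statistics. A minimal sketch with hypothetical per-model scores (the paper's actual data is not reproduced here; the numbers below are illustrative only):

```python
import numpy as np

# Hypothetical per-model altruism scores in [0, 1] — NOT the paper's data.
self_reported = np.array([0.80, 0.75, 0.82, 0.70, 0.78, 0.74, 0.81, 0.79])
behavioral    = np.array([0.66, 0.60, 0.70, 0.55, 0.68, 0.62, 0.72, 0.64])

# Calibration Gap: self-reported minus behavioral altruism, per model.
gap = self_reported - behavioral

# Cohen's d for the paired difference: mean gap over its sample std.
d = gap.mean() / gap.std(ddof=1)

print(f"mean gap = {gap.mean():.3f}, Cohen's d = {d:.2f}")
```

A positive mean gap with a large d is exactly the "virtue signaling gap" pattern the abstract reports.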
On the Upper Bounds for the Matrix Spectral Norm
Naumov, Alexey, Rakhuba, Maxim, Ryapolov, Denis, Samsonov, Sergey
We consider the problem of estimating the spectral norm of a matrix using only matrix-vector products. We propose a new Counterbalance estimator that provides upper bounds on the norm and derive probabilistic guarantees on its underestimation. Compared to standard approaches such as the power method, the proposed estimator produces significantly tighter upper bounds in both synthetic and real-world settings. Our method is especially effective for matrices with fast-decaying spectra, such as those arising in deep learning and inverse problems.
- Europe > Russia (0.04)
- Asia > Russia (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
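For context, the standard baseline the abstract compares against can be sketched as follows. This is ordinary power iteration on A^T A using only matrix-vector products — not the proposed Counterbalance estimator — and its Rayleigh-type estimate is a lower bound on the true spectral norm, which is why upper-bound estimators are of interest:

```python
import numpy as np

def power_method_norm(matvec, rmatvec, n, iters=50, seed=0):
    """Estimate ||A||_2 using only products with A and A^T.

    Power iteration on A^T A; the returned value ||A v|| for unit v
    is always <= ||A||_2, i.e., an underestimate.
    """
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(n)
    v /= np.linalg.norm(v)
    for _ in range(iters):
        w = rmatvec(matvec(v))        # one step of power iteration on A^T A
        v = w / np.linalg.norm(w)
    return np.linalg.norm(matvec(v))  # ||Av|| <= ||A||_2 for unit v

# Toy example: true spectral norm is 3.0.
A = np.diag([3.0, 1.0, 0.5])
est = power_method_norm(lambda x: A @ x, lambda x: A.T @ x, 3)
```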
Review for NeurIPS paper: Quantile Propagation for Wasserstein-Approximate Gaussian Processes
Weaknesses: After reading the rebuttals and reviewer discussion, I realise that I was wrong about EP overestimating the variance and about the strength of the paper's empirical results, so I have decided to downgrade my score. I still believe this paper should be accepted, but I am less confident of the matter. Here are the things I changed my mind about, becoming more critical: - Does EP really overestimate the posterior variance? EP should overestimate the *support* of distributions, because the forward KL covers all modes with a (unimodal) Gaussian. But this does not necessarily imply that the variance is overestimated; locally, the variance is matched exactly.
Tech Companies Are Taking Action on AI Election Misinformation. Will it Matter?
The announcement comes a day after Microsoft announced it was also taking a number of steps to protect elections, including offering tools to watermark AI-generated content and deploying a "Campaign Success Team" to advise political campaigns on AI, cybersecurity, and other related issues. Next year will be the most significant year for elections so far this century, with the U.S., India, the U.K., Mexico, Indonesia, and Taiwan all headed to the polls. Although many are concerned about the impact deepfakes and misinformation could have on elections, many experts stress the evidence for their impacts on elections so far is limited at best. Experts welcome the measures taken by tech companies to defend election integrity but say more fundamental changes to political systems will be required to tackle misinformation. Tech companies have come under scrutiny after the role they played in previous elections.
- North America > United States (1.00)
- North America > Mexico (0.25)
- Asia > Taiwan (0.25)
- (7 more...)
- Information Technology (1.00)
- Government > Voting & Elections (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Media > News (1.00)
A novel corrective-source term approach to modeling unknown physics in aluminum extraction process
Robinson, Haakon, Lundby, Erlend, Rasheed, Adil, Gravdahl, Jan Tommy
With the ever-increasing availability of data, there has been an explosion of interest in applying modern machine learning methods to fields such as modeling and control. However, despite the flexibility and surprising accuracy of such black-box models, it remains difficult to trust them. Recent efforts to combine the two approaches aim to develop flexible models that nonetheless generalize well; a paradigm we call Hybrid Analysis and Modeling (HAM). In this work we investigate the Corrective Source Term Approach (CoSTA), which uses a data-driven model to correct a misspecified physics-based model. This enables us to develop models that make accurate predictions even when the underlying physics of the problem is not well understood. We apply CoSTA to model the Hall-Héroult process in an aluminum electrolysis cell. We demonstrate that the method improves both accuracy and predictive stability, yielding an overall more trustworthy model.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Norway > Central Norway > Trøndelag > Trondheim (0.04)
- Europe > France > Hauts-de-France > Nord > Lille (0.04)
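The corrective-source-term idea can be illustrated on a toy problem. The sketch below is not the paper's Hall-Héroult model: the "true" dynamics, the misspecified physics term, and the polynomial fit (a stand-in for the data-driven model) are all invented for illustration:

```python
import numpy as np

def true_rhs(x):
    return -x + 0.5 * np.sin(3 * x)   # full "unknown" physics (toy)

def physics_rhs(x):
    return -x                          # misspecified model: sin term missing

# Fit the correction to observed residuals; a polynomial stands in for
# the data-driven model used in the paper.
x_train = np.linspace(-2, 2, 200)
residual = true_rhs(x_train) - physics_rhs(x_train)
correction = np.poly1d(np.polyfit(x_train, residual, deg=9))

def hybrid_rhs(x):
    return physics_rhs(x) + correction(x)  # CoSTA-style corrected model

x_test = np.linspace(-1.5, 1.5, 50)
err_physics = np.max(np.abs(true_rhs(x_test) - physics_rhs(x_test)))
err_hybrid = np.max(np.abs(true_rhs(x_test) - hybrid_rhs(x_test)))
```

The corrected model tracks the true dynamics far more closely than the physics model alone, which is the qualitative effect the abstract reports.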
14 Loss functions you can use for Regression
In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cost" associated with the event. An optimization problem seeks to minimize a loss function. An objective function is either a loss function or its opposite (in specific domains, variously called a reward function, a profit function, a utility function, a fitness function, etc.), in which case it is to be maximized. The loss function could include terms from several levels of the hierarchy. The kind of loss function you are going to use depends on the kind of problem you are working on, i.e., regression or classification.
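Several of the common regression losses are easy to write down directly. A minimal NumPy sketch of three of them (MSE, MAE, and Huber), with illustrative data containing one outlier:

```python
import numpy as np

def mse(y, y_hat):
    """Mean squared error: heavily penalizes large residuals."""
    return np.mean((y - y_hat) ** 2)

def mae(y, y_hat):
    """Mean absolute error: more robust to outliers."""
    return np.mean(np.abs(y - y_hat))

def huber(y, y_hat, delta=1.0):
    """Huber loss: quadratic for small residuals, linear in the tails."""
    r = np.abs(y - y_hat)
    return np.mean(np.where(r <= delta, 0.5 * r**2, delta * (r - 0.5 * delta)))

y = np.array([1.0, 2.0, 3.0, 10.0])   # last target is an outlier
p = np.array([1.1, 1.9, 3.2, 4.0])
```

On this data MSE is dominated by the single outlier, while MAE and Huber are not — the usual reason for choosing one loss over another.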
Optimal Sublinear Sampling of Spanning Trees and Determinantal Point Processes via Average-Case Entropic Independence
Anari, Nima, Liu, Yang P., Vuong, Thuy-Duong
We design fast algorithms for repeatedly sampling from strongly Rayleigh distributions, which include random spanning tree distributions and determinantal point processes. For a graph $G=(V, E)$, we show how to approximately sample uniformly random spanning trees from $G$ in $\widetilde{O}(\lvert V\rvert)$ time per sample after an initial $\widetilde{O}(\lvert E\rvert)$ time preprocessing. For a determinantal point process on subsets of size $k$ of a ground set of $n$ elements, we show how to approximately sample in $\widetilde{O}(k^\omega)$ time after an initial $\widetilde{O}(nk^{\omega-1})$ time preprocessing, where $\omega<2.372864$ is the matrix multiplication exponent. We even improve the state of the art for obtaining a single sample from determinantal point processes, from the prior runtime of $\widetilde{O}(\min\{nk^2, n^\omega\})$ to $\widetilde{O}(nk^{\omega-1})$. In our main technical result, we achieve the optimal limit on domain sparsification for strongly Rayleigh distributions. In domain sparsification, sampling from a distribution $\mu$ on $\binom{[n]}{k}$ is reduced to sampling from related distributions on $\binom{[t]}{k}$ for $t\ll n$. We show that for strongly Rayleigh distributions, we can achieve the optimal $t=\widetilde{O}(k)$. Our reduction involves sampling from $\widetilde{O}(1)$ domain-sparsified distributions, all of which can be produced efficiently assuming convenient access to approximate overestimates for marginals of $\mu$. Having access to marginals is analogous to having access to the mean and covariance of a continuous distribution, or knowing "isotropy" for the distribution, the key assumption behind the Kannan-Lov\'asz-Simonovits (KLS) conjecture and optimal samplers based on it. We view our result as a moral analog of the KLS conjecture and its consequences for sampling, for discrete strongly Rayleigh measures.
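The paper's samplers are too involved for a short sketch, but the classical exact baseline for one of its target distributions — a single uniformly random spanning tree — is Wilson's loop-erased random walk, shown here for context (the paper's contribution is much faster *repeated* approximate sampling, not this algorithm):

```python
import random

def wilson_ust(adj, seed=0):
    """Sample a uniformly random spanning tree of a connected graph
    via Wilson's loop-erased random walks.

    `adj` maps each vertex to a list of its neighbors; the returned
    dict maps each non-root vertex to its parent in the tree.
    """
    rng = random.Random(seed)
    vertices = list(adj)
    root = vertices[0]
    in_tree = {root}
    parent = {}
    for v in vertices:
        if v in in_tree:
            continue
        # Random walk from v until hitting the tree; overwriting nxt[u]
        # on revisits erases loops implicitly.
        u = v
        nxt = {}
        while u not in in_tree:
            nxt[u] = rng.choice(adj[u])
            u = nxt[u]
        # Retrace the loop-erased path and attach it to the tree.
        u = v
        while u not in in_tree:
            parent[u] = nxt[u]
            in_tree.add(u)
            u = nxt[u]
    return parent

# 4-cycle: every spanning tree has exactly 3 edges.
adj = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
tree = wilson_ust(adj)
```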