Buying Warner Bros. Gives Netflix What It's Always Needed: An Identity

WIRED

The $83 billion deal gives the streamer a century's worth of prestige television and movies, Batman included. It also ends the streaming wars. In a deal to acquire Warner Bros. announced Friday, Netflix will be scooping up HBO's many titles. Close your eyes, think for a minute, and tell me: What is a Netflix Movie? OK, try again: What is a Netflix Show? Sure, it's easy to rattle off some killer titles, but Netflix has never really had a brand identity.


How Many Samples are Needed to Estimate a Convolutional Neural Network?

Neural Information Processing Systems

A widespread folklore for explaining the success of Convolutional Neural Networks (CNNs) is that CNNs use a more compact representation than the Fully-connected Neural Network (FNN) and thus require fewer training samples to accurately estimate their parameters. We initiate the study of rigorously characterizing the sample complexity of estimating CNNs. We show that for an $m$-dimensional convolutional filter with linear activation acting on a $d$-dimensional input, the sample complexity of achieving a population prediction error of $\epsilon$ is $\widetilde{O}(m/\epsilon^2)$, whereas the sample complexity for its FNN counterpart is lower bounded by $\Omega(d/\epsilon^2)$ samples. Since in typical settings $m \ll d$, this result demonstrates the advantage of using a CNN. We further consider the sample complexity of estimating a one-hidden-layer CNN with linear activation where both the $m$-dimensional convolutional filter and the $r$-dimensional output weights are unknown. For this model, we show that the sample complexity is $\widetilde{O}\left((m+r)/\epsilon^2\right)$ when the ratio between the stride size and the filter size is a constant. For both models, we also present lower bounds showing our sample complexities are tight up to logarithmic factors. Our main tools for deriving these results are a localized empirical process analysis and a new lemma characterizing the convolutional structure. We believe that these tools may inspire further developments in understanding CNNs.
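
To make the parameter-count contrast behind the $\widetilde{O}(m/\epsilon^2)$ versus $\Omega(d/\epsilon^2)$ bounds concrete, here is a minimal runnable sketch (an illustration, not the paper's exact setup): a linear convolutional predictor shares one $m$-dimensional filter across strided patches, so it has $m$ learnable parameters, while a linear fully-connected predictor has $d$.

```python
# Minimal sketch (illustrative, not the paper's setup): a linear
# convolutional filter reuses one m-dimensional weight vector across
# patches, while a linear fully-connected model learns d weights.
import numpy as np

d, m, stride = 32, 4, 4          # input dim, filter size, stride (illustrative)
rng = np.random.default_rng(0)
x = rng.normal(size=d)

w_cnn = rng.normal(size=m)       # m parameters, shared across patches
w_fnn = rng.normal(size=d)       # d parameters, one per input coordinate

def linear_cnn(x, w, stride):
    """Linear convolution: sum of filter responses over strided patches."""
    patches = [x[i:i + len(w)] for i in range(0, len(x) - len(w) + 1, stride)]
    return sum(float(w @ p) for p in patches)

y_cnn = linear_cnn(x, w_cnn, stride)
y_fnn = float(w_fnn @ x)         # fully-connected linear predictor

print(f"CNN params: {w_cnn.size}, FNN params: {w_fnn.size}")  # 4 vs 32
```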


Attention Learning is Needed to Efficiently Learn Parity Function

Han, Yaomengxi, Ghoshdastidar, Debarghya

arXiv.org Artificial Intelligence

Transformers, with their attention mechanisms, have emerged as the state-of-the-art architectures for sequential modeling and empirically outperform feed-forward neural networks (FFNNs) across many fields, such as natural language processing and computer vision. However, their generalization ability, particularly for low-sensitivity functions, remains less studied. We bridge this gap by analyzing transformers on the $k$-parity problem. Daniely and Malach (NeurIPS 2020) show that FFNNs with one hidden layer and $O(nk^7 \log k)$ parameters can learn $k$-parity, where the input length $n$ is typically much larger than $k$. In this paper, we prove that FFNNs require at least $\Omega(n)$ parameters to learn $k$-parity, while transformers require only $O(k)$ parameters, well below the theoretical lower bound that FFNNs need. We further prove that this parameter efficiency cannot be achieved with fixed attention heads. Our work establishes transformers as theoretically superior to FFNNs for learning parity functions, showing how their attention mechanisms enable parameter-efficient generalization on functions with low sensitivity.
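
For readers unfamiliar with the target class, here is a minimal sketch of the $k$-parity function under the usual $\pm 1$ encoding; the variable names and the brute-force labeling loop are illustrative assumptions, not taken from the paper.

```python
# Sketch of the k-parity target (my illustration, not the paper's code):
# for inputs x in {-1, +1}^n and a hidden index set S with |S| = k, the
# label is the product of the coordinates in S (i.e., their XOR). The
# function has low sensitivity: only k of the n bits matter.
import random

n, k = 10, 3
S = random.Random(0).sample(range(n), k)   # hidden relevant coordinates

def k_parity(x, S):
    """Label = product of the k relevant +/-1 coordinates."""
    label = 1
    for i in S:
        label *= x[i]
    return label

# Label a few random +/-1 inputs.
rng = random.Random(1)
for _ in range(3):
    x = [rng.choice((-1, 1)) for _ in range(n)]
    print(x, "->", k_parity(x, S))
```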


Reviews: How Many Samples are Needed to Estimate a Convolutional Neural Network?

Neural Information Processing Systems

The authors consider the number of samples needed to achieve an error of epsilon when learning an m-dimensional convolutional filter, as well as such a filter followed by a linear projection. This is motivated by a desire to rigorously understand the empirical success of CNNs. The paper seems technically correct, yet I believe the setting is so far from real CNNs that it is not clear whether the results will be impactful. The authors only consider a linear convolution layer, which under their model corresponds to a Wiener-filtering-like operation that removes noise when estimating the label. My concerns are with the motivation, the novelty, and the assumptions.


Why A Human-First Mindset Is Needed For AI

#artificialintelligence

Business leaders and workers have been talking for a long time about how artificial intelligence (AI) might change the future of work. More recently, the development of more sophisticated AI tools has led to a heightened level of discussion on the topic. In addition, ground-breaking advancements, as we see from the rapid rise of ChatGPT, have brought the conversation mainstream. I sat down with Kate O'Neill, best-selling author and founder of KO Insights, who holds a perspective on AI not widely shared among senior leaders and executives. Known as the "Tech Humanist," Kate believes that AI should optimize the human experience, not replace it. She believes businesses need to think beyond how technology can help them meet their business goals.


NEEDED: Introducing Hierarchical Transformer to Eye Diseases Diagnosis

Ye, Xu, Xiao, Meng, Ning, Zhiyuan, Dai, Weiwei, Cui, Wenjuan, Du, Yi, Zhou, Yuanchun

arXiv.org Artificial Intelligence

With the development of natural language processing (NLP) techniques, automatic diagnosis of eye diseases using ophthalmology electronic medical records (OEMR) has become possible. The task is to evaluate the condition of each of a patient's eyes separately, and we formulate it in this paper as a particular multi-label classification task. Although there are a few related studies on other diseases, automatic diagnosis of eye diseases exhibits unique characteristics. First, descriptions of both eyes are mixed up in OEMR documents, with both free text and templated asymptomatic descriptions, resulting in sparse and cluttered information. Second, OEMR documents contain multiple sections of descriptions and have long document lengths. Third, it is critical that the disease diagnosis model provide explainability. To overcome these challenges, we present an effective automatic eye disease diagnosis framework, NEEDED. In this framework, a preprocessing module is integrated to improve the density and quality of information. Then, we design a hierarchical transformer structure for learning contextualized representations of each sentence in the OEMR document. For the diagnosis part, we propose an attention-based predictor that enables traceable diagnosis by obtaining disease-specific information. Experiments on a real-world dataset and comparisons with several baseline models show the advantages and explainability of our framework.
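
To illustrate the two-level encoding plus attention-based multi-label prediction described above, here is a minimal PyTorch sketch. All module names, dimensions, and pooling choices are my own assumptions for illustration, not the authors' NEEDED implementation.

```python
# Hedged sketch of a hierarchical transformer diagnoser: tokens are
# contextualized within each sentence, sentence vectors are contextualized
# across the document, and one learned query per disease attends over
# sentences, making the evidence for each label traceable.
import torch
import torch.nn as nn

class HierarchicalDiagnoser(nn.Module):
    def __init__(self, vocab_size, d_model=128, n_heads=4, n_labels=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # Level 1: contextualize tokens within each sentence.
        self.sent_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True),
            num_layers=2)
        # Level 2: contextualize sentence vectors across the document.
        self.doc_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True),
            num_layers=2)
        # One learned query per disease label attends over sentences.
        self.label_queries = nn.Parameter(torch.randn(n_labels, d_model))
        self.label_attn = nn.MultiheadAttention(d_model, n_heads,
                                                batch_first=True)
        self.classifier = nn.Linear(d_model, 1)

    def forward(self, tokens):
        # tokens: (batch, n_sentences, sentence_len) of token ids
        b, s, t = tokens.shape
        h = self.embed(tokens.view(b * s, t))             # (b*s, t, d)
        h = self.sent_encoder(h).mean(dim=1)              # pool -> sentence vecs
        sents = self.doc_encoder(h.view(b, s, -1))        # (b, s, d)
        q = self.label_queries.unsqueeze(0).expand(b, -1, -1)
        evidence, attn = self.label_attn(q, sents, sents) # per-label evidence
        logits = self.classifier(evidence).squeeze(-1)    # (b, n_labels)
        return logits, attn  # attn shows which sentences drove each label

model = HierarchicalDiagnoser(vocab_size=1000)
logits, attn = model(torch.randint(0, 1000, (2, 6, 12)))  # 2 docs, 6 sents
print(logits.shape, attn.shape)  # torch.Size([2, 8]) torch.Size([2, 8, 6])
```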


Needed: More Worker Involvement In Artificial Intelligence Initiatives

#artificialintelligence

Despite all the panicky warnings seen in the mainstream media, AI will not be taking over and automating people's jobs. AI will be replacing manual tasks, not job categories. However, something very important is missing from the picture: the involvement of the employees who will be charged with making AI and data-driven enterprises work. AI is only ramping up demand for the human talent needed to guide AI systems toward tasks relevant to the business, to monitor and maintain the fairness and actionability of AI decisions, and to build, program, update, and ultimately retire these systems. That's one of the takeaways of Deloitte's latest research on the state of AI, which finds a lack of employee input into the ways AI will be deployed and what it will deliver.


Diverse Teams Are Needed to Save the Planet

WIRED

Engineering has a white-male problem. Women make up just 14.5 percent of the engineering workforce in the United Kingdom, with ethnic minorities constituting just 8 percent. For Lila Ibrahim, chief operating officer at DeepMind, and Hayaatun Sillem, CEO of the Royal Academy of Engineering, being women of color meant the odds were stacked against them in their industry. But for Sillem, the first woman and the first member of an ethnic minority to hold her position, coming from such a diverse background helped her "to build empathy into her life", a trait she describes as a superpower. And as for Ibrahim, the daughter of immigrants to the United States, she always felt like the "oddball" growing up in midwestern America.


To Overcome DevOps Problems, More AI Skills Are Needed - AI Magazine

#artificialintelligence

Artificial intelligence promises to strengthen intelligence within companies, and it can do the same for IT shops. For example, AIOps (artificial intelligence for IT operations) applies AI and machine learning to data from IT processes, sifting through noise to detect, highlight, and prevent problems. AI and machine learning are also finding their place in another emerging area of IT: helping DevOps teams ensure the viability and quality of software that moves at ever-increasing speed through the pipeline and out to users. As a recent survey by GitHub indicates, development and operations teams are turning to AI in large numbers to streamline code flow in the software review and testing phases. The survey also reveals that 37% of teams are using AI/ML in software testing (up from 25% previously), and another 20% plan to use it this year.