spam



Sparse Bayesian Message Passing under Structural Uncertainty

Choi, Yoonhyuk, Choi, Jiho, Kim, Chanran, Lee, Yumin, Shin, Hawon, Jeon, Yeowon, Kim, Minjeong, Kang, Jiwoo

arXiv.org Machine Learning

Semi-supervised learning on real-world graphs is frequently challenged by heterophily, where the observed graph is unreliable or label-disassortative. Many existing graph neural networks either rely on a fixed adjacency structure or attempt to handle structural noise through regularization. In this work, we explicitly capture structural uncertainty by modeling a posterior distribution over signed adjacency matrices, allowing each edge to be positive, negative, or absent. We propose a sparse signed message-passing network that is naturally robust to edge noise and heterophily and that admits a Bayesian interpretation. By combining (i) posterior marginalization over signed graph structures with (ii) sparse signed message aggregation, our approach offers a principled way to handle both edge noise and heterophily. Experimental results demonstrate that our method outperforms strong baseline models on heterophilic benchmarks under both synthetic and real-world structural noise. We provide an anonymous repository at: https://anonymous.4open.science/r/SpaM-F2C8
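The abstract's central operation, aggregating over a signed sparse adjacency whose edges may be positive, negative, or absent, can be illustrated with a short sketch. The softmax posterior over three edge states and the fixed sparsification threshold below are assumptions made for illustration, not the paper's actual parameterization.

```python
import numpy as np

def signed_message_passing(X, edge_logits, sparsify_at=0.5):
    """One layer of signed sparse aggregation (illustrative only).

    X           : (n, d) node features
    edge_logits : (n, n, 3) scores for each edge being negative, absent, or positive
    """
    # Posterior over the three edge states via a softmax.
    probs = np.exp(edge_logits - edge_logits.max(axis=-1, keepdims=True))
    probs /= probs.sum(axis=-1, keepdims=True)

    # Expected signed adjacency: marginalize over the states {-1, 0, +1}.
    signs = np.array([-1.0, 0.0, 1.0])
    A_expected = (probs * signs).sum(axis=-1)          # (n, n)

    # Sparsify: drop edges whose expected weight is close to zero.
    A_expected[np.abs(A_expected) < sparsify_at] = 0.0

    # Signed aggregation: positive neighbors pull features together,
    # negative neighbors push them apart.
    return X + A_expected @ X
```

Marginalizing over edge states before aggregation is what makes the layer tolerant of unreliable edges: an edge whose sign is uncertain contributes little either way, while confidently negative edges let the model exploit heterophilic structure rather than fight it.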


Beyond One-Size-Fits-All: Personalized Harmful Content Detection with In-Context Learning

Zhang, Rufan, Zhang, Lin, Mi, Xianghang

arXiv.org Artificial Intelligence

The proliferation of harmful online content--e.g., toxicity, spam, and negative sentiment--demands robust and adaptable moderation systems. However, prevailing moderation systems are centralized and task-specific, offering limited transparency and neglecting diverse user preferences--an approach ill-suited for privacy-sensitive or decentralized environments. We propose a novel framework that leverages in-context learning (ICL) with foundation models to unify the detection of toxicity, spam, and negative sentiment across binary, multi-class, and multi-label settings. Crucially, our approach enables lightweight personalization, allowing users to easily block new categories, unblock existing ones, or extend detection to semantic variations through simple prompt-based interventions--all without model retraining. Extensive experiments on public benchmarks (TextDetox, UCI SMS, SST2) and a new, annotated Mastodon dataset reveal that: (i) foundation models achieve strong cross-task generalization, often matching or surpassing task-specific fine-tuned models; (ii) effective personalization is achievable with as few as one user-provided example or definition; and (iii) augmenting prompts with label definitions or rationales significantly enhances robustness to noisy, real-world data. Our work demonstrates a definitive shift beyond one-size-fits-all moderation, establishing ICL as a practical, privacy-preserving, and highly adaptable pathway for the next generation of user-centric content safety systems. To foster reproducibility and facilitate future research, we publicly release our code on GitHub and the annotated Mastodon dataset on Hugging Face.
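The personalization mechanism the abstract describes, blocking or extending categories purely through prompt edits, can be sketched as simple prompt construction. The category names, definitions, and the placeholder model call below are illustrative, not the paper's actual templates or datasets.

```python
# Category names, definitions, and the placeholder `call_model` function below
# are illustrative, not the paper's actual prompt templates.

BLOCKED = {
    "toxicity": "Content that insults, harasses, or demeans a person or group.",
    "spam": "Unsolicited promotional, repetitive, or scam-like content.",
}

def build_prompt(text, user_examples):
    """Compose an ICL moderation prompt from the user's blocked categories."""
    lines = ["Classify the message against the user's blocked categories."]
    for label, definition in BLOCKED.items():
        lines.append(f"- {label}: {definition}")
        if label in user_examples:  # one user-provided example enables personalization
            lines.append(f"  Example to block: {user_examples[label]!r}")
    lines.append(f'Message: "{text}"')
    lines.append("Answer with the matching labels, or 'none'.")
    return "\n".join(lines)

# A user extends detection to a new category with a single definition,
# with no retraining involved:
BLOCKED["crypto-scam"] = "Messages promising guaranteed cryptocurrency returns."
prompt = build_prompt("Double your BTC in 24 hours!", user_examples={})
# response = call_model(prompt)   # hypothetical foundation-model call
```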


Sybil-Resistant Service Discovery for Agent Economies

Shi, David, Joo, Kevin

arXiv.org Artificial Intelligence

x402 enables Hypertext Transfer Protocol (HTTP) services like application programming interfaces (APIs), data feeds, and inference providers to accept cryptocurrency payments for access. As agents increasingly consume these services, discovery becomes critical: which swap interface should an agent trust? Which data provider is the most reliable? We introduce TraceRank, a reputation-weighted ranking algorithm in which payment transactions serve as endorsements. TraceRank seeds addresses with precomputed reputation metrics and propagates reputation through payment flows, weighted by transaction value and temporal recency. Applied to x402's payment graph, this surfaces services preferred by high-reputation users rather than those with high transaction volume. Our system combines TraceRank with semantic search to answer natural language queries with high-quality results. We argue that reputation propagation resists Sybil attacks by making spam services with many low-reputation payers rank below legitimate services with few high-reputation payers. Ultimately, we aim to construct a search method for x402-enabled services that avoids infrastructure bias and outperforms purely volume-based or purely semantic methods.
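The propagation step described here, reputation flowing from seeded addresses along payment edges weighted by value and recency, resembles a personalized-PageRank iteration. The damping factor, exponential recency decay, and normalization in the sketch below are assumptions for illustration, not the paper's exact formulation.

```python
from collections import defaultdict

def trace_rank(payments, seed_reputation, now, alpha=0.85,
               half_life_days=30.0, iters=20):
    """Illustrative reputation propagation over a payment graph.

    payments        : iterable of (payer, service, value, timestamp_days)
    seed_reputation : dict mapping address -> precomputed reputation seed
    """
    # Edge weight combines transaction value with temporal recency decay.
    out_weights = defaultdict(lambda: defaultdict(float))
    for payer, service, value, t in payments:
        recency = 0.5 ** ((now - t) / half_life_days)
        out_weights[payer][service] += value * recency

    nodes = set(seed_reputation) | set(out_weights)
    nodes |= {s for edges in out_weights.values() for s in edges}
    score = {n: seed_reputation.get(n, 0.0) for n in nodes}

    # Personalized-PageRank-style iteration: reputation flows along payments.
    for _ in range(iters):
        nxt = {n: (1 - alpha) * seed_reputation.get(n, 0.0) for n in nodes}
        for payer, edges in out_weights.items():
            total = sum(edges.values())
            for service, w in edges.items():
                nxt[service] += alpha * score[payer] * (w / total)
        score = nxt
    return score
```

Under this scheme a service paid by many zero-reputation Sybil addresses accumulates almost no score, while a service with a few high-reputation payers ranks highly, which is the Sybil-resistance property the abstract argues for.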



Anomaly Detection in Human Language via Meta-Learning: A Few-Shot Approach

Singla, Saurav, Singla, Aarav, Gupta, Advik, Gupta, Parnika

arXiv.org Artificial Intelligence

We propose a meta-learning framework for detecting anomalies in human language across diverse domains with limited labeled data. Anomalies in language, ranging from spam and fake news to hate speech, pose a major challenge due to their sparsity and variability. We treat anomaly detection as a few-shot binary classification problem and leverage meta-learning to train models that generalize across tasks. Using datasets from domains such as SMS spam, COVID-19 fake news, and hate speech, we evaluate model generalization on unseen tasks with minimal labeled anomalies. Our method combines episodic training with prototypical networks and domain resampling to adapt quickly to new anomaly detection tasks. Empirical results show that our method outperforms strong baselines in F1 and AUC scores. We also release the code and benchmarks to facilitate further research in few-shot text anomaly detection.
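The episodic classification step named in the abstract, prototypical networks over a few-shot support set, can be sketched briefly. The text encoder, episode construction, and distance choice below are illustrative assumptions; only the prototype computation itself follows the standard prototypical-network recipe.

```python
import numpy as np

def prototypical_episode(support_emb, support_labels, query_emb):
    """One few-shot episode with a prototypical classifier (illustrative).

    support_emb    : (n_support, d) embeddings of labeled examples
    support_labels : (n_support,) array with 0 = normal, 1 = anomalous
    query_emb      : (n_query, d) embeddings to classify
    Text embeddings would come from an encoder; that step is omitted here.
    """
    # One prototype per class: the mean of its support embeddings.
    prototypes = np.stack([
        support_emb[support_labels == c].mean(axis=0) for c in (0, 1)
    ])                                                  # (2, d)

    # Squared Euclidean distance from each query to each prototype.
    dists = ((query_emb[:, None, :] - prototypes[None, :, :]) ** 2).sum(-1)

    # Softmax over negative distances gives class probabilities.
    logits = -dists
    probs = np.exp(logits - logits.max(axis=1, keepdims=True))
    probs /= probs.sum(axis=1, keepdims=True)
    return probs[:, 1]          # probability that each query is anomalous
```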


John Oliver on AI slop: 'Some of this stuff is potentially very dangerous'

The Guardian

John Oliver covered the dangers of AI on his weekly HBO show, calling it "worryingly corrosive" for society. On Last Week Tonight, Oliver said that the "spread of AI generation tools has made it very easy to flood social media sites with cheap, professional-looking, often deeply weird content", using the term AI slop to describe it all. He referred to it as the "newest iteration of spam", with weird images and videos flooding people's feeds and some people having "absolutely no idea that it isn't real". Oliver said that it was "extremely likely that we are gonna be drowning in this shit for the foreseeable future". With content such as this, "the whole point is to grab your attention", and given how easy it has become to make it, the barrier to entry has been lowered. Meta has not only joined the game with its own tool but has also tweaked its algorithm, meaning that more than a third of the content in your feed is now from accounts you don't follow.



SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Huang, Tianjin, Zhu, Ziquan, Jin, Gaojie, Liu, Lu, Wang, Zhangyang, Liu, Shiwei

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have demonstrated exceptional performance across diverse tasks, yet their training remains highly resource-intensive and susceptible to critical challenges such as training instability. A predominant source of this instability stems from gradient and loss spikes, which disrupt the learning process, often leading to costly interventions like checkpoint recovery and experiment restarts, further amplifying inefficiencies. This paper presents a comprehensive investigation into gradient spikes observed during LLM training, revealing their prevalence across multiple architectures and datasets. Our analysis shows that these spikes can be up to 1000× larger than typical gradients, substantially deteriorating model performance. To address this issue, we propose Spike-Aware Adam with Momentum Reset (SPAM), a novel optimizer designed to counteract gradient spikes through momentum reset and spike-aware gradient clipping. Extensive experiments, including both pre-training and fine-tuning, demonstrate that SPAM consistently surpasses Adam and its variants across various tasks, including (1) LLM pre-training from 60M to 1B parameters, (2) 4-bit LLM pre-training, (3) reinforcement learning, and (4) time-series forecasting. Additionally, SPAM facilitates memory-efficient training by enabling sparse momentum, where only a subset of momentum terms is maintained and updated. When operating under memory constraints, SPAM outperforms state-of-the-art memory-efficient optimizers such as GaLore and Adam-Mini. Our work underscores the importance of mitigating gradient spikes in LLM training and introduces an effective optimization strategy that enhances both training stability and resource efficiency at scale. Code is available at https://github.com/TianjinYellow/SPAM-Optimizer.git
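A rough sense of the two mechanisms named in the abstract, momentum reset and spike-aware gradient clipping, can be given with a short Adam-style update sketch. The clipping threshold, reset schedule, and their interaction below are assumptions for illustration; the authors' repository linked above is the reference implementation.

```python
import torch

def spam_like_step(param, grad, state, lr=1e-3, betas=(0.9, 0.999), eps=1e-8,
                   spike_factor=50.0, reset_every=500):
    """One Adam-style update with momentum reset and spike-aware clipping.

    `state` holds tensors "m" and "v" (same shape as `param`) and an int "step".
    """
    m, v = state["m"], state["v"]
    step = state["step"] + 1

    # Spike-aware clipping: an entry counts as a spike when its squared value
    # far exceeds the running second moment; it is shrunk back to that scale.
    if step > 1:
        threshold = spike_factor * (v + eps)
        spike = grad.pow(2) > threshold
        grad = torch.where(spike, grad.sign() * threshold.sqrt(), grad)

    # Periodic momentum reset discards spike-contaminated momentum statistics.
    # (A full implementation would also restart bias correction after a reset.)
    if step % reset_every == 0:
        m.zero_()
        v.zero_()

    # Standard Adam moment updates and bias-corrected parameter step.
    m.mul_(betas[0]).add_(grad, alpha=1 - betas[0])
    v.mul_(betas[1]).addcmul_(grad, grad, value=1 - betas[1])
    m_hat = m / (1 - betas[0] ** step)
    v_hat = v / (1 - betas[1] ** step)
    param.data.add_(m_hat / (v_hat.sqrt() + eps), alpha=-lr)

    state["step"] = step
    return param
```

The clipping keeps a single spiked gradient from dominating the update, while the periodic reset prevents a spike that does get through from lingering in the momentum buffers for hundreds of subsequent steps.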


How to Opt Out of A.I. Online

The New Yorker

Last week, like the Jews of Exodus painting blood on their lintels, hundreds of thousands of Instagram users posted a block of text to their accounts hoping to avoid the plague of artificial intelligence online. "Goodbye Meta AI," the message began, referring to Facebook's parent company, and continued, "I do not give Meta or anyone else permission to use any of my personal data, profile information or photos." Friends of mine posted it; artists I follow posted it; Tom Brady posted it. In their eagerness to combat the encroachment of A.I., all of them seemed to overlook the fact that merely sharing a meme would do nothing to change their legal rights vis-à-vis Meta or any other tech platform. It is, in fact, possible to prevent Meta from training its A.I. models on your personal data.