AITopics | Overview

Collaborating Authors

Overview

Explaining the Success of Nearest Neighbor Methods in Prediction

arXiv.org Machine LearningFeb-21-2025

Many modern methods for prediction leverage nearest neighbor search to find past training examples most similar to a test example, an idea that dates back in text to at least the 11th century and has stood the test of time. This monograph aims to explain the success of these methods, both in theory, for which we cover foundational nonasymptotic statistical guarantees on nearest-neighbor-based regression and classification, and in practice, for which we gather prominent methods for approximate nearest neighbor search that have been essential to scaling prediction systems reliant on nearest neighbor analysis to handle massive datasets. Furthermore, we discuss connections to learning distances for use with nearest neighbor methods, including how random decision trees and ensemble methods learn nearest neighbor structure, as well as recent developments in crowdsourcing and graphons. In terms of theory, our focus is on nonasymptotic statistical guarantees, which we state in the form of how many training data and what algorithm parameters ensure that a nearest neighbor prediction method achieves a user-specified error tolerance. We begin with the most general of such results for nearest neighbor and related kernel regression and classification in general metric spaces. In such settings in which we assume very little structure, what enables successful prediction is smoothness in the function being estimated for regression, and a low probability of landing near the decision boundary for classification. In practice, these conditions could be difficult to verify for a real dataset. We then cover recent guarantees on nearest neighbor prediction in the three case studies of time series forecasting, recommending products to people over time, and delineating human organs in medical images by looking at image patches. In these case studies, clustering structure enables successful prediction.

diagonal sub-gaussian mixture model, kernel time sery classifier, theoretical guarantee, (15 more...)

arXiv.org Machine Learning

2502.159

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)

Genre:

Overview (1.00)
Research Report (0.81)

Industry:

Leisure & Entertainment (1.00)
Education (0.92)
Health & Medicine > Diagnostic Medicine > Imaging (0.87)
(3 more...)

Add feedback

Natural Language Generation

Reiter, Ehud

arXiv.org Artificial IntelligenceFeb-20-2025

This book provides a broad overview of Natural Language Generation (NLG), including technology, user requirements, evaluation, and real-world applications. The focus is on concepts and insights which hopefully will remain relevant for many years, not on the latest LLM innovations. It draws on decades of work by the author and others on NLG. The book has the following chapters: Introduction to NLG; Rule-Based NLG; Machine Learning and Neural NLG; Requirements; Evaluation; Safety, Maintenance, and Testing; and Applications. All chapters include examples and anecdotes from the author's personal experiences, and end with a Further Reading section. The book should be especially useful to people working on applied NLG, including NLG researchers, people in other fields who want to use NLG, and commercial developers. It will not however be useful to people who want to understand the latest LLM technology. There is a companion site with more information at https://ehudreiter.com/book/

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-031-68582-8

2502.14437

Country:

North America > United States > California (0.45)
Europe > United Kingdom > England (0.45)
Europe > Spain (0.28)
(20 more...)

Genre:

Summary/Review (1.00)
Research Report > Strength High (1.00)
Research Report > New Finding (1.00)
(5 more...)

Industry:

Transportation > Air (1.00)
Media > News (1.00)
Leisure & Entertainment > Sports > Basketball (1.00)
(15 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Binary and Multi-Class Intrusion Detection in IoT Using Standalone and Hybrid Machine and Deep Learning Models

Akif, Md Ahnaf

arXiv.org Artificial IntelligenceFeb-20-2025

Maintaining security in IoT systems depends on intrusion detection since these networks' sensitivity to cyber-attacks is growing. Based on the IoT23 dataset, this study explores the use of several Machine Learning (ML) and Deep Learning (DL) along with the hybrid models for binary and multi-class intrusion detection. The standalone machine and deep learning models like Random Forest (RF), Extreme Gradient Boosting (XGBoost), Artificial Neural Network (ANN), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Convolutional Neural Network (CNN) were used. Furthermore, two hybrid models were created by combining machine learning techniques: RF, XGBoost, AdaBoost, KNN, and SVM and these hybrid models were voting based hybrid classifier. Where one is for binary, and the other one is for multi-class classification. These models vi were tested using precision, recall, accuracy, and F1-score criteria and compared the performance of each model. This work thoroughly explains how hybrid, standalone ML and DL techniques could improve IDS (Intrusion Detection System) in terms of accuracy and scalability in IoT (Internet of Things).

artificial intelligence, classification, machine learning, (12 more...)

arXiv.org Artificial Intelligence

2503.22684

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre:

Overview (0.93)
Research Report > New Finding (0.92)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

An LLM-Based Approach for Insight Generation in Data Analysis

Pérez, Alberto Sánchez, Boukhary, Alaa, Papotti, Paolo, Lozano, Luis Castejón, Elwood, Adam

arXiv.org Artificial IntelligenceFeb-20-2025

Generating insightful and actionable information from databases is critical in data analysis. This paper introduces a novel approach using Large Language Models (LLMs) to automatically generate textual insights. Given a multi-table database as input, our method leverages LLMs to produce concise, text-based insights that reflect interesting patterns in the tables. Our framework includes a Hypothesis Generator to formulate domain-relevant questions, a Query Agent to answer such questions by generating SQL queries against a database, and a Summarization module to verbalize the insights. The insights are evaluated for both correctness and subjective insightfulness using a hybrid model of human judgment and automated metrics. Experimental results on public and enterprise databases demonstrate that our approach generates more insightful insights than other approaches while maintaining correctness.

database, information, insightfulness, (15 more...)

arXiv.org Artificial Intelligence

2503.11664

Country:

Europe > Spain (0.04)
North America > United States > California > Los Angeles County (0.04)
Europe > Italy (0.04)
(4 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.34)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Education (1.00)
Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Balancing Innovation and Integrity: AI Integration in Liberal Arts College Administration

Read, Ian Olivo

arXiv.org Artificial IntelligenceFeb-20-2025

This paper explores the intersection of artificial intelligence and higher education administration, focusing on liberal arts colleges (LACs). It examines AI's opportunities and challenges in academic and student affairs, legal compliance, and accreditation processes, while also addressing the ethical considerations of AI deployment in mission-driven institutions. Considering AI's value pluralism and potential allocative or representational harms caused by algorithmic bias, LACs must ensure AI aligns with its mission and principles. The study highlights other strategies for responsible AI integration, balancing innovation with institutional values.

art college, artificial intelligence, student, (14 more...)

arXiv.org Artificial Intelligence

2503.05747

Country:

Asia > India (0.14)
North America > United States > Michigan (0.04)
Asia > China (0.04)
(6 more...)

Genre:

Instructional Material (0.67)
Overview (0.67)
Research Report (0.64)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Education > Educational Setting > Higher Education (1.00)
(3 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(4 more...)

Add feedback

Methods and Trends in Detecting Generated Images: A Comprehensive Review

Mahara, Arpan, Rishe, Naphtali

arXiv.org Artificial IntelligenceFeb-20-2025

The proliferation of generative models, such as Generative Adversarial Networks (GANs), Diffusion Models, and Variational Autoencoders (VAEs), has enabled the synthesis of high-quality multimedia data. However, these advancements have also raised significant concerns regarding adversarial attacks, unethical usage, and societal harm. Recognizing these challenges, researchers have increasingly focused on developing methodologies to detect synthesized data effectively, aiming to mitigate potential risks. Prior reviews have primarily focused on deepfake detection and often lack coverage of recent advancements in synthetic image detection, particularly methods leveraging multimodal frameworks for improved forensic analysis. To address this gap, the present survey provides a comprehensive review of state-of-the-art methods for detecting and classifying synthetic images generated by advanced generative AI models. This review systematically examines core detection methodologies, identifies commonalities among approaches, and categorizes them into meaningful taxonomies. Furthermore, given the crucial role of large-scale datasets in this field, we present an overview of publicly available datasets that facilitate further research and benchmarking in synthetic data detection.

dataset, detection, synthetic image, (14 more...)

arXiv.org Artificial Intelligence

2502.15176

Country:

Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Overview (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.67)

Add feedback

Leveraging ChatGPT for Sponsored Ad Detection and Keyword Extraction in YouTube Videos

Kok-Shun, Brice Valentin, Chan, Johnny

arXiv.org Artificial IntelligenceFeb-20-2025

Brice Valentin Kok - Shun Department of Information Systems and Operations Management University of Auckland Auckland, New Zealand 0000 - 0001 - 9923 - 5042 Johnny Chan Department of Information Systems and Operations Management University of Auckland Auckland, New Zealand 0000 - 0002 - 3535 - 4533 Abstract -- This work - in - progress paper presents a novel approach to detecting sponsored advertisement segments in YouTube videos and comparing the advertisement with the main content. Our methodology involves the collect ion of 421 auto - generated and manual transcripts which are then fed into a prompt - engineered GPT - 4o for ad detection, a KeyBERT for keyword extraction, and another iteration of ChatGPT for ca tegory identification . The results revealed a significant prevalence of product - related ads across vari ous educational topics, with ad categories refined using GPT - 4 o into succinct 9 content and 4 advertisement categories . This approach provides a scalable and efficient alternative to traditional ad detection methods while offering new insights into the types and relevance of ads embedded within educational content. T his study highlights the potential of LLMs in transforming ad detection processes and improving our understanding of ad vertisement strategies in digital media. In recent years, video - sharing platforms like YouTube have become dominant sources of entertainment, education, and information [1] . YouTube is invaluable for content creators, marketers, and advertisers. One of the key features of YouTube's revenue model is the integration of sponsored advertisement (ad) segments, which allows content creators to monetize their videos while providing advertisers a direct route to target specific audiences [2] .

ad segment, category, transcript, (12 more...)

arXiv.org Artificial Intelligence

2502.15102

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.85)
North America > United States > New York > New York County > New York City (0.04)
North America > Canada > British Columbia > Vancouver Island > Capital Regional District > Victoria (0.04)
(6 more...)

Genre:

Research Report (0.84)
Overview > Innovation (0.34)

Industry: Marketing (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fundamental Survey on Neuromorphic Based Audio Classification

Basu, Amlan, Chaudhari, Pranav, Di Caterina, Gaetano

arXiv.org Artificial IntelligenceFeb-20-2025

Audio classification is paramount in a variety of applications including surveillance, healthcare monitoring, and environmental analysis. Traditional methods frequently depend on intricate signal processing algorithms and manually crafted features, which may fall short in fully capturing the complexities of audio patterns. Neuromorphic computing, inspired by the architecture and functioning of the human brain, presents a promising alternative for audio classification tasks. This survey provides an exhaustive examination of the current state-of-the-art in neuromorphic-based audio classification. It delves into the crucial components of neuromorphic systems, such as Spiking Neural Networks (SNNs), memristors, and neuromorphic hardware platforms, highlighting their advantages in audio classification. Furthermore, the survey explores various methodologies and strategies employed in neuromorphic audio classification, including event-based processing, spike-based learning, and bio-inspired feature extraction. It examines how these approaches address the limitations of traditional audio classification methods, particularly in terms of energy efficiency, real-time processing, and robustness to environmental noise. Additionally, the paper conducts a comparative analysis of different neuromorphic audio classification models and benchmarks, evaluating their performance metrics, computational efficiency, and scalability. By providing a comprehensive guide for researchers, engineers and practitioners, this survey aims to stimulate further innovation and advancements in the evolving field of neuromorphic audio classification.

application, classification, neural network, (15 more...)

arXiv.org Artificial Intelligence

2502.15056

Country:

North America > United States > Texas (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Portugal > Coimbra > Coimbra (0.04)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Information Technology (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.93)

Add feedback

GiGL: Large-Scale Graph Neural Networks at Snapchat

Zhao, Tong, Liu, Yozen, Kolodner, Matthew, Montemayor, Kyle, Ghazizadeh, Elham, Batra, Ankit, Fan, Zihao, Gao, Xiaobin, Guo, Xuan, Ren, Jiwen, Park, Serim, Yu, Peicheng, Yu, Jun, Vij, Shubham, Shah, Neil

arXiv.org Artificial IntelligenceFeb-20-2025

Recent advances in graph machine learning (ML) with the introduction of Graph Neural Networks (GNNs) have led to a widespread interest in applying these approaches to business applications at scale. GNNs enable differentiable end-to-end (E2E) learning of model parameters given graph structure which enables optimization towards popular node, edge (link) and graph-level tasks. While the research innovation in new GNN layers and training strategies has been rapid, industrial adoption and utility of GNNs has lagged considerably due to the unique scale challenges that large-scale graph ML problems create. In this work, we share our approach to training, inference, and utilization of GNNs at Snapchat. To this end, we present GiGL (Gigantic Graph Learning), an open-source library to enable large-scale distributed graph ML to the benefit of researchers, ML engineers, and practitioners. We use GiGL internally at Snapchat to manage the heavy lifting of GNN workflows, including graph data preprocessing from relational DBs, subgraph sampling, distributed training, inference, and orchestration. GiGL is designed to interface cleanly with open-source GNN modeling libraries prominent in academia like PyTorch Geometric (PyG), while handling scaling and productionization challenges that make it easier for internal practitioners to focus on modeling. GiGL is used in multiple production settings, and has powered over 35 launches across multiple business domains in the last 2 years in the contexts of friend recommendation, content recommendation and advertising. This work details high-level design and tools the library provides, scaling properties, case studies in diverse business settings with industry-scale graphs, and several key lessons learned in employing graph ML at scale on large social data. GiGL is open-sourced at https://github.com/snap-research/GiGL.

gigl, graph, proceedings, (13 more...)

arXiv.org Artificial Intelligence

2502.15054

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Research Report (0.50)
Overview (0.46)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

P2W: From Power Traces to Weights Matrix -- An Unconventional Transfer Learning Approach

Siyadatzadeh, Roozbeh, Mehrafrooz, Fatemeh, Mentens, Nele, Stefanov, Todor

arXiv.org Artificial IntelligenceFeb-20-2025

The rapid growth of deploying machine learning (ML) models within embedded systems on a chip (SoCs) has led to transformative shifts in fields like healthcare and autonomous vehicles. One of the primary challenges for training such embedded ML models is the lack of publicly available high-quality training data. Transfer learning approaches address this challenge by utilizing the knowledge encapsulated in an existing ML model as a starting point for training a new ML model. However, existing transfer learning approaches require direct access to the existing model which is not always feasible, especially for ML models deployed on embedded SoCs. Therefore, in this paper, we introduce a novel unconventional transfer learning approach to train a new ML model by extracting and using weights from an existing ML model running on an embedded SoC without having access to the model within the SoC. Our approach captures power consumption measurements from the SoC while it is executing the ML model and translates them to an approximated weights matrix used to initialize the new ML model. This improves the learning efficiency and predictive performance of the new model, especially in scenarios with limited data available to train the model. Our novel approach can effectively increase the accuracy of the new ML model up to 3 times compared to classical training methods using the same amount of limited training data.

accuracy, ml model, new ml model, (15 more...)

arXiv.org Artificial Intelligence

2502.14968

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
(2 more...)

Genre:

Overview > Innovation (0.48)
Research Report > New Finding (0.46)

Industry:

Information Technology > Security & Privacy (0.93)
Health & Medicine > Therapeutic Area (0.69)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Add feedback