AITopics | lad

lad

Personalized Query Auto-Completion for Long and Short-Term Interests with Adaptive Detoxification Generation

Wang, Zhibo, Jiang, Xiaoze, Qin, Zhiheng, Yu, Enyun, Li, Han

arXiv.org Artificial Intelligence, May-28-2025

Query auto-completion (QAC) plays a crucial role in modern search systems. However, in real-world applications, there are two pressing challenges that still need to be addressed. First, there is a need for hierarchical personalized representations for users. Previous approaches have typically used users' search behavior as a single, overall representation, which proves inadequate in more nuanced generative scenarios. Additionally, query prefixes are typically short and may contain typos or sensitive information, increasing the likelihood of generating toxic content compared to traditional text generation tasks. Such toxic content can degrade user experience and lead to public relations issues. Therefore, the second critical challenge is detoxifying QAC systems. To address these two limitations, we propose a novel model (LaD) that captures personalized information from both long-term and short-term interests, incorporating adaptive detoxification. In LaD, personalized information is captured hierarchically at both coarse-grained and fine-grained levels. This approach preserves as much personalized information as possible while enabling online generation within time constraints. To go a step further, we propose an online training method based on Reject Preference Optimization (RPO). By incorporating a special token [Reject] during both the training and inference processes, the model achieves adaptive detoxification. Consequently, the generated text presented to users is both non-toxic and relevant to the given prefix. We conduct comprehensive experiments on industrial-scale datasets and perform online A/B tests, delivering the largest single-experiment metric improvement for our product in nearly two years. Our model has been deployed on Kuaishou search, driving the primary traffic for hundreds of millions of active users. The code is available at https://github.com/JXZe/LaD.

information retrieval, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.20966

Country:

North America > Canada (0.16)
Asia > China (0.14)

Genre: Research Report > Experimental Study (0.46)

Industry: Education > Educational Setting > Online (0.54)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)

Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection

Lim, Ying Fu, Zhu, Jiawen, Pang, Guansong

arXiv.org Artificial Intelligence, Mar-11-2025

Log Anomaly Detection (LAD) seeks to identify atypical patterns in log data that are crucial to assessing the security and condition of systems. Although Large Language Models (LLMs) have shown tremendous success in various fields, the use of LLMs in enabling the detection of log anomalies is largely unexplored. This work aims to fill this gap. Due to the prohibitive costs involved in fully fine-tuning LLMs, we explore the use of parameter-efficient fine-tuning techniques (PEFTs) for adapting LLMs to LAD. To have an in-depth exploration of the potential of LLM-driven LAD, we present a comprehensive investigation of leveraging two of the most popular PEFTs -- Low-Rank Adaptation (LoRA) and Representation Fine-tuning (ReFT) -- to tap into three prominent LLMs of varying size, including RoBERTa, GPT-2, and Llama-3, for parameter-efficient LAD. Comprehensive experiments on four public log datasets are performed to reveal important insights into effective LLM-driven LAD in several key perspectives, including the efficacy of these PEFT-based LLM-driven LAD methods, their stability, sample efficiency, robustness w.r.t. unstable logs, and cross-dataset generalization. Code is available at https://github.com/mala-lab/LogADReft.
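The abstract names Low-Rank Adaptation (LoRA) without describing it. As a rough illustration only, the sketch below shows the standard LoRA idea of keeping a frozen weight matrix W and learning a low-rank update, so the effective weight is W + (alpha / r) · B · A. The matrix sizes, the function names, and the toy values are assumptions chosen for illustration; this is not the code from the linked repository.

```python
from typing import List

Matrix = List[List[float]]


def full_param_count(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole d_out x d_in weight is fine-tuned."""
    return d_out * d_in


def lora_param_count(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for the low-rank factors B (d_out x r) and A (r x d_in)."""
    return rank * (d_out + d_in)


def lora_effective_weight(w: Matrix, b: Matrix, a: Matrix, alpha: float, rank: int) -> Matrix:
    """Return W + (alpha / rank) * (B @ A); W stays frozen, only B and A are trained."""
    scale = alpha / rank
    rows, cols = len(w), len(w[0])
    return [
        [
            w[i][j] + scale * sum(b[i][k] * a[k][j] for k in range(rank))
            for j in range(cols)
        ]
        for i in range(rows)
    ]


# Hypothetical toy example: a 2x2 frozen weight with a rank-1 update.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]
A = [[0.5, 0.5]]
print(lora_effective_weight(W, B, A, alpha=2.0, rank=1))
print(full_param_count(768, 768), lora_param_count(768, 768, rank=8))
```

The parameter counts show why the approach is called parameter-efficient: for a hypothetical 768 x 768 layer, a rank-8 update trains 12,288 values instead of 589,824.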

anomaly detection, dataset, llm, (13 more...)

arXiv.org Artificial Intelligence

2503.08045

Country:

Asia > Singapore (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

LADs: Leveraging LLMs for AI-Driven DevOps

Khan, Ahmad Faraz, Khan, Azal Ahmad, Mohamed, Anas, Ali, Haider, Moolinti, Suchithra, Haroon, Sabaat, Tahir, Usman, Fazzini, Mattia, Butt, Ali R., Anwar, Ali

arXiv.org Artificial Intelligence, Feb-28-2025

Automating cloud configuration and deployment remains a critical challenge due to evolving infrastructures, heterogeneous hardware, and fluctuating workloads. Existing solutions lack adaptability and require extensive manual tuning, leading to inefficiencies and misconfigurations. We introduce LADs, the first LLM-driven framework designed to tackle these challenges by ensuring robustness, adaptability, and efficiency in automated cloud management. Instead of merely applying existing techniques, LADs provides a principled approach to configuration optimization through in-depth analysis of what optimization works under which conditions. By leveraging Retrieval-Augmented Generation, Few-Shot Learning, Chain-of-Thought, and Feedback-Based Prompt Chaining, LADs generates accurate configurations and learns from deployment failures to iteratively refine system settings. Our findings reveal key insights into the trade-offs between performance, cost, and scalability, helping practitioners determine the right strategies for different deployment scenarios. For instance, we demonstrate how prompt chaining-based adaptive feedback loops enhance fault tolerance in multi-tenant environments and how structured log analysis with example shots improves configuration accuracy. Through extensive evaluations, LADs reduces manual effort, optimizes resource utilization, and improves system reliability. By open-sourcing LADs, we aim to drive further innovation in AI-powered DevOps automation.

configuration, user intent, validation, (16 more...)

arXiv.org Artificial Intelligence

2502.20825

Country:

North America > United States > Virginia (0.04)
North America > United States > Minnesota (0.04)
North America > United States > Illinois (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data

Maitra, Ishaan, Lin, Raymond, Chen, Eric, Donnelly, Jon, Šćepanović, Sanja, Rudin, Cynthia

arXiv.org Artificial Intelligence, Jan-3-2025

Health outcomes depend on complex environmental and sociodemographic factors whose effects change over location and time. Only recently has fine-grained spatial and temporal data become available to study these effects, namely the MEDSAT dataset of English health, environmental, and sociodemographic information. Leveraging this new resource, we use a variety of variable importance techniques to robustly identify the most informative predictors across multiple health outcomes. We then develop an interpretable machine learning framework based on Generalized Additive Models (GAMs) and Multiscale Geographically Weighted Regression (MGWR) to analyze both local and global spatial dependencies of each variable on various health outcomes. Our findings identify NO2 as a global predictor for asthma, hypertension, and anxiety, alongside other outcome-specific predictors related to occupation, marriage, and vegetation. Regional analyses reveal local variations with air pollution and solar radiation, with notable shifts during COVID. This comprehensive approach provides actionable insights for addressing health disparities, and advocates for the integration of interpretable machine learning in public health.

artificial intelligence, machine learning, predictor, (17 more...)

arXiv.org Artificial Intelligence

2501.02111

Country:

North America > United States > North Carolina (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Consumer Health (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.94)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.90)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Landmark Alternating Diffusion

Yeh, Sing-Yuan, Wu, Hau-Tieng, Talmon, Ronen, Tsui, Mao-Pei

arXiv.org Machine Learning, Apr-29-2024

Alternating Diffusion (AD) is a commonly applied diffusion-based sensor fusion algorithm. While it has been successfully applied to various problems, its computational burden remains a limitation. Inspired by the landmark diffusion idea considered in the Robust and Scalable Embedding via Landmark Diffusion (ROSELAND), we propose a variation of AD, called Landmark AD (LAD), which captures the essence of AD while offering superior computational efficiency. We provide a series of theoretical analyses of LAD under the manifold setup and apply it to the automatic sleep stage annotation problem with two electroencephalogram channels to demonstrate its application.

lan, landmark, landmark distribution, (16 more...)

arXiv.org Machine Learning

2404.19649

Country:

Asia > Taiwan > Taiwan Province > Taipei (0.04)
Oceania > Australia > Queensland > Brisbane (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)

Genre:

Research Report > Experimental Study (0.46)
Research Report > New Finding (0.45)

Industry: Health & Medicine > Therapeutic Area (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Data Science > Data Integration (0.86)

On Learning with LAD

Jothishwaran, C. A., Srivastava, Biplav, Singla, Jitin, Gangopadhyay, Sugata

arXiv.org Artificial Intelligence, Sep-28-2023

The logical analysis of data, LAD, is a technique that yields two-class classifiers based on Boolean functions having disjunctive normal form (DNF) representation. Although LAD algorithms employ optimization techniques, the resulting binary classifiers or binary rules do not lead to overfitting. We propose a theoretical justification for the absence of overfitting by estimating the Vapnik-Chervonenkis dimension (VC dimension) for LAD models where hypothesis sets consist of DNFs with a small number of cubic monomials. We illustrate and confirm our observations empirically.
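To make the hypothesis class concrete, here is a minimal sketch of a two-class classifier expressed as a DNF over Boolean features: an observation is labelled positive when at least one monomial (a conjunction of literals) is satisfied. The feature names and the rule set are hypothetical, and "cubic" is read here as "three literals per monomial"; how LAD algorithms actually select the monomials is not shown.

```python
from typing import Dict, List, Tuple

# A literal is (feature_name, required_value); a monomial is a conjunction of literals.
Literal = Tuple[str, bool]
Monomial = List[Literal]


def monomial_covers(monomial: Monomial, x: Dict[str, bool]) -> bool:
    """True when every literal in the conjunction agrees with the observation."""
    return all(x[name] == value for name, value in monomial)


def dnf_classify(dnf: List[Monomial], x: Dict[str, bool]) -> bool:
    """A DNF is a disjunction of monomials: positive if any monomial covers x."""
    return any(monomial_covers(m, x) for m in dnf)


# Hypothetical rule set with two cubic (three-literal) monomials:
# (a AND NOT b AND c) OR (b AND c AND NOT d)
RULES: List[Monomial] = [
    [("a", True), ("b", False), ("c", True)],
    [("b", True), ("c", True), ("d", False)],
]

print(dnf_classify(RULES, {"a": True, "b": False, "c": True, "d": True}))
print(dnf_classify(RULES, {"a": False, "b": False, "c": True, "d": False}))
```

The first observation satisfies the first monomial and is classified positive; the second satisfies neither and is classified negative.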

lad, learning

arXiv.org Artificial Intelligence

2309.1663

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.53)

RoLA: A Real-Time Online Lightweight Anomaly Detection System for Multivariate Time Series

Lee, Ming-Chang, Lin, Jia-Chun

arXiv.org Artificial Intelligence, May-25-2023

A multivariate time series refers to observations of two or more variables taken from a device or a system simultaneously over time. There is an increasing need to monitor multivariate time series and detect anomalies in real time to ensure proper system operation and good service quality. It is also highly desirable to have a lightweight anomaly detection system that considers correlations between different variables, adapts to changes in the pattern of the multivariate time series, offers immediate responses, and provides supportive information regarding detection results based on unsupervised learning and online model training. In the past decade, many multivariate time series anomaly detection approaches have been introduced. However, they are unable to offer all the above-mentioned features. In this paper, we propose RoLA, a real-time online lightweight anomaly detection system for multivariate time series based on a divide-and-conquer strategy, parallel processing, and the majority rule. RoLA employs multiple lightweight anomaly detectors to monitor multivariate time series in parallel, determine the correlations between variables dynamically on the fly, and then jointly detect anomalies based on the majority rule in real time. To demonstrate the performance of RoLA, we conducted an experiment based on a public dataset provided by the FerryBox of the One Ocean Expedition. The results show that RoLA provides satisfactory detection accuracy and lightweight performance.
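The majority-rule step can be sketched as follows. This is a simplified illustration, not RoLA itself: the abstract does not specify the per-variable detectors or how correlations are computed, so a plain "more than k standard deviations from the historical mean" check stands in for each lightweight detector, and every variable votes. All names and sample values are hypothetical.

```python
import statistics
from typing import Dict, List


def deviates(history: List[float], value: float, k: float = 3.0) -> bool:
    """Stand-in detector: flag a value more than k standard deviations from the mean."""
    mu = statistics.mean(history)
    sigma = statistics.pstdev(history)
    if sigma == 0.0:
        return value != mu
    return abs(value - mu) > k * sigma


def majority_vote(flags: List[bool]) -> bool:
    """Report an anomaly only when more than half of the detectors flag it."""
    return sum(flags) * 2 > len(flags)


def joint_anomaly(histories: Dict[str, List[float]], point: Dict[str, float]) -> bool:
    """Run one detector per variable, then combine the flags by majority rule."""
    flags = [deviates(histories[name], point[name]) for name in histories]
    return majority_vote(flags)


# Hypothetical sensor histories and one new multivariate observation.
HISTORIES = {
    "temperature": [10.0, 11.0, 9.0, 10.0],
    "salinity": [35.0, 35.1, 34.9, 35.0],
    "oxygen": [8.0, 8.2, 7.8, 8.0],
}
print(joint_anomaly(HISTORIES, {"temperature": 30.0, "salinity": 36.0, "oxygen": 8.1}))
print(joint_anomaly(HISTORIES, {"temperature": 30.0, "salinity": 35.0, "oxygen": 8.1}))
```

In the first call two of the three detectors fire, so the point is reported as anomalous; in the second only one fires, so it is not.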

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2305.16509

Country: Europe > Norway > Western Norway > Vestland > Bergen (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Efficient and Low Overhead Website Fingerprinting Attacks and Defenses based on TCP/IP Traffic

Huang, Guodong, Ma, Chuan, Ding, Ming, Qian, Yuwen, Ge, Chunpeng, Fang, Liming, Liu, Zhe

arXiv.org Artificial Intelligence, Feb-27-2023

Website fingerprinting is an extensively studied attack technique that analyzes a web browser's traffic patterns to infer confidential information about users. Several website fingerprinting attacks based on machine learning and deep learning tend to use the most typical features to achieve satisfactory attack performance. However, these attacks depend on several practical implementation factors, such as a skillful pre-processing step or a clean dataset. To defend against such attacks, random packet defense (RPD) is usually applied, at the high cost of excessive network overhead. In this work, we first propose a practical filter-assisted attack against RPD, which can filter out the injected noise using the statistical characteristics of TCP/IP traffic. Then, we propose a list-assisted defensive mechanism to defend against the proposed attack method. To achieve a configurable trade-off between the defense and the network overhead, we further improve the list-based defense with a traffic splitting mechanism, which can combat the mentioned attacks as well as save a considerable amount of network overhead. In the experiments, we collect real-life traffic patterns using three mainstream browsers, i.e., Microsoft Edge, Google Chrome, and Mozilla Firefox, and extensive results on the closed- and open-world datasets show the effectiveness of the proposed algorithms in terms of defense accuracy and network efficiency.

artificial intelligence, machine learning, packet, (16 more...)

arXiv.org Artificial Intelligence

2302.13763

Country:

North America > United States > Texas > Travis County > Austin (0.05)
Asia > China > Jiangsu Province > Nanjing (0.05)
North America > United States > New York > New York County > New York City (0.04)
(8 more...)

Genre: Research Report (0.82)

Industry:

Information Technology > Security & Privacy (1.00)
Information Technology > Software (0.88)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.93)
(3 more...)

LAD: Language Models as Data for Zero-Shot Dialog

Mehri, Shikib, Altun, Yasemin, Eskenazi, Maxine

arXiv.org Artificial Intelligence, Jul-28-2022

… dialog remains elusive. A likely reason for this discrepancy is that dialog models require significant data because they need to learn task-specific structural constraints, such as the domain ontology and the dialog policy. While large language models (e.g., GPT-3) exhibit strong language understanding and generation abilities (Brown et al., 2020), they have no a priori knowledge of the structural constraints implied by a specific (unseen) problem setting (e.g., relevant intents, dialog policy, etc.). As such, in order to adapt a pre-trained LM for task-oriented dialog, it is necessary to impose structural constraints on the unstructured …

However, fine-tuning can be impractical (e.g., in academic settings) with large LMs (e.g., GPT-3) due to the cost, computational power and immutable architectures. To this end, this paper aims to address the following: 'How can we leverage the strong language understanding and generation abilities of large LMs to facilitate zero-shot generalization in task-oriented dialog?' Given the in-context meta-learning abilities of large LMs (Brown et al., 2020), prior work has explored prompt-engineering or prompt-tuning (Reynolds and McDonell, 2021; Lester et al., 2021; Madotto et al., 2021). Well-designed prompts can convey the necessary structural constraints.

dialog, prediction, structural constraint, (14 more...)

arXiv.org Artificial Intelligence

2207.14393

Country:

North America > United States > New York (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.82)

Industry: Consumer Products & Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Analysis of an adaptive lead weighted ResNet for multiclass classification of 12-lead ECGs

Zhao, Zhibin, Murphy, Darcy, Gifford, Hugh, Williams, Stefan, Darlington, Annie, Relton, Samuel D., Fang, Hui, Wong, David C.

arXiv.org Artificial Intelligence, Dec-1-2021

Background: Twelve lead ECGs are a core diagnostic tool for cardiovascular diseases. Here, we describe and analyse an ensemble deep neural network architecture to classify 24 cardiac abnormalities from 12-lead ECGs. Method: We proposed a squeeze and excite ResNet to automatically learn deep features from 12-lead ECGs, in order to identify 24 cardiac conditions. The deep features were augmented with age and gender features in the final fully connected layers. Output thresholds for each class were set using a constrained grid search. To determine why the model made incorrect predictions, two expert clinicians independently interpreted a random set of 100 misclassified ECGs concerning Left Axis Deviation. Results: Using the bespoke weighted accuracy metric, we achieved a 5-fold cross validation score of 0.684, and sensitivity and specificity of 0.758 and 0.969, respectively. We scored 0.520 on the full test data, and ranked 2nd out of 41 in the official challenge rankings. On a random set of misclassified ECGs, agreement between two clinicians and training labels was poor (clinician 1: kappa = -0.057, clinician 2: kappa = -0.159). In contrast, agreement between the clinicians was very high (kappa = 0.92). Discussion: The proposed prediction model performed well on the validation and hidden test data in comparison to models trained on the same data. We also discovered considerable inconsistency in training labels, which is likely to hinder development of more accurate models.
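For readers unfamiliar with the agreement statistic, the sketch below computes Cohen's kappa for two raters, assuming that is the kappa reported (the abstract does not say which variant was used). Kappa compares observed agreement with the agreement expected by chance, so values near 1 indicate strong agreement and values at or below 0 indicate agreement no better than chance. The label lists are made-up toy data, not the study's annotations.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's kappa: (p_observed - p_expected) / (1 - p_expected)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Fraction of items on which the two raters give the same label.
    observed = sum(x == y for x, y in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[label] * counts_b[label] for label in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


# Toy labels (1 = Left Axis Deviation present, 0 = absent); purely illustrative.
print(cohens_kappa([1, 1, 0, 0], [1, 1, 0, 0]))  # perfect agreement
print(cohens_kappa([1, 1, 0, 0], [1, 0, 1, 0]))  # chance-level agreement
print(cohens_kappa([1, 1, 0, 0], [0, 0, 1, 1]))  # systematic disagreement
```

Negative values, like those reported between each clinician and the training labels, mean the two sources agreed less often than chance alone would predict.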

adaptive lead weighted resnet, classification, multiclass classification, (14 more...)

arXiv.org Artificial Intelligence

2112.01496

Country:

Europe > United Kingdom > England > Leicestershire > Loughborough (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)
