AITopics | dga

Collaborating Authors

dga

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

fc03d48253286a798f5116ec00e99b2b-Paper.pdf

Neural Information Processing SystemsFeb-12-2026, 00:49:30 GMT

fedavg, gradient, latency, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Industry: Information Technology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

a935ba2236c6ba0fb620f23354e789ff-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 06:07:02 GMT

attribution, attribution method, qualitative comparison, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning

Neural Information Processing SystemsDec-25-2025, 07:45:26 GMT

Federated Learning is an emerging direction in distributed machine learning that en-ables jointly training a model without sharing the data. Since the data is distributed across many edge devices through wireless / long-distance connections, federated learning suffers from inevitable high communication latency. However, the latency issues are undermined in the current literature [15] and existing approaches suchas FedAvg [27] become less efficient when the latency increases. To over comethe problem, we propose \textbf{D}elayed \textbf{G}radient \textbf{A}veraging (DGA), which delays the averaging step to improve efficiency and allows local computation in parallel tocommunication. We theoretically prove that DGA attains a similar convergence rate as FedAvg, and empirically show that our algorithm can tolerate high network latency without compromising accuracy. Specifically, we benchmark the training speed on various vision (CIFAR, ImageNet) and language tasks (Shakespeare),with both IID and non-IID partitions, and show DGA can bring 2.55$\times$ to 4.07$\times$ speedup. Moreover, we built a 16-node Raspberry Pi cluster and show that DGA can consistently speed up real-world federated learning applications.

communication latency, delayed gradient averaging, federated learning, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Command & Control (C2) Traffic Detection Via Algorithm Generated Domain (Dga) Classification Using Deep Learning And Natural Language Processing

Felix, Maria Milena Araujo

arXiv.org Artificial IntelligenceDec-10-2025

Abstract: The sophistication of modern malware, specifically regarding communication with Command and Control (C2) servers, has rendered static blacklist - based defenses obsolete. The use of Domain Generation Algorithms (DGA) allows attackers to generate thousands of dynamic addresses daily, hindering blocking by traditional firewalls. This paper aims to propose and evaluate a method for detecting DGA domains using Deep Learning and Natural Language Processing (NLP) techniques. The methodology consisted of collecting a hybrid database containing 50,000 legitimate and 50,000 malicious domains, followed by the extraction of lexical features and the training of a Recurrent Neural Network (LSTM). Results demonstrated that while statistical entropy analysis is effective for simple DGAs, the Neural Network approach presents superiority in detecting complex patterns, reaching 97.2% accuracy and reducing the false positive rate in ambiguous lawful traffic scenarios.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2512.07866

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DIDS: Domain Impact-aware Data Sampling for Large Language Model Training

Shi, Weijie, Zhang, Jipeng, Wu, Yaguang, Fang, Jingzhi, Zhang, Ruiyuan, Xu, Jiajie, Zhu, Jia, Chen, Hao, Zhao, Yao, Han, Sirui, Zhou, Xiaofang

arXiv.org Artificial IntelligenceAug-25-2025

Large language models (LLMs) are commonly trained on multi-domain datasets, where domain sampling strategies significantly impact model performance due to varying domain importance across downstream tasks. Existing approaches for optimizing domain-level sampling strategies struggle with maintaining intra-domain consistency and accurately measuring domain impact. In this paper, we present Domain Impact-aware Data Sampling (DIDS). To ensure intra-domain consistency, a gradient clustering algorithm is proposed to group training data based on their learning effects, where a proxy language model and dimensionality reduction are employed to reduce computational overhead. To accurately measure domain impact, we develop a Fisher Information Matrix (FIM) guided metric that quantifies how domain-specific parameter updates affect the model's output distributions on downstream tasks, with theoretical guarantees. Furthermore, to determine optimal sampling ratios, DIDS combines both the FIM-guided domain impact assessment and loss learning trajectories that indicate domain-specific potential, while accounting for diminishing marginal returns. Extensive experiments demonstrate that DIDS achieves 3.4% higher average performance while maintaining comparable training efficiency. The code is available at https://github.com/shiweijiezero/DIDS.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2504.13227

Country: Asia (0.46)

Genre: Research Report > New Finding (0.67)

Industry: Education > Curriculum > Subject-Specific Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.66)

Add feedback

Delayed Gradient Averaging: Tolerate the Communication Latency in Federated Learning

Neural Information Processing SystemsAug-19-2025, 01:06:06 GMT

Federated Learning is an emerging direction in distributed machine learning that enables jointly training a model without sharing the data.

artificial intelligence, latency, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Industry: Information Technology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

A Qualitative comparison for ablation study

Neural Information Processing SystemsAug-17-2025, 12:07:26 GMT

The results confirm that the post-processing helps to improve the resolution of the attribution. We provide the simple implementation of our algorithm in Python language. We provide the ablation study on (1) the usage of ReLU and (2) WC/EPC masks in this section. To achieve better performance in both metrics, we suggest to use both masks. We provide the quantitative evaluation on different attribution methods.

artificial intelligence, attribution method, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning

Neural Information Processing SystemsJan-19-2025, 14:59:57 GMT

communication latency, delayed gradient averaging, federated learning, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Dynamic Gradient Alignment for Online Data Mixing

Fan, Simin, Grangier, David, Ablin, Pierre

arXiv.org Artificial IntelligenceOct-3-2024

The composition of training data mixtures is critical for effectively training large language models (LLMs), as it directly impacts their performance on downstream tasks. Our goal is to identify an optimal data mixture to specialize an LLM for a specific task with access to only a few examples. Traditional approaches to this problem include ad-hoc reweighting methods, importance sampling, and gradient alignment techniques. This paper focuses on gradient alignment and introduces Dynamic Gradient Alignment (DGA), a scalable online gradient alignment algorithm. DGA dynamically estimates the pre-training data mixture on which the models' gradients align as well as possible with those of the model on the specific task. DGA is the first gradient alignment approach that incurs minimal overhead compared to standard pre-training and outputs a competitive model, eliminating the need for retraining the model. Experimentally, we demonstrate significant improvements over importance sampling in two key scenarios: (i) when the pre-training set is small and importance sampling overfits due to limited data; and (ii) when there is insufficient specialized data, trapping importance sampling on narrow pockets of data. Our findings underscore the effectiveness of gradient alignment methods in optimizing training data mixtures, particularly in data-constrained environments, and offer a practical solution for enhancing LLM performance on specific tasks with limited data availability.

dga, domain weight, specific loss val, (11 more...)

arXiv.org Artificial Intelligence

2410.02498

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)

Add feedback

The WGA's AI Wins are Good--But They're Not Enough

WIREDSep-28-2023, 18:00:28 GMT

I've been in the entertainment industry since I was nine. I joined the Screen Actors Guild (SAG) when I was 11 in 1977, the Writers Guild of America (WGA) when I was 22, and the Directors Guild of America (DGA) the following year. I got my start as a child actor on Broadway, studied film at NYU, then went on to act in movies like The Lost Boys and the Bill & Ted franchise while writing and directing my own narrative work. I've lived through several labor crises and strikes, but none like our current work shutdown, which began last spring when all three unions' contracts were simultaneously due for renegotiation and the Alliance of Motion Picture and Television Producers (AMPTP) refused their terms. The unifying stress point for labor is the devaluing of the worker, which reached a boiling point with the rapid advancement of highly sophisticated and ubiquitous machine learning tools. Actors have been replaced by AI replications of their likenesses, or their voices have been stolen outright.

artist, protection, wga, (15 more...)

WIRED

Country: Asia > China (0.05)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.95)

Add feedback