AITopics | computing technology

Collaborating Authors

computing technology

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Robust Federated Learning against Noisy Clients via Masked Optimization

Jiang, Xuefeng, Wen, Tian, Yang, Zhiqin, Wu, Lvhua, Chen, Yufeng, Sun, Sheng, Wang, Yuwei, Liu, Min

arXiv.org Machine LearningJun-4-2025

In recent years, federated learning (FL) has made significant advance in privacy-sensitive applications. However, it can be hard to ensure that FL participants provide well-annotated data for training. The corresponding annotations from different clients often contain complex label noise at varying levels. This label noise issue has a substantial impact on the performance of the trained models, and clients with greater noise levels can be largely attributed for this degradation. To this end, it is necessary to develop an effective optimization strategy to alleviate the adverse effects of these noisy clients.In this study, we present a two-stage optimization framework, MaskedOptim, to address this intricate label noise problem. The first stage is designed to facilitate the detection of noisy clients with higher label noise rates. The second stage focuses on rectifying the labels of the noisy clients' data through an end-to-end label correction mechanism, aiming to mitigate the negative impacts caused by misinformation within datasets. This is achieved by learning the potential ground-truth labels of the noisy clients' datasets via backpropagation. To further enhance the training robustness, we apply the geometric median based model aggregation instead of the commonly-used vanilla averaged model aggregation. We implement sixteen related methods and conduct evaluations on three image datasets and one text dataset with diverse label noise patterns for a comprehensive comparison. Extensive experimental results indicate that our proposed framework shows its robustness in different scenarios. Additionally, our label correction framework effectively enhances the data quality of the detected noisy clients' local datasets. % Our codes will be open-sourced to facilitate related research communities. Our codes are available via https://github.com/Sprinter1999/MaskedOptim .

artificial intelligence, learning, machine learning, (15 more...)

arXiv.org Machine Learning

2506.02079

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Asia > China > Beijing > Beijing (0.05)
(21 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Information Technology (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation

Lian, Junhong, Ao, Xiang, Liu, Xinyu, Liu, Yang, He, Qing

arXiv.org Artificial IntelligenceJan-27-2025

Personalized news headline generation aims to provide users with attention-grabbing headlines that are tailored to their preferences. Prevailing methods focus on user-oriented content preferences, but most of them overlook the fact that diverse stylistic preferences are integral to users' panoramic interests, leading to suboptimal personalization. In view of this, we propose a novel Stylistic-Content Aware Personalized Headline Generation (SCAPE) framework. SCAPE extracts both content and stylistic features from headlines with the aid of large language model (LLM) collaboration. It further adaptively integrates users' long- and short-term interests through a contrastive learning-based hierarchical fusion network. By incorporating the panoramic interests into the headline generator, SCAPE reflects users' stylistic-content preferences during the generation process. Extensive experiments on the real-world dataset PENS demonstrate the superiority of SCAPE over baselines.

headline generation, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3701716.3715539

2501.119

Country:

Asia > China > Beijing > Beijing (0.06)
Oceania > Australia > New South Wales > Sydney (0.05)
South America > Colombia (0.05)
(5 more...)

Genre: Research Report (0.50)

Industry: Government > Voting & Elections (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework

Wang, Kun, Chang, Kaiyan, Wang, Mengdi, Zou, Xinqi, Xu, Haobo, Han, Yinhe, Wang, Ying

arXiv.org Artificial IntelligenceJan-5-2025

Recent advances of large language models in the field of Verilog generation have raised several ethical and security concerns, such as code copyright protection and dissemination of malicious code. Researchers have employed watermarking techniques to identify codes generated by large language models. However, the existing watermarking works fail to protect RTL code copyright due to the significant syntactic and semantic differences between RTL code and software code in languages such as Python. This paper proposes a hardware watermarking framework RTLMarker that embeds watermarks into RTL code and deeper into the synthesized netlist. We propose a set of rule-based Verilog code transformations , ensuring the watermarked RTL code's syntactic and semantic correctness. In addition, we consider an inherent tradeoff between watermark transparency and watermark effectiveness and jointly optimize them. The results demonstrate RTLMarker's superiority over the baseline in RTL code watermarking.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2501.02446

Country:

Asia > China (0.31)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.16)

Genre: Research Report (0.70)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Do All Problems Have Technical Fixes?

Communications of the ACMSep-26-2024, 13:26:21 GMT

Tech solutionism, as identified by Moss and Metcalf,7 is the notion that all problems have tractable technical fixes. We see variants in the naming and definition of this phenomenon: the technology imperative,8 or "the underlying technocratic philosophy of inevitability",4 or even old-fashioned technocracy itself. All versions designate a confident deployment of technology to solve a non-technical problem, with costs and other drawbacks reduced to secondary consideration. A certain Tech Leader promotes a new startup, Sunshine, thus: "… by applying AI … you can both solve valuable problems and you can give people back time. You can also build their confidence in AI."6

reasoning, tech leader, technical fix, (10 more...)

Communications of the ACM

Country: North America > United States > California (0.05)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration

Xue, Runzhen, Yan, Mingyu, Han, Dengke, Tang, Zhimin, Ye, Xiaochun, Fan, Dongrui

arXiv.org Artificial IntelligenceAug-27-2024

Heterogeneous Graph Neural Networks (HGNNs) have expanded graph representation learning to heterogeneous graph fields. Recent studies have demonstrated their superior performance across various applications, including medical analysis and recommendation systems, often surpassing existing methods. However, GPUs often experience inefficiencies when executing HGNNs due to their unique and complex execution patterns. Compared to traditional Graph Neural Networks, these patterns further exacerbate irregularities in memory access. To tackle these challenges, recent studies have focused on developing domain-specific accelerators for HGNNs. Nonetheless, most of these efforts have concentrated on optimizing the datapath or scheduling data accesses, while largely overlooking the potential benefits that could be gained from leveraging the inherent properties of the semantic graph, such as its topology, layout, and generation. In this work, we focus on leveraging the properties of semantic graphs to enhance HGNN performance. First, we analyze the Semantic Graph Build (SGB) stage and identify significant opportunities for data reuse during semantic graph generation. Next, we uncover the phenomenon of buffer thrashing during the Graph Feature Processing (GFP) stage, revealing potential optimization opportunities in semantic graph layout. Furthermore, we propose a lightweight hardware accelerator frontend for HGNNs, called SiHGNN. This accelerator frontend incorporates a tree-based Semantic Graph Builder for efficient semantic graph generation and features a novel Graph Restructurer for optimizing semantic graph layouts. Experimental results show that SiHGNN enables the state-of-the-art HGNN accelerator to achieve an average performance improvement of 2.95$\times$.

graph, semantic graph, vertex, (14 more...)

arXiv.org Artificial Intelligence

2408.15089

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Beijing > Beijing (0.05)
Asia > China > Guangdong Province > Shenzhen (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

ITERTL: An Iterative Framework for Fine-tuning LLMs for RTL Code Generation

Wu, Peiyang, Guo, Nan, Xiao, Xiao, Li, Wenming, Ye, Xiaochun, Fan, Dongrui

arXiv.org Artificial IntelligenceJun-27-2024

Recently, large language models (LLMs) have demonstrated excellent performance in understanding human instructions and generating code, which has inspired researchers to explore the feasibility of generating RTL code with LLMs. However, the existing approaches to fine-tune LLMs on RTL codes typically are conducted on fixed datasets, which do not fully stimulate the capability of LLMs and require large amounts of reference data. To mitigate these issues , we introduce a simple yet effective iterative training paradigm named ITERTL. During each iteration, samples are drawn from the model trained in the previous cycle. Then these new samples are employed for training in this loop. Through this iterative approach, the distribution mismatch between the model and the training samples is reduced. Additionally, the model is thus enabled to explore a broader generative space and receive more comprehensive feedback. Theoretical analyses are conducted to investigate the mechanism of the effectiveness. Experimental results show the model trained through our proposed approach can compete with and even outperform the state-of-the-art (SOTA) open-source model with nearly 37\% reference samples, achieving remarkable 42.9\% and 62.2\% pass@1 rate on two VerilogEval evaluation datasets respectively. While using the same amount of reference samples, our method can achieved a relative improvement of 16.9\% and 12.5\% in pass@1 compared to the non-iterative method. This study facilitates the application of LLMs for generating RTL code in practical scenarios with limited data.

arxiv preprint arxiv, iteration, llm, (15 more...)

arXiv.org Artificial Intelligence

2407.12022

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

Privacy-Enhanced Training-as-a-Service for On-Device Intelligence: Concept, Architectural Scheme, and Open Problems

Wu, Zhiyuan, Sun, Sheng, Wang, Yuwei, Liu, Min, Gao, Bo, He, Tianliu, Wang, Wen

arXiv.org Artificial IntelligenceApr-27-2024

On-device intelligence (ODI) enables artificial intelligence (AI) applications to run on end devices, providing real-time and customized AI inference without relying on remote servers. However, training models for on-device deployment face significant challenges due to the decentralized and privacy-sensitive nature of users' data, along with end-side constraints related to network connectivity, computation efficiency, etc. Existing training paradigms, such as cloud-based training, federated learning, and transfer learning, fail to sufficiently address these practical constraints that are prevalent for devices. To overcome these challenges, we propose Privacy-Enhanced Training-as-a-Service (PTaaS), a novel service computing paradigm that provides privacy-friendly, customized AI model training for end devices. PTaaS outsources the core training process to remote and powerful cloud or edge servers, efficiently developing customized on-device models based on uploaded anonymous queries, enhancing data privacy while reducing the computation load on individual devices. We explore the definition, goals, and design principles of PTaaS, alongside emerging technologies that support the PTaaS paradigm. An architectural scheme for PTaaS is also presented, followed by a series of open problems that set the stage for future research directions in the field of PTaaS.

chinese academy, model training, ptaas, (15 more...)

arXiv.org Artificial Intelligence

2404.10255

Country:

Asia > China > Beijing > Beijing (0.06)
Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Virginia (0.04)
(3 more...)

Genre:

Instructional Material (1.00)
Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.55)

Add feedback

EnergAIze: Multi Agent Deep Deterministic Policy Gradient for Vehicle to Grid Energy Management

Fonseca, Tiago, Ferreira, Luis, Cabral, Bernardo, Severino, Ricardo, Praca, Isabel

arXiv.org Artificial IntelligenceApr-9-2024

This paper investigates the increasing roles of Renewable Energy Sources (RES) and Electric Vehicles (EVs). While indicating a new era of sustainable energy, these also introduce complex challenges, including the need to balance supply and demand and smooth peak consumptions amidst rising EV adoption rates. Addressing these challenges requires innovative solutions such as Demand Response (DR), energy flexibility management, Renewable Energy Communities (RECs), and more specifically for EVs, Vehicle-to-Grid (V2G). However, existing V2G approaches often fall short in real-world adaptability, global REC optimization with other flexible assets, scalability, and user engagement. To bridge this gap, this paper introduces EnergAIze, a Multi-Agent Reinforcement Learning (MARL) energy management framework, leveraging the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm. EnergAIze enables user-centric and multi-objective energy management by allowing each prosumer to select from a range of personal management objectives, thus encouraging engagement. Additionally, it architects' data protection and ownership through decentralized computing, where each prosumer can situate an energy management optimization node directly at their own dwelling. The local node not only manages local energy assets but also fosters REC wide optimization. The efficacy of EnergAIze was evaluated through case studies employing the CityLearn simulation framework. These simulations were instrumental in demonstrating EnergAIze's adeptness at implementing V2G technology within a REC and other energy assets. The results show reduction in peak loads, ramping, carbon emissions, and electricity costs at the REC level while optimizing for individual prosumers objectives.

algorithm, energaize, objective, (15 more...)

arXiv.org Artificial Intelligence

2404.02361

Country:

North America > United States (0.04)
Europe > Switzerland (0.04)
Europe > Sweden (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

What Do Computing and Economics Have to Say to Each Other?

Communications of the ACMFeb-15-2024, 18:32:04 GMT

I described a 1999 result by Koutsoupias and Papadimitriou, regarding multi-agent systems. They studied systems in which non-cooperative agents share a common resource and proposed the ratio between the worst possible Nash equilibrium and the social optimum as a measure of the effectiveness of the system. This ratio has become known as the "Price of Anarchy," as it measures how far from optimal such non-cooperative systems can be. They showed that the price of anarchy could be arbitrarily high, depending on the complexity of the system. The Price-of-Anarchy concept has later been extended to other types of equilibria--for example, Pareto-Optimal Equilibria.b

computing and economics, computing technology, productivity growth, (13 more...)

Communications of the ACM

Country: North America > United States (0.17)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.56)

Add feedback

Digital Twins and Dependency/Constraint-Aware AI for Digital Manufacturing

Communications of the ACMJun-23-2023, 02:50:56 GMT

Increasing productivity in manufacturing has been an elusive goal despite significant advances in factory automation technology and robotics. There are four main challenges currently facing manufacturers: low production efficiency; product defects and inconsistent quality; unforeseen machine maintenance; and high energy use and waste costs. The fourth industrial revolution--also referred to as Industry 4.0--sets out critical technological directions for addressing these grand challenges via data-driven digital manufacturing (DM) solutions incorporating novel computing technology that combines AI/machine learning (ML) and digital twins (DTs)4 for digitally representing complex physical industrial machine, products, and people in production. While digital manufacturing powered by digital twins and dependency/constraint-aware ML is still in early stages, it has shown its potential in improving manufacturing productivity by 20%-30%. Although the Industry 4.0 vision and directions are supported by major manufacturing companies and technology providers (for example, Siemens, Bosch, and IBM), its technology baseline is not mature enough to address related computing needs.

australia, dm solution, twin and dependency constraint-aware ai, (13 more...)

Communications of the ACM

Country: Oceania > Australia > Victoria > Melbourne (0.07)

Industry: Information Technology (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback