Wang, Guan
ModEL: A Modularized End-to-end Reinforcement Learning Framework for Autonomous Driving
Wang, Guan, Niu, Haoyi, Zhu, Desheng, Hu, Jianming, Zhan, Xianyuan, Zhou, Guyue
Heated debates continue over the best autonomous driving framework. The classic modular pipeline is widely adopted in industry owing to its great interpretability and stability, whereas the end-to-end paradigm has demonstrated considerable simplicity and learnability with the rise of deep learning. We introduce a new modularized end-to-end reinforcement learning framework (ModEL) for autonomous driving, which combines the merits of both approaches. The autonomous driving stack of ModEL is decomposed into perception, planning, and control modules, leveraging scene understanding, end-to-end reinforcement learning, and PID control respectively. Furthermore, we build a fully functional autonomous vehicle to deploy this framework. Through extensive simulation and real-world experiments, our framework shows strong generalizability to various complicated scenarios and outperforms the competing baselines.
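The control module of ModEL relies on classic PID control. As a minimal sketch of that idea (gains, time step, and the speed-tracking scenario below are illustrative assumptions, not the authors' implementation):

```python
class PID:
    """Minimal discrete PID controller (illustrative; gains are hypothetical)."""

    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = None

    def step(self, setpoint, measurement):
        error = setpoint - measurement
        self.integral += error * self.dt
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


# e.g. track a target speed of 10 m/s starting from 8 m/s
pid = PID(kp=1.0, ki=0.1, kd=0.05, dt=0.1)
throttle = pid.step(setpoint=10.0, measurement=8.0)
```

In a pipeline like ModEL's, the planner would emit the setpoint and the controller would close the loop against vehicle measurements at each time step.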
A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization
Zhang, Yuyang, Leung, Dik Hin, Guo, Min, Xiao, Yijia, Liu, Haoyue, Li, Yunfei, Zhang, Jiyuan, Wang, Guan, Chen, Zhen
Matrix multiplication is the bedrock of Deep Learning inference applications. When it comes to hardware acceleration on edge computing devices, matrix multiplication often takes up the great majority of the time. To achieve better performance in edge computing, we introduce a low-power Multi-Layer Perceptron (MLP) accelerator based on a pipelined matrix multiplication scheme and a non-uniform quantization methodology. The implementation runs on Field-Programmable Gate Array (FPGA) devices, and we tested its performance on handwritten digit classification and Q-learning tasks. Results show that our method achieves better performance with lower power consumption.
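Non-uniform quantization assigns more codebook levels where weights are dense. The abstract does not specify the exact scheme, so the logarithmic codebook below is purely an assumed illustration of the general idea:

```python
import numpy as np


def nonuniform_quantize(weights, n_levels=16):
    """Quantize weights to a non-uniform (logarithmically spaced) codebook.

    The codebook choice here is an assumption for illustration: dense
    small-magnitude levels suit the typical bell-shaped weight
    distribution of trained MLPs.
    """
    max_abs = np.max(np.abs(weights))
    # log-spaced positive levels, mirrored to negatives, plus zero
    pos = max_abs * np.logspace(-3, 0, n_levels // 2)
    codebook = np.concatenate([-pos[::-1], [0.0], pos])
    # map each weight to its nearest codebook entry
    idx = np.abs(weights[..., None] - codebook).argmin(axis=-1)
    return codebook[idx]


w = np.array([0.9, -0.05, 0.001, 0.0])
wq = nonuniform_quantize(w)
```

On hardware, the quantized weights would be stored as small codebook indices, shrinking memory traffic for the pipelined multiplier.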
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Hao, Xiaotian, Peng, Zhaoqing, Ma, Yi, Wang, Guan, Jin, Junqi, Hao, Jianye, Chen, Shan, Bai, Rongquan, Xie, Mingzhou, Xu, Miao, Zheng, Zhenzhe, Yu, Chuan, Li, Han, Xu, Jian, Gai, Kun
In E-commerce, advertising is essential for merchants to reach their target users. The typical objective is to maximize the advertiser's cumulative revenue over a period of time under a budget constraint. In real applications, an advertisement (ad) usually needs to be exposed to the same user multiple times until the user finally contributes revenue (e.g., places an order). However, existing advertising systems mainly focus on the immediate revenue of single ad exposures, ignoring the contribution of each exposure to the final conversion, and thus usually fall into suboptimal solutions. In this paper, we formulate sequential advertising strategy optimization as a dynamic knapsack problem. We propose a theoretically guaranteed bilevel optimization framework, which significantly reduces the solution space of the original optimization problem while ensuring solution quality. To improve the exploration efficiency of reinforcement learning, we also devise an effective action space reduction approach. Extensive offline and online experiments show the superior performance of our approaches over state-of-the-art baselines in terms of cumulative revenue.
DCAF: A Dynamic Computation Allocation Framework for Online Serving System
Jiang, Biye, Zhang, Pengye, Chen, Rihan, Dai, Binding, Luo, Xinchen, Yang, Yin, Wang, Guan, Zhou, Guorui, Zhu, Xiaoqiang, Gai, Kun
Modern large-scale systems such as recommender systems and online advertising systems are built upon computation-intensive infrastructure. The typical objective in these applications is to maximize the total revenue, e.g. GMV (Gross Merchandise Volume), under a limited computation resource. Usually, the online serving system follows a multi-stage cascade architecture, which consists of several stages including retrieval, pre-ranking, ranking, etc. Resources are usually allocated to these stages manually under specific computing-power budgets, which requires the serving configuration to be adapted accordingly. As a result, the existing system easily falls into suboptimal solutions with respect to maximizing the total revenue. The limitation is due to the fact that, although the value of traffic requests varies greatly, the online serving system still spends equal computing power on each of them. In this paper, we introduce a novel idea: the online serving system could treat each traffic request differently and allocate "personalized" computation resources based on its value. We formulate this resource allocation problem as a knapsack problem and propose a Dynamic Computation Allocation Framework (DCAF). Under some general assumptions, DCAF can theoretically guarantee that the system maximizes the total revenue within a given computation budget. DCAF brings significant improvement and has been deployed in the display advertising system of Taobao for serving the main traffic. With DCAF, we are able to maintain the same business performance with a 20% reduction in computation resources.
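The knapsack view of computation allocation can be sketched greedily: each request offers candidate actions with an expected value and a compute cost, and the system picks actions by marginal value per unit cost until the budget is spent. This is an illustrative simplification, not the deployed DCAF solver (which works from a Lagrangian formulation of the same knapsack problem):

```python
def allocate(requests, budget):
    """Greedy knapsack sketch of per-request computation allocation.

    `requests` is a list where entry j holds candidate actions for
    request j as (value, cost) pairs. Actions are chosen by value/cost
    ratio under a global computation budget. Illustrative only.
    """
    # flatten all (request, action) candidates
    candidates = []
    for j, actions in enumerate(requests):
        for value, cost in actions:
            candidates.append((value / cost, value, cost, j))
    candidates.sort(reverse=True)

    chosen, spent = {}, 0.0
    for ratio, value, cost, j in candidates:
        if j not in chosen and spent + cost <= budget:
            chosen[j] = (value, cost)
            spent += cost
    return chosen, spent


# two requests, each with a cheap and an expensive candidate action
chosen, spent = allocate([[(5, 1), (8, 4)], [(2, 1), (3, 2)]], budget=3)
```

The key departure from a fixed-budget cascade is that high-value requests can win costlier actions while low-value ones fall back to cheap ones.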
Measure Contribution of Participants in Federated Learning
Wang, Guan, Dang, Charlie Xiaoqian, Zhou, Ziye
Federated Machine Learning (FML) creates an ecosystem for multiple parties to collaborate on building models while protecting data privacy for the participants. A measure of the contribution of each party in FML enables fair credit allocation. In this paper we develop simple but powerful techniques to fairly calculate the contributions of multiple parties in FML, in the context of both horizontal FML and vertical FML. For horizontal FML we use the deletion method to calculate grouped instance influence. For vertical FML we use Shapley values to calculate grouped feature importance. Our methods open the door for research in model contribution and credit allocation in the context of federated machine learning.
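The deletion method measures a party's grouped instance influence as the drop in model quality when that party's data is removed and the model is retrained. A self-contained sketch, where `train` and `evaluate` are placeholder stand-ins for the actual FML pipeline (the toy mean-estimator "model" below is purely illustrative):

```python
def deletion_contribution(parties, train, evaluate):
    """Deletion method sketch: a party's contribution is the drop in
    model quality when its instances are removed from training."""
    full_score = evaluate(train([x for p in parties for x in p]))
    contributions = []
    for i in range(len(parties)):
        rest = [x for j, p in enumerate(parties) if j != i for x in p]
        contributions.append(full_score - evaluate(train(rest)))
    return contributions


# toy example: the "model" is the sample mean, quality is negative
# squared error against a target of 1.0
target = 1.0
train = lambda data: sum(data) / len(data)
evaluate = lambda mean: -(mean - target) ** 2
parties = [[1.0, 1.0], [0.0, 0.0]]
scores = deletion_contribution(parties, train, evaluate)
```

Here the first party's data pulls the model toward the target (positive contribution), while the second party's data pulls it away (negative contribution).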
Cooperative Lane Changing via Deep Reinforcement Learning
Wang, Guan, Hu, Jianming, Li, Zhiheng, Li, Li
In this paper, we study how to learn an appropriate lane changing strategy for autonomous vehicles by using deep reinforcement learning. We show that the reward of the system should consider the overall traffic efficiency instead of the travel efficiency of an individual vehicle. In summary, cooperation leads to a more harmonious and efficient traffic system than competition.
Interpret Federated Learning with Shapley Values
Wang, Guan
Federated Learning is introduced to protect privacy by distributing training data across multiple parties. Each party trains its own model, and a meta-model is constructed from the sub-models. In this way, the details of the data are not disclosed between the parties. In this paper we investigate model interpretation methods for Federated Learning, specifically the measurement of feature importance in vertical Federated Learning, where the feature space of the data is divided between two parties, namely host and guest. When the host party interprets a single prediction of a vertical Federated Learning model, the interpretation results, namely the feature importance, are very likely to reveal the protected data of the guest party. We propose a method to balance model interpretability and data privacy in vertical Federated Learning by using Shapley values to reveal detailed feature importance for host features and a unified importance value for the federated guest features. Our experiments indicate robust and informative results for interpreting Federated Learning models.
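The privacy-preserving trick above can be sketched with exact Shapley values over a small player set: the host features are individual players, while all guest features collapse into a single combined pseudo-player, so only one unified importance value is exposed for the guest side. The feature names and the additive toy game below are hypothetical, and `value_fn` stands in for model performance on a feature subset:

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value_fn):
    """Exact Shapley values over a small player set.

    In the vertical-FL setting, players would be the host features plus
    ONE combined pseudo-player for all guest features, keeping guest-side
    detail private. `value_fn(coalition)` is a placeholder for model
    performance using only those features.
    """
    n = len(players)
    phi = {p: 0.0 for p in players}
    for p in players:
        others = [q for q in players if q != p]
        for k in range(n):
            for coalition in combinations(others, k):
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                marginal = value_fn(set(coalition) | {p}) - value_fn(set(coalition))
                phi[p] += weight * marginal
    return phi


# toy additive game: each player contributes a fixed amount of value
worth = {"host_age": 2.0, "host_income": 1.0, "guest_combined": 3.0}
value_fn = lambda coalition: sum(worth[p] for p in coalition)
phi = shapley_values(list(worth), value_fn)
```

In an additive game like this toy one, each player's Shapley value equals its individual worth; real feature interactions would spread credit across coalitions while still yielding only one number for `guest_combined`.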