AITopics | Li, Di

Collaborating Authors

Li, Di

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Applying Deep Learning to Ads Conversion Prediction in Last Mile Delivery Marketplace

Li, Di, Miao, Xiaochang, Song, Huiyu, Chu, Chao, Xu, Hao, Rahurkar, Mandar

arXiv.org Artificial IntelligenceFeb-14-2025

Deep neural networks (DNNs) have revolutionized web-scale ranking systems, enabling breakthroughs in capturing complex user behaviors and driving performance gains. At DoorDash, we first harnessed this transformative power by transitioning our homepage Ads ranking system from traditional tree based models to cutting edge multi task DNNs. This evolution sparked advancements in data foundations, model design, training efficiency, evaluation rigor, and online serving, delivering substantial business impact and reshaping our approach to machine learning. In this paper, we talk about our problem driven journey, from identifying the right problems and crafting targeted solutions to overcoming the complexity of developing and scaling a deep learning recommendation system. Through our successes and learned lessons, we aim to share insights and practical guidance to teams pursuing similar advancements in machine learning systems.

artificial intelligence, machine learning, prediction, (15 more...)

arXiv.org Artificial Intelligence

2502.10514

Country:

North America > United States (0.46)
North America > Canada > Ontario (0.15)

Genre:

Research Report (0.41)
Instructional Material (0.34)

Industry: Information Technology > Services (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Learning for Day Forecasts from Sparse Observations

Andrychowicz, Marcin, Espeholt, Lasse, Li, Di, Merchant, Samier, Merose, Alexander, Zyda, Fred, Agrawal, Shreya, Kalchbrenner, Nal

arXiv.org Artificial IntelligenceJul-6-2023

Deep neural networks offer an alternative paradigm for modeling weather conditions. The ability of neural models to make a prediction in less than a second once the data is available and to do so with very high temporal and spatial resolution, and the ability to learn directly from atmospheric observations, are just some of these models' unique advantages. Neural models trained using atmospheric observations, the highest fidelity and lowest latency data, have to date achieved good performance only up to twelve hours of lead time when compared with state-of-the-art probabilistic Numerical Weather Prediction models and only for the sole variable of precipitation. In this paper, we present MetNet-3 that extends significantly both the lead time range and the variables that an observation based neural model can predict well. MetNet-3 learns from both dense and sparse data sensors and makes predictions up to 24 hours ahead for precipitation, wind, temperature and dew point. MetNet-3 introduces a key densification technique that implicitly captures data assimilation and produces spatially dense forecasts in spite of the network training on extremely sparse targets. MetNet-3 has a high temporal and spatial resolution of, respectively, up to 2 minutes and 1 km as well as a low operational latency. We find that MetNet-3 is able to outperform the best single- and multi-member NWPs such as HRRR and ENS over the CONUS region for up to 24 hours ahead setting a new performance milestone for observation based neural models. MetNet-3 is operational and its forecasts are served in Google Search in conjunction with other models.

artificial intelligence, machine learning, metnet-3, (18 more...)

arXiv.org Artificial Intelligence

2306.06079

Country: North America > United States (1.00)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs

Jiao, Yuling, Li, Di, Lu, Xiliang, Yang, Jerry Zhijian, Yuan, Cheng

arXiv.org Artificial IntelligenceApr-7-2023

While Both DRM and WAN use the variational forms of the PDEs, PINNs solves PDEs by a direct minimization of the square residuals, which makes the method more flexible and easier to be formulated to general problems. By reformulating the original PDE problems into a optimization problems, The physics-informed neural networks [1, 4, 5, 6] can easily approximate the solution of PDE with a deep neural network (DNN) function space, through minimizing the corresponding loss term which is defined as the integral of residuals. In practice, this integral is approximated by using Monte Carlo (MC) methods with finite points, which are usually sampled according to a uniform distribution on the computational domain. In contrast to classical computational methods, where the main concern is the approximation error, one needs to balance the approximation error and the generalization error for the neural network solvers, where the approximation error mainly originates from the modeling capability of the neural network, while the generalization error is mainly related to the discretization of loss with random samples. While a uniform random sampling strategy is simple to implement, the low regularity regions of the solution may not be taken consideration enough, which makes PINNs be inefficient and even inaccurate, especially when the problems is high dimensional.

artificial intelligence, machine learning, pinn, (16 more...)

arXiv.org Artificial Intelligence

2303.15849

Country: Asia > China (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Universal Rules for Fooling Deep Neural Networks based Text Classification

Li, Di, Vargas, Danilo Vasconcellos, Kouichi, Sakurai

arXiv.org Machine LearningJan-21-2019

Recently, deep learning based natural language processing techniques are being extensively used to deal with spam mail, censorship evaluation in social networks, among others. However, there is only a couple of works evaluating the vulnerabilities of such deep neural networks. Here, we go beyond attacks to investigate, for the first time, universal rules, i.e., rules that are sample agnostic and therefore could turn any text sample in an adversarial one. In fact, the universal rules do not use any information from the method itself (no information from the method, gradient information or training dataset information is used), making them black-box universal attacks. In other words, the universal rules are sample and method agnostic. By proposing a coevolutionary optimization algorithm we show that it is possible to create universal rules that can automatically craft imperceptible adversarial samples (only less than five perturbations which are close to misspelling are inserted in the text sample). A comparison with a random search algorithm further justifies the strength of the method. Thus, universal rules for fooling networks are here shown to exist. Hopefully, the results from this work will impact the development of yet more sample and model agnostic attacks as well as their defenses, culminating in perhaps a new age for artificial intelligence.

deep learning, neural network, universal rule, (16 more...)

arXiv.org Machine Learning

1901.07132

Country: Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.15)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The Design and Implementation of XiaoIce, an Empathetic Social Chatbot

Zhou, Li, Gao, Jianfeng, Li, Di, Shum, Heung-Yeung

arXiv.org Artificial IntelligenceDec-21-2018

This paper describes the development of the Microsoft XiaoIce system, the most popular social chatbot in the world. XiaoIce is uniquely designed as an AI companion with an emotional connection to satisfy the human need for communication, affection, and social belonging. We take into account both intelligent quotient (IQ) and emotional quotient (EQ) in system design, cast human-machine social chat as decision-making over Markov Decision Processes (MDPs), and optimize XiaoIce for long-term user engagement, measured in expected Conversation-turns Per Session (CPS). We detail the system architecture and key components including dialogue manager, core chat, skills, and an empathetic computing module. We show how XiaoIce dynamically recognizes human feelings and states, understands user intents, and responds to user needs throughout long conversations. Since the release in 2014, XiaoIce has communicated with over 660 million users and succeeded in establishing long-term relationships with many of them. Analysis of large-scale online logs shows that XiaoIce has achieved an average CPS of 23, which is significantly higher than that of other chatbots and even human conversations.

deep learning, neural network, xiaoice, (20 more...)

arXiv.org Artificial Intelligence

1812.08989

Country:

Asia (1.00)
North America > United States > Massachusetts > Hampshire County > Amherst (0.14)

Genre: Research Report (0.64)

Industry:

Information Technology > Services (0.93)
Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback