
Collaborating Authors

 Yang, Xiaofeng


DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect Estimation

arXiv.org Artificial Intelligence

Causal inference has wide applications in areas such as E-commerce and precision medicine, and its performance heavily relies on accurate estimation of the Individual Treatment Effect (ITE). Conventionally, ITE is predicted by modeling the treated and control response functions separately in their individual sample spaces. In practice, however, this approach encounters two issues: divergent distributions between the treated and control groups due to treatment bias, and significant imbalance in their population sizes. This paper proposes Deep Entire Space Cross Networks (DESCN) to model treatment effects from an end-to-end perspective. DESCN captures the integrated information of the treatment propensity, the response, and the hidden treatment effect through a cross network in a multi-task learning manner. Our method jointly learns the treatment and response functions in the entire sample space to avoid treatment bias, and employs an intermediate pseudo treatment effect prediction network to relieve sample imbalance. Extensive experiments are conducted on a synthetic dataset and a large-scale production dataset from the E-commerce voucher distribution business. The results indicate that DESCN can successfully enhance the accuracy of ITE estimation and improve uplift ranking performance. A sample of the production dataset and the source code are released to facilitate future research; to the best of our knowledge, this is the first large-scale public biased treatment dataset for causal inference.
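As a rough illustration of the entire-space, multi-task idea (a sketch, not the authors' released code; the module names, layer sizes, and unit loss weighting are all assumptions), a shared encoder can feed a propensity head, a control-response head, and a pseudo treatment-effect head, with the treated response derived through the cross structure and every task trained on every sample:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EntireSpaceCrossNet(nn.Module):
    """Toy DESCN-style multi-task network (names and sizes are illustrative)."""

    def __init__(self, num_features: int, hidden: int = 64):
        super().__init__()
        self.shared = nn.Sequential(nn.Linear(num_features, hidden), nn.ReLU())
        self.propensity_head = nn.Linear(hidden, 1)  # treatment propensity P(T=1|x)
        self.control_head = nn.Linear(hidden, 1)     # control response mu0(x)
        self.tau_head = nn.Linear(hidden, 1)         # pseudo treatment effect tau(x)

    def forward(self, x):
        h = self.shared(x)
        e = torch.sigmoid(self.propensity_head(h))
        mu0 = torch.sigmoid(self.control_head(h))
        tau = self.tau_head(h)
        # "Cross" structure: the treated response is the control response
        # plus the pseudo treatment effect.
        mu1 = torch.clamp(mu0 + tau, 1e-6, 1 - 1e-6)
        return e, mu0, mu1, tau

def entire_space_loss(model, x, t, y):
    """Joint loss over the whole sample: t is the treatment flag, y the outcome.
    Entire-space targets: P(T=1,Y=1|x) = e*mu1 and P(T=0,Y=1|x) = (1-e)*mu0."""
    e, mu0, mu1, _ = model(x)
    loss_propensity = F.binary_cross_entropy(e, t)
    loss_treated = F.binary_cross_entropy(e * mu1, t * y)
    loss_control = F.binary_cross_entropy((1 - e) * mu0, (1 - t) * y)
    return loss_propensity + loss_treated + loss_control

# Toy batch; the estimated ITE for a unit is mu1(x) - mu0(x) = tau(x).
model = EntireSpaceCrossNet(num_features=8)
x = torch.randn(32, 8)
t = torch.randint(0, 2, (32, 1)).float()
y = torch.randint(0, 2, (32, 1)).float()
print(entire_space_loss(model, x, t, y))
```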


Towards General and Efficient Online Tuning for Spark

arXiv.org Artificial Intelligence

The distributed data analytics system Spark is a common choice for processing massive volumes of heterogeneous data, but tuning its parameters to achieve high performance is challenging. Recent studies try to employ auto-tuning techniques to solve this problem but suffer from three issues: limited functionality, high overhead, and inefficient search. In this paper, we present a general and efficient Spark tuning framework that deals with all three issues simultaneously. First, we introduce a generalized tuning formulation, which conveniently supports multiple tuning goals and constraints, and a Bayesian optimization (BO) based solution to this generalized optimization problem. Second, to avoid the high overhead of the additional offline evaluations used by existing methods, we propose to tune parameters along with the actual periodic executions of each job (i.e., online evaluations). To ensure safety during online job executions, we design a safe configuration acquisition method that models the safe region. Finally, three innovative techniques further accelerate the search: adaptive sub-space generation, approximate gradient descent, and a meta-learning method. We have implemented this framework as an independent cloud service and applied it to the data platform at Tencent. Empirical results on both public benchmarks and large-scale production tasks demonstrate its superiority in terms of practicality, generality, and efficiency. Notably, within 20 iterations, this service saves an average of 57.00% in memory cost and 34.93% in CPU cost on 25K in-production tasks.
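As a rough sketch of the online-tuning idea (the two-parameter config space, the GP safety model, and the acquisition rule below are simplified stand-ins I chose for illustration, not the paper's actual algorithm), one can fit one surrogate for cost and one for run success, restrict candidates to the modeled safe region, and pick the most promising configuration for the job's next scheduled execution:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

rng = np.random.default_rng(0)
# Toy 2-D config space: (executor memory in GB, executor cores).
BOUNDS = np.array([[1.0, 16.0], [1.0, 8.0]])

def sample_candidates(n=256):
    return rng.uniform(BOUNDS[:, 0], BOUNDS[:, 1], size=(n, 2))

def suggest_next_config(x_hist, cost_hist, ok_hist, safety_level=0.8, kappa=1.0):
    """Suggest a config to try on the job's next periodic (online) run.

    x_hist:    configs tried so far, shape (n, 2)
    cost_hist: observed cost per run (e.g., runtime), shape (n,)
    ok_hist:   1.0 if the run finished safely, 0.0 on failure/timeout
    """
    cost_gp = GaussianProcessRegressor(normalize_y=True).fit(x_hist, cost_hist)
    safe_gp = GaussianProcessRegressor().fit(x_hist, ok_hist)

    cand = sample_candidates()
    cand = cand[safe_gp.predict(cand) >= safety_level]  # modeled safe region only
    if len(cand) == 0:
        return x_hist[int(np.argmin(cost_hist))]        # fall back to best known

    mean, std = cost_gp.predict(cand, return_std=True)
    return cand[int(np.argmin(mean - kappa * std))]     # optimistic acquisition

# Seed history: two safe runs and one that timed out.
x = np.array([[4.0, 2.0], [8.0, 4.0], [1.0, 8.0]])
cost = np.array([120.0, 95.0, 300.0])
ok = np.array([1.0, 1.0, 0.0])
print(suggest_next_config(x, cost, ok))
```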


IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

arXiv.org Artificial Intelligence

Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs). Nonetheless, existing Text-to-3D approaches often grapple with challenges such as over-saturation, inadequate detailing, and unrealistic outputs. This study presents a novel strategy that leverages explicitly synthesized multi-view images to address these issues. Our approach uses image-to-image pipelines, empowered by LDMs, to generate posed high-quality images based on the renderings of coarse 3D models. Although the generated images mostly alleviate the aforementioned issues, challenges such as view inconsistency and significant content variance persist due to the inherently generative nature of large diffusion models, making it difficult to leverage these images effectively. To overcome this hurdle, we advocate integrating a discriminator alongside a novel Diffusion-GAN dual training strategy to guide the training of 3D models. For the incorporated discriminator, the synthesized multi-view images are treated as real data, while the renderings of the optimized 3D models serve as fake data. A comprehensive set of experiments demonstrates the effectiveness of our method over baseline approaches.
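A minimal sketch of the discriminator side of this dual training, under the real/fake assignment the abstract describes (the tiny network, optimizer settings, and loss weighting are invented for illustration; the real pipeline also involves the LDM and a differentiable 3D renderer):

```python
import torch
import torch.nn as nn

# Tiny stand-in discriminator; names and sizes are illustrative only.
disc = nn.Sequential(
    nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 1),
)
opt_d = torch.optim.Adam(disc.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

def gan_losses(synth_views, renders):
    """synth_views: LDM-synthesized multi-view images, treated as REAL.
    renders: differentiable renderings of the 3D model, treated as FAKE."""
    d_real = disc(synth_views)
    d_fake = disc(renders.detach())          # detach: discriminator update only
    loss_d = (bce(d_real, torch.ones_like(d_real))
              + bce(d_fake, torch.zeros_like(d_fake)))
    d_gen = disc(renders)                    # gradients flow back to the 3D model
    loss_g = bce(d_gen, torch.ones_like(d_gen))
    return loss_d, loss_g

# One step on dummy tensors standing in for actual images and renderings.
synth = torch.rand(4, 3, 64, 64)
render = torch.rand(4, 3, 64, 64, requires_grad=True)
loss_d, loss_g = gan_losses(synth, render)
opt_d.zero_grad(); loss_d.backward(); opt_d.step()
# In dual training, loss_g would be added to the diffusion-distillation loss
# that updates the 3D representation: total = loss_diffusion + lam * loss_g.
```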


Self-Training Vision Language BERTs with a Unified Conditional Model

arXiv.org Artificial Intelligence

Natural language BERTs are trained on language corpora in a self-supervised manner. Large-scale pretraining has become the dominant approach in various natural language processing tasks, and its success is due to the large amount of training data available everywhere and to the self-training algorithm. Although self-training has shown its effectiveness in various tasks [4], [5], how to use it effectively in training vision language BERTs is not yet studied. In this paper, we propose a self-training approach that allows pretraining VL-BERTs using unlabeled image data. Self-training is usually done by iterating the following three steps: 1) training with labeled data, 2) generating pseudo labels for unlabeled data, 3) mixing the labeled data and the pseudo-labeled unlabeled data for further training. However, the self-training of vision language BERTs is nontrivial for the following reasons. First, although auto-encoding models … Second, current common practice in vision language BERT pretraining uses various image descriptions to train; those image descriptions have significant differences, making it difficult for an unconditional model to learn to generate adequate pseudo captions for unlabeled images.

(Figure: An example of generated image descriptions. Given different condition flags, our proposed UCM model is able to generate diverse image descriptions, such as COCO captions, dense captions, and questions; the generated contents clearly have different styles.)
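The three-step loop described above can be written down directly; in this skeleton the toy model, the `train` function, and the condition-flag values are illustrative placeholders rather than the paper's actual code:

```python
from dataclasses import dataclass

# Minimal stand-ins: a real system would use a VL-BERT and actual training.
@dataclass
class ToyCaptioner:
    version: int = 0

    def generate(self, image, flag):
        return f"[{flag}] pseudo description of {image} (teacher v{self.version})"

def train(model, data):
    # Placeholder "training": a real implementation updates model weights.
    return ToyCaptioner(version=model.version + 1)

def self_train(model, labeled, unlabeled_images, rounds=3, flag="coco_caption"):
    """Iterate the three steps: train on labeled data, pseudo-label unlabeled
    images (conditioned on a description-style flag so one model generates a
    consistent style), then retrain on the mixture; the student becomes the
    next round's teacher."""
    for _ in range(rounds):
        model = train(model, labeled)                              # step 1
        pseudo = [(img, model.generate(img, flag), flag)
                  for img in unlabeled_images]                     # step 2
        model = train(model, labeled + pseudo)                     # step 3
    return model

labeled = [("img_001.jpg", "a dog on a couch", "coco_caption")]
student = self_train(ToyCaptioner(), labeled, ["img_042.jpg", "img_043.jpg"])
print(student.version)
```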


DMBGN: Deep Multi-Behavior Graph Networks for Voucher Redemption Rate Prediction

arXiv.org Artificial Intelligence

In E-commerce, vouchers are important marketing tools for enhancing user engagement and boosting sales and revenue. The likelihood that a user redeems a voucher is a key factor in voucher distribution decisions. User-item Click-Through-Rate (CTR) models are often applied to predict the user-voucher redemption rate. However, the voucher scenario involves more complicated relations among users, items, and vouchers. A user's historical behavior in a voucher collection activity reflects that user's voucher usage patterns, which is nevertheless overlooked by CTR-based solutions. In this paper, we propose Deep Multi-Behavior Graph Networks (DMBGN) to shed light on voucher redemption rate prediction. The complex structural user-voucher-item relationships are captured by a User-Behavior Voucher Graph (UVG). User behavior both before and after voucher collection is taken into account, and a high-level representation is extracted by Higher-order Graph Neural Networks. On top of the sequence of UVGs, an attention network is built to learn users' long-term voucher redemption preferences. Extensive experiments on three large-scale production datasets demonstrate that the proposed DMBGN model is effective, with 10% to 16% relative AUC improvement over Deep Neural Networks (DNN) and 2% to 4% AUC improvement over Deep Interest Network (DIN). The source code and a sample dataset are made publicly available to facilitate future research.
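As a loose sketch of the attention over the UVG sequence (a DIN-style attention scored against the target voucher; every dimension, layer, and name below is an assumption for illustration, and producing the UVG embeddings themselves would require the graph networks described above):

```python
import torch
import torch.nn as nn

class UVGSequenceAttention(nn.Module):
    """Attend over embeddings of historical User-Behavior Voucher Graphs
    (UVGs) with respect to the target voucher, then predict the redemption
    rate (all module names and sizes are illustrative)."""

    def __init__(self, dim: int = 32):
        super().__init__()
        self.score = nn.Sequential(nn.Linear(3 * dim, dim), nn.ReLU(),
                                   nn.Linear(dim, 1))
        self.predict = nn.Sequential(nn.Linear(2 * dim, dim), nn.ReLU(),
                                     nn.Linear(dim, 1), nn.Sigmoid())

    def forward(self, uvg_seq, target):
        # uvg_seq: (batch, seq_len, dim) historical UVG embeddings
        # target:  (batch, dim) target voucher(-graph) embedding
        t = target.unsqueeze(1).expand_as(uvg_seq)
        w = self.score(torch.cat([uvg_seq, t, uvg_seq * t], dim=-1))  # (B, L, 1)
        w = torch.softmax(w, dim=1)
        interest = (w * uvg_seq).sum(dim=1)   # long-term redemption preference
        return self.predict(torch.cat([interest, target], dim=-1))

# Toy usage: 8 users, each with 5 historical UVG embeddings of size 32.
model = UVGSequenceAttention(dim=32)
rates = model(torch.randn(8, 5, 32), torch.randn(8, 32))  # (8, 1) in [0, 1]
print(rates.shape)
```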