AITopics | Xu, Bixiong

Collaborating Authors

Xu, Bixiong

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

STAND-Guard: A Small Task-Adaptive Content Moderation Model

Wang, Minjia, Lin, Pingping, Cai, Siqi, An, Shengnan, Ma, Shengjie, Lin, Zeqi, Huang, Congrui, Xu, Bixiong

arXiv.org Artificial IntelligenceNov-7-2024

Content moderation, the process of reviewing and monitoring the safety of generated content, is important for development of welcoming online platforms and responsible large language models. Content moderation contains various tasks, each with its unique requirements tailored to specific scenarios. Therefore, it is crucial to develop a model that can be easily adapted to novel or customized content moderation tasks accurately without extensive model tuning. This paper presents STAND-GUARD, a Small Task-Adaptive coNtent moDeration model. The basic motivation is: by performing instruct tuning on various content moderation tasks, we can unleash the power of small language models (SLMs) on unseen (out-of-distribution) content moderation tasks. We also carefully study the effects of training tasks and model size on the efficacy of cross-task fine-tuning mechanism. Experiments demonstrate STAND-Guard is comparable to GPT-3.5-Turbo across over 40 public datasets, as well as proprietary datasets derived from real-world business scenarios. Remarkably, STAND-Guard achieved nearly equivalent results to GPT-4-Turbo on unseen English binary classification tasks

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2411.05214

Country: Europe (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Law Enforcement & Public Safety (0.69)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning Timestamp-Level Representations for Time Series with Hierarchical Contrastive Loss

Yue, Zhihan, Wang, Yujing, Duan, Juanyong, Yang, Tianmeng, Huang, Congrui, Xu, Bixiong

arXiv.org Artificial IntelligenceJun-19-2021

This paper presents TS2Vec, a universal framework for learning timestamp-level representations of time series. Unlike existing methods, TS2Vec performs timestamp-wise discrimination, which learns a contextual representation vector directly for each timestamp. We find that the learned representations have superior predictive ability. A linear regression trained on top of the learned representations outperforms previous SOTAs for supervised time series forecasting. Also, the instance-level representations can be simply obtained by applying a max pooling layer on top of learned representations of all timestamps. We conduct extensive experiments on time series classification tasks to evaluate the quality of instance-level representations. As a result, TS2Vec achieves significant improvement compared with existing SOTAs of unsupervised time series representation on 125 UCR datasets and 29 UEA datasets. The source code is publicly available at https://github.com/yuezhihan/ts2vec.

deep learning, neural network, representation, (17 more...)

arXiv.org Artificial Intelligence

2106.10466

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Spectral Temporal Graph Neural Network for Multivariate Time-series Forecasting

Cao, Defu, Wang, Yujing, Duan, Juanyong, Zhang, Ce, Zhu, Xia, Huang, Conguri, Tong, Yunhai, Xu, Bixiong, Bai, Jing, Tong, Jie, Zhang, Qi

arXiv.org Artificial IntelligenceMar-13-2021

Multivariate time-series forecasting plays a crucial role in many real-world applications. It is a challenging problem as one needs to consider both intra-series temporal correlations and inter-series correlations simultaneously. Recently, there have been multiple works trying to capture both correlations, but most, if not all of them only capture temporal correlations in the time domain and resort to pre-defined priors as inter-series relationships. In this paper, we propose Spectral Temporal Graph Neural Network (StemGNN) to further improve the accuracy of multivariate time-series forecasting. StemGNN captures inter-series correlations and temporal dependencies \textit{jointly} in the \textit{spectral domain}. It combines Graph Fourier Transform (GFT) which models inter-series correlations and Discrete Fourier Transform (DFT) which models temporal dependencies in an end-to-end framework. After passing through GFT and DFT, the spectral representations hold clear patterns and can be predicted effectively by convolution and sequential learning modules. Moreover, StemGNN learns inter-series correlations automatically from the data without using pre-defined priors. We conduct extensive experiments on ten real-world datasets to demonstrate the effectiveness of StemGNN. Code is available at https://github.com/microsoft/StemGNN/

deep learning, forecasting, immunology, (18 more...)

arXiv.org Artificial Intelligence

2103.07719

Country:

North America > United States (0.28)
Asia > Middle East > Qatar (0.14)

Genre: Research Report (1.00)

Industry:

Information Technology (0.93)
Banking & Finance (0.67)
Health & Medicine > Therapeutic Area (0.33)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Multivariate Time-series Anomaly Detection via Graph Attention Network

Zhao, Hang, Wang, Yujing, Duan, Juanyong, Huang, Congrui, Cao, Defu, Tong, Yunhai, Xu, Bixiong, Bai, Jing, Tong, Jie, Zhang, Qi

arXiv.org Machine LearningSep-4-2020

Anomaly detection on multivariate time-series is of great importance in both data mining research and industrial applications. Recent approaches have achieved significant progress in this topic, but there is remaining limitations. One major limitation is that they do not capture the relationships between different time-series explicitly, resulting in inevitable false alarms. In this paper, we propose a novel self-supervised framework for multivariate time-series anomaly detection to address this issue. Our framework considers each univariate time-series as an individual feature and includes two graph attention layers in parallel to learn the complex dependencies of multivariate time-series in both temporal and feature dimensions. In addition, our approach jointly optimizes a forecasting-based model and are construction-based model, obtaining better time-series representations through a combination of single-timestamp prediction and reconstruction of the entire time-series. We demonstrate the efficacy of our model through extensive experiments. The proposed method outperforms other state-of-the-art models on three real-world datasets. Further analysis shows that our method has good interpretability and is useful for anomaly diagnosis.

anomaly detection, deep learning, neural network, (17 more...)

arXiv.org Machine Learning

2009.0204

Country: North America (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

Time-Series Anomaly Detection Service at Microsoft

Ren, Hansheng, Xu, Bixiong, Wang, Yujing, Yi, Chao, Huang, Congrui, Kou, Xiaoyu, Xing, Tony, Yang, Mao, Tong, Jie, Zhang, Qi

arXiv.org Machine LearningJun-10-2019

Large companies need to monitor various metrics (for example, Page Views and Revenue) of their applications and services in real time. At Microsoft, we develop a time-series anomaly detection service which helps customers to monitor the time-series continuously and alert for potential incidents on time. In this paper, we introduce the pipeline and algorithm of our anomaly detection service, which is designed to be accurate, efficient and general. The pipeline consists of three major modules, including data ingestion, experimentation platform and online compute. To tackle the problem of time-series anomaly detection, we propose a novel algorithm based on Spectral Residual (SR) and Convolutional Neural Network (CNN). Our work is the first attempt to borrow the SR model from visual saliency detection domain to time-series anomaly detection. Moreover, we innovatively combine SR and CNN together to improve the performance of SR model. Our approach achieves superior experimental results compared with state-of-the-art baselines on both public datasets and Microsoft production data.

anomaly detection, deep learning, neural network, (21 more...)

arXiv.org Machine Learning

doi: 10.1145/3292500.3330680

1906.03821

Country: North America > United States (0.46)

Genre: Research Report > Promising Solution (0.68)

Industry: Information Technology (0.68)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback