Gao, Zheng
Citation Recommendation based on Argumentative Zoning of User Queries
Ma, Shutian, Zhang, Chengzhi, Zhang, Heng, Gao, Zheng
Due to the increasing of scientific publication, scientific information recommendation has become an urgent problem which can save retrieval cost. There are kinds of information that can be recommended, such as paper recommendation (Mei et al., 2022), author recommendation (Alhoori & Furuta, 2017), journal recommendation (Gรผndoฤan et al., 2023) and so on. Among them, citation recommendation has arisen researchers' attention, which aims to help people find appropriate and necessary work to cite based on the given user queries. This paper aims to improve citation recommendation by considering the argumentative zoning of the citing sentence. Normally, authors will follow a logical framework when writing scientific papers. For example, the International Committee of Medical Journal Editors (ICMJE) recommends the IMRaD (Introduction, Methods, Results and Discussion) structure in writing and editing guidelines of biomedical publications (Editors & others, 2004). The structure of a research article is designed to present the research work clearly and concisely. This structure also helps to make it easy for readers to understand and evaluate the research.
Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning
Wang, Haining, Clark, Jason, McKelvey, Hannah, Sterman, Leila, Gao, Zheng, Tian, Zuoyu, Kรผbler, Sandra, Liu, Xiaozhong
A vast amount of scholarly work is published daily, yet much of it remains inaccessible to the general public due to dense jargon and complex language. To address this challenge in science communication, we introduce a reinforcement learning framework that fine-tunes a language model to rewrite scholarly abstracts into more comprehensible versions. Guided by a carefully balanced combination of word- and sentence-level accessibility rewards, our language model effectively substitutes technical terms with more accessible alternatives, a task which models supervised fine-tuned or guided by conventional readability measures struggle to accomplish. Our best model adjusts the readability level of scholarly abstracts by approximately six U.S. grade levels -- in other words, from a postgraduate to a high school level. This translates to roughly a 90% relative boost over the supervised fine-tuning baseline, all while maintaining factual accuracy and high-quality language. An in-depth analysis of our approach shows that balanced rewards lead to systematic modifications in the base model, likely contributing to smoother optimization and superior performance. We envision this work as a step toward bridging the gap between scholarly research and the general public, particularly younger readers and those without a college degree.
FrFT based estimation of linear and nonlinear impairments using Vision Transformer
Jiang, Ting, Gao, Zheng, Chen, Yizhao, Hu, Zihe, Tang, Ming
To comprehensively assess optical fiber communication system conditions, it is essential to implement joint estimation of the following four critical impairments: nonlinear signal-to-noise ratio (SNRNL), optical signal-to-noise ratio (OSNR), chromatic dispersion (CD) and differential group delay (DGD). However, current studies only achieve identifying a limited number of impairments within a narrow range, due to limitations in network capabilities and lack of unified representation of impairments. To address these challenges, we adopt time-frequency signal processing based on fractional Fourier transform (FrFT) to achieve the unified representation of impairments, while employing a Transformer based neural networks (NN) to break through network performance limitations. To verify the effectiveness of the proposed estimation method, the numerical simulation is carried on a 5-channel polarization-division-multiplexed quadrature phase shift keying (PDM-QPSK) long haul optical transmission system with the symbol rate of 50 GBaud per channel, the mean absolute error (MAE) for SNRNL, OSNR, CD, and DGD estimation is 0.091 dB, 0.058 dB, 117 ps/nm, and 0.38 ps, and the monitoring window ranges from 0~20 dB, 10~30 dB, 0~51000 ps/nm, and 0~100 ps, respectively. Our proposed method achieves accurate estimation of linear and nonlinear impairments over a broad range, representing a significant advancement in the field of optical performance monitoring (OPM).
AMAD: Adversarial Multiscale Anomaly Detection on High-Dimensional and Time-Evolving Categorical Data
Gao, Zheng, Guo, Lin, Ma, Chi, Ma, Xiao, Sun, Kai, Xiang, Hang, Zhu, Xiaoqiang, Li, Hongsong, Liu, Xiaozhong
Anomaly detection is facing with emerging challenges in many important industry domains, such as cyber security and online recommendation and advertising. The recent trend in these areas calls for anomaly detection on time-evolving data with high-dimensional categorical features without labeled samples. Also, there is an increasing demand for identifying and monitoring irregular patterns at multiple resolutions. In this work, we propose a unified end-to-end approach to solve these challenges by combining the advantages of Adversarial Autoencoder and Recurrent Neural Network. The model learns data representations cross different scales with attention mechanisms, on which an enhanced two-resolution anomaly detector is developed for both instances and data blocks. Extensive experiments are performed over three types of datasets to demonstrate the efficacy of our method and its superiority over the state-of-art approaches.
Neural Related Work Summarization with a Joint Context-driven Attention Mechanism
Wang, Yongzhen, Liu, Xiaozhong, Gao, Zheng
Conventional solutions to automatic related work summarization rely heavily on human-engineered features. In this paper, we develop a neural data-driven summarizer by leveraging the seq2seq paradigm, in which a joint context-driven attention mechanism is proposed to measure the contextual relevance within full texts and a heterogeneous bibliography graph simultaneously. Our motivation is to maintain the topic coherency between a related work section and its target document, where both the textual and graphic contexts play a big role in characterizing the relationship among scientific publications accurately. Experimental results on a large dataset show that our approach achieves a considerable improvement over a typical seq2seq summarizer and five classical summarization baselines.