AITopics | Liu, Shaobo

Collaborating Authors

Liu, Shaobo

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Privacy-Preserving Hybrid Ensemble Model for Network Anomaly Detection: Balancing Security and Data Protection

Liu, Shaobo, Zhao, Zihao, He, Weijie, Wang, Jiren, Peng, Jing, Ma, Haoyuan

arXiv.org Artificial IntelligenceFeb-13-2025

Privacy-preserving network anomaly detection has become an essential area of research due to growing concerns over the protection of sensitive data. Traditional anomaly de- tection models often prioritize accuracy while neglecting the critical aspect of privacy. In this work, we propose a hybrid ensemble model that incorporates privacy-preserving techniques to address both detection accuracy and data protection. Our model combines the strengths of several machine learning algo- rithms, including K-Nearest Neighbors (KNN), Support Vector Machines (SVM), XGBoost, and Artificial Neural Networks (ANN), to create a robust system capable of identifying network anomalies while ensuring privacy. The proposed approach in- tegrates advanced preprocessing techniques that enhance data quality and address the challenges of small sample sizes and imbalanced datasets. By embedding privacy measures into the model design, our solution offers a significant advancement over existing methods, ensuring both enhanced detection performance and strong privacy safeguards.

data mining, detection, machine learning, (11 more...)

arXiv.org Artificial Intelligence

2502.09001

Country: North America > United States (0.50)

Genre: Research Report > Experimental Study (0.35)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Add feedback

Research on Key Technologies for Cross-Cloud Federated Training of Large Language Models

Yang, Haowei, Sui, Mingxiu, Liu, Shaobo, Qian, Xinyue, Zhang, Zhaoyang, Liu, Bingying

arXiv.org Artificial IntelligenceDec-22-2024

These models have achieved remarkable success in areas such as machine translation, speech recognition, and text generation. However, training these large models typically requires vast computational resources and data, which not only places high demands on the resources of a single cloud platform but can also lead to computational bottlenecks, latency issues, and cost pressures[1]. Cross-cloud federated training has emerged as an effective solution to these challenges. By leveraging the computational resources of multiple cloud platforms, cross-cloud federated training enables distributed processing of large datasets and synchronous model parameter updates, thereby accelerating the training process. The implementation of cross-cloud federated training involves addressing several key technical challenges, including efficiently allocating and managing the computational resources of cloud platforms, optimizing data communication between clouds, and ensuring data privacy and security during the training process[2].

cloud platform, large language model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2410.1913

Country: North America > United States (0.95)

Genre: Research Report (0.64)

Industry:

Information Technology > Services (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

TRIZ Method for Urban Building Energy Optimization: GWO-SARIMA-LSTM Forecasting model

Zheng, Shirong, Liu, Shaobo, Zhang, Zhenhong, Gu, Dian, Xia, Chunqiu, Pang, Huadong, Ampaw, Enock Mintah

arXiv.org Artificial IntelligenceOct-20-2024

With the advancement of global climate change and sustainable development goals, urban building energy consumption optimization and carbon emission reduction have become the focus of research. Traditional energy consumption prediction methods often lack accuracy and adaptability due to their inability to fully consider complex energy consumption patterns, especially in dealing with seasonal fluctuations and dynamic changes. This study proposes a hybrid deep learning model that combines TRIZ innovation theory with GWO, SARIMA and LSTM to improve the accuracy of building energy consumption prediction. TRIZ plays a key role in model design, providing innovative solutions to achieve an effective balance between energy efficiency, cost and comfort by systematically analyzing the contradictions in energy consumption optimization. GWO is used to optimize the parameters of the model to ensure that the model maintains high accuracy under different conditions. The SARIMA model focuses on capturing seasonal trends in the data, while the LSTM model handles short-term and long-term dependencies in the data, further improving the accuracy of the prediction. The main contribution of this research is the development of a robust model that leverages the strengths of TRIZ and advanced deep learning techniques, improving the accuracy of energy consumption predictions. Our experiments demonstrate a significant 15% reduction in prediction error compared to existing models. This innovative approach not only enhances urban energy management but also provides a new framework for optimizing energy use and reducing carbon emissions, contributing to sustainable development.

artificial intelligence, intelligence technology and innovation, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2410.15283

Country: North America > United States (1.00)

Genre: Research Report > Promising Solution (0.86)

Industry:

Information Technology > Security & Privacy (0.93)
Energy > Renewable (0.93)
Energy > Oil & Gas > Upstream (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Balancing Innovation and Privacy: Data Security Strategies in Natural Language Processing Applications

Liu, Shaobo, Liu, Guiran, Zhu, Binrong, Luo, Yuanshuai, Wu, Linxiao, Wang, Rui

arXiv.org Artificial IntelligenceOct-11-2024

This research addresses privacy protection in Natural Language Processing (NLP) by introducing a novel algorithm based on differential privacy, aimed at safeguarding user data in common applications such as chatbots, sentiment analysis, and machine translation. With the widespread application of NLP technology, the security and privacy protection of user data have become important issues that need to be solved urgently. This paper proposes a new privacy protection algorithm designed to effectively prevent the leakage of user sensitive information. By introducing a differential privacy mechanism, our model ensures the accuracy and reliability of data analysis results while adding random noise. This method not only reduces the risk caused by data leakage but also achieves effective processing of data while protecting user privacy. Compared to traditional privacy methods like data anonymization and homomorphic encryption, our approach offers significant advantages in terms of computational efficiency and scalability while maintaining high accuracy in data analysis. The proposed algorithm's efficacy is demonstrated through performance metrics such as accuracy (0.89), precision (0.85), and recall (0.88), outperforming other methods in balancing privacy and utility. As privacy protection regulations become increasingly stringent, enterprises and developers must take effective measures to deal with privacy risks. Our research provides an important reference for the application of privacy protection technology in the field of NLP, emphasizing the need to achieve a balance between technological innovation and user privacy. In the future, with the continuous advancement of technology, privacy protection will become a core element of data-driven applications and promote the healthy development of the entire industry.

machine learning, natural language, privacy protection, (18 more...)

arXiv.org Artificial Intelligence

2410.08553

Country: North America > United States > California (0.47)

Genre: Research Report > Experimental Study (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Applying Hybrid Graph Neural Networks to Strengthen Credit Risk Analysis

Sun, Mengfang, Sun, Wenying, Sun, Ying, Liu, Shaobo, Jiang, Mohan, Xu, Zhen

arXiv.org Artificial IntelligenceOct-5-2024

This paper presents a novel approach to credit risk prediction by employing Graph Convolutional Neural Networks (GCNNs) to assess the creditworthiness of borrowers. Leveraging the power of big data and artificial intelligence, the proposed method addresses the challenges faced by traditional credit risk assessment models, particularly in handling imbalanced datasets and extracting meaningful features from complex relationships. The paper begins by transforming raw borrower data into graph-structured data, where borrowers and their relationships are represented as nodes and edges, respectively. A classic subgraph convolutional model is then applied to extract local features, followed by the introduction of a hybrid GCNN model that integrates both local and global convolutional operators to capture a comprehensive representation of node features. The hybrid model incorporates an attention mechanism to adaptively select features, mitigating issues of over-smoothing and insufficient feature consideration. The study demonstrates the potential of GCNNs in improving the accuracy of credit risk prediction, offering a robust solution for financial institutions seeking to enhance their lending decision-making processes.

artificial intelligence, machine learning, node, (15 more...)

arXiv.org Artificial Intelligence

2410.04283

Country: North America > United States > California (0.14)

Genre: Research Report (1.00)

Industry:

Banking & Finance > Risk Management (1.00)
Banking & Finance > Credit (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Multi-Scenario Combination Based on Multi-Agent Reinforcement Learning to Optimize the Advertising Recommendation System

Zhao, Yang, Zhou, Chang, Cao, Jin, Zhao, Yi, Liu, Shaobo, Cheng, Chiyu, Li, Xingchen

arXiv.org Artificial IntelligenceJul-2-2024

This paper explores multi-scenario optimization on large platforms using multi-agent reinforcement learning (MARL). We address this by treating scenarios like search, recommendation, and advertising as a cooperative, partially observable multi-agent decision problem. We introduce the Multi-Agent Recurrent Deterministic Policy Gradient (MARDPG) algorithm, which aligns different scenarios under a shared objective and allows for strategy communication to boost overall performance. Our results show marked improvements in metrics such as click-through rate (CTR), conversion rate, and total sales, confirming our method's efficacy in practical settings.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2407.02759

Country: North America > United States > California (0.69)

Genre: Research Report > New Finding (0.69)

Industry: Health & Medicine > Therapeutic Area (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.67)

Add feedback

Research on Driver Facial Fatigue Detection Based on Yolov8 Model

Zhou, Chang, Zhao, Yang, Liu, Shaobo, Zhao, Yi, Li, Xingchen, Cheng, Chiyu

arXiv.org Artificial IntelligenceJun-4-2024

In a society where traffic accidents frequently occur, fatigue driving has emerged as a grave issue. Fatigue driving detection technology, especially those based on the YOLOv8 deep learning model, has seen extensive research and application as an effective preventive measure. This paper discusses in depth the methods and technologies utilized in the YOLOv8 model to detect driver fatigue, elaborates on the current research status both domestically and internationally, and systematically introduces the processing methods and algorithm principles for various datasets. This study aims to provide a robust technical solution for preventing and detecting fatigue driving, thereby contributing significantly to reducing traffic accidents and safeguarding lives.

detection, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2406.18575

Country: North America > United States > California (0.47)

Genre: Research Report (0.84)

Industry: Health & Medicine (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Disentangle-based Continual Graph Representation Learning

Kou, Xiaoyu, Lin, Yankai, Liu, Shaobo, Li, Peng, Zhou, Jie, Zhang, Yan

arXiv.org Artificial IntelligenceOct-6-2020

Graph embedding (GE) methods embed nodes (and/or edges) in graph into a low-dimensional semantic space, and have shown its effectiveness in modeling multi-relational data. However, existing GE models are not practical in real-world applications since it overlooked the streaming nature of incoming data. To address this issue, we study the problem of continual graph representation learning which aims to continually train a GE model on new data to learn incessantly emerging multi-relational data while avoiding catastrophically forgetting old learned knowledge. Moreover, we propose a disentangle-based continual graph representation learning (DiCGRL) framework inspired by the human's ability to learn procedural knowledge. The experimental results show that DiCGRL could effectively alleviate the catastrophic forgetting problem and outperform state-of-the-art continual learning models. The code and datasets are released on https://github.com/KXY-PUBLIC/DiCGRL.

artificial intelligence, relational triplet, us government, (19 more...)

arXiv.org Artificial Intelligence

2010.02565

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Add feedback

Exploiting Contextual Information via Dynamic Memory Network for Event Detection

Liu, Shaobo, Cheng, Rui, Yu, Xiaoming, Cheng, Xueqi

arXiv.org Artificial IntelligenceOct-3-2018

The task of event detection involves identifying and categorizing event triggers. Contextual information has been shown effective on the task. However, existing methods which utilize contextual information only process the context once. We argue that the context can be better exploited by processing the context multiple times, allowing the model to perform complex reasoning and to generate better context representation, thus improving the overall performance. Meanwhile, dynamic memory network (DMN) has demonstrated promising capability in capturing contextual information and has been applied successfully to various tasks. In light of the multi-hop mechanism of the DMN to model the context, we propose the trigger detection dynamic memory network (TD-DMN) to tackle the event detection problem.

artificial intelligence, module, neural network, (17 more...)

arXiv.org Artificial Intelligence

1810.03449

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback