AITopics | Yan, Bin

Collaborating Authors

Yan, Bin

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey

Song, Zirui, Yan, Bin, Liu, Yuhan, Fang, Miao, Li, Mingzhe, Yan, Rui, Chen, Xiuying

arXiv.org Artificial IntelligenceFeb-15-2025

Large Language Models (LLMs) have demonstrated remarkable success in various tasks such as natural language understanding, text summarization, and machine translation. However, their general-purpose nature often limits their effectiveness in domain-specific applications that require specialized knowledge, such as healthcare, chemistry, or legal analysis. To address this, researchers have explored diverse methods to enhance LLMs by integrating domain-specific knowledge. In this survey, we provide a comprehensive overview of these methods, which we categorize into four key approaches: dynamic knowledge injection, static knowledge embedding, modular adapters, and prompt optimization. Each approach offers unique mechanisms to equip LLMs with domain expertise, balancing trade-offs between flexibility, scalability, and efficiency. We discuss how these methods enable LLMs to tackle specialized tasks, compare their advantages and disadvantages, evaluate domain-specific LLMs against general LLMs, and highlight the challenges and opportunities in this emerging field. For those interested in delving deeper into this area, we also summarize the commonly used datasets and benchmarks. To keep researchers updated on the latest studies, we maintain an open-source at: https://github.com/abilliyb/Knowledge_Injection_Survey_Papers, dedicated to documenting research in the field of specialized LLM.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2502.10708

Country: Asia (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.48)

Industry:

Health & Medicine (1.00)
Law (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Dynamic ensemble selection based on Deep Neural Network Uncertainty Estimation for Adversarial Robustness

Qin, Ruoxi, Wang, Linyuan, Du, Xuehui, Chen, Xingyuan, Yan, Bin

arXiv.org Artificial IntelligenceAug-1-2023

The deep neural network has attained significant efficiency in image recognition. However, it has vulnerable recognition robustness under extensive data uncertainty in practical applications. The uncertainty is attributed to the inevitable ambient noise and, more importantly, the possible adversarial attack. Dynamic methods can effectively improve the defense initiative in the arms race of attack and defense of adversarial examples. Different from the previous dynamic method depend on input or decision, this work explore the dynamic attributes in model level through dynamic ensemble selection technology to further protect the model from white-box attacks and improve the robustness. Specifically, in training phase the Dirichlet distribution is apply as prior of sub-models' predictive distribution, and the diversity constraint in parameter space is introduced under the lightweight sub-models to construct alternative ensembel model spaces. In test phase, the certain sub-models are dynamically selected based on their rank of uncertainty value for the final prediction to ensure the majority accurate principle in ensemble robustness and accuracy. Compared with the previous dynamic method and staic adversarial traning model, the presented approach can achieve significant robustness results without damaging accuracy by combining dynamics and diversity property.

artificial intelligence, machine learning, robustness, (18 more...)

arXiv.org Artificial Intelligence

2308.00346

Country: North America > United States > California > Los Angeles County > Long Beach (0.14)

Genre: Research Report (0.83)

Industry:

Information Technology > Security & Privacy (0.88)
Government > Military (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

RF-GNN: Random Forest Boosted Graph Neural Network for Social Bot Detection

Shi, Shuhao, Qiao, Kai, Yang, Jie, Song, Baojie, Chen, Jian, Yan, Bin

arXiv.org Artificial IntelligenceApr-13-2023

However, the existence of automated accounts, also known as social bots, has brought many problems to social media. These bots have been employed to disseminate false information, manipulate elections, and deceive users, resulting in negative societal consequences [1; 2; 3]. Effectively detecting bots on social media plays an important role in protecting user interests and ensuring stable platform operation. Therefore, the accurate detection of bots on social media platforms is becoming increasingly crucial. Random Forest (RF) [4] is a classical algorithm of ensemble learning that can significantly improve the performance of the base classifier, Decision Tree (DT) [5]. Specifically, S sub-training sets are generated by randomly selecting n samples with replacement from the original training set of N samples S times. Then, m features are selected from the M-dimensional features of each sub-training set, and S base classifiers are trained using different sub-training sets. The final classification result is determined by the voting of the base classifiers. Due to its excellent performance, RF has been widely applied in various competitions, such as data mining and financial risk detection, and is also frequently used in social bot detection.

artificial intelligence, machine learning, social media, (19 more...)

arXiv.org Artificial Intelligence

2304.08239

Country: Asia > China (0.14)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (0.64)
Information Technology > Services (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
(2 more...)

Add feedback

Select and Calibrate the Low-confidence: Dual-Channel Consistency based Graph Convolutional Networks

Shi, Shuhao, Chen, Jian, Qiao, Kai, Yang, Shuai, Wang, Linyuan, Yan, Bin

arXiv.org Artificial IntelligenceMay-7-2022

The Graph Convolutional Networks (GCNs) have achieved excellent results in node classification tasks, but the model's performance at low label rates is still unsatisfactory. Previous studies in Semi-Supervised Learning (SSL) for graph have focused on using network predictions to generate soft pseudo-labels or instructing message propagation, which inevitably contains the incorrect prediction due to the over-confident in the predictions. Our proposed Dual-Channel Consistency based Graph Convolutional Networks (DCC-GCN) uses dual-channel to extract embeddings from node features and topological structures, and then achieves reliable low-confidence and high-confidence samples selection based on dual-channel consistency. We further confirmed that the low-confidence samples obtained based on dual-channel consistency were low in accuracy, constraining the model's performance. Unlike previous studies ignoring low-confidence samples, we calibrate the feature embeddings of the low-confidence samples by using the neighborhood's high-confidence samples. Our experiments have shown that the DCC-GCN can more accurately distinguish between low-confidence and high-confidence samples, and can also significantly improve the accuracy of low-confidence samples. We conducted extensive experiments on the benchmark datasets and demonstrated that DCC-GCN is significantly better than state-of-the-art baselines at different label rates.

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10489-023-05110-5

2205.03753

Country: Asia > China (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Adaptive Multi-layer Contrastive Graph Neural Networks

Shi, Shuhao, Xie, Pengfei, Luo, Xu, Qiao, Kai, Wang, Linyuan, Chen, Jian, Yan, Bin

arXiv.org Artificial IntelligenceSep-28-2021

We present Adaptive Multi-layer Contrastive Graph Neural Networks (AMC-GNN), a self-supervised learning framework for Graph Neural Network, which learns feature representations of sample data without data labels. AMC-GNN generates two graph views by data augmentation and compares different layers' output embeddings of Graph Neural Network encoders to obtain feature representations, which could be used for downstream tasks. AMC-GNN could learn the importance weights of embeddings in different layers adaptively through the attention mechanism, and an auxiliary encoder is introduced to train graph contrastive encoders better. The accuracy is improved by maximizing the representation's consistency of positive pairs in the early layers and the final embedding space. Our experiments show that the results can be consistently improved by using the AMC-GNN framework, across four established graph benchmarks: Cora, Citeseer, Pubmed, DBLP citation network datasets, as well as four newly proposed datasets: Co-author-CS, Co-author-Physics, Amazon-Computers, Amazon-Photo.

artificial intelligence, machine learning, representation, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11063-022-11064-5

2109.14159

Country: Asia > China (0.14)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback