Zhang, Zhibo
Corporate Fraud Detection in Rich-yet-Noisy Financial Graph
Wang, Shiqi, Zhang, Zhibo, Fang, Libing, Nguyen, Cam-Tu, Li, Wenzhong
Corporate fraud detection aims to automatically recognize companies that conduct wrongful activities such as fraudulent financial statements or illegal insider trading. Previous learning-based methods fail to effectively integrate the rich interactions in the company network. To close this gap, we collect 18 years of financial records in China to form three graph datasets with fraud labels. We analyze the characteristics of the financial graphs and highlight two pronounced issues: (1) information overload: the dominance of (noisy) non-company nodes over company nodes hinders the message-passing process in Graph Convolutional Networks (GCN); and (2) hidden fraud: a large percentage of violations in the collected data may have gone undetected. The hidden fraud problem introduces noisy labels into the training dataset and compromises fraud detection results. The proposed model adopts a two-stage learning method to enhance robustness against hidden fraud.

Introduction: Corporate fraud refers to illegal schemes by listed companies in the stock market, aiming at financial gains through different means such as fraudulent financial statements and illegal insider trading. This kind of fraud bears systematic risks, which can potentially lead to financial crises at the macro level [1]. Unfortunately, the rapid growth of young capital markets has given rise to an increasing number of fraudulent cases in recent years, putting pressure on regulators and auditors. Since traditional human supervision is no longer efficient, it is desirable to build an autonomous system to assist regulators in this essential task. Existing machine-learning models classify annual financial statements as fraudulent or not, based on expert-chosen feature sets. Unfortunately, the rich interactions in the company network have not been effectively integrated for corporate fraud detection. Financial experts, on the other hand, have recognized the influence of "Directors / Supervisors / Executives (DSE)" and "Related Party Transactions (RPT)" on corporate fraud (see Figure 1). DSE refers to the members of the director board of a company. Being the decision-making body in a company, the director board is the agent behind most corporate frauds [6]. Connection via DSE also helps companies lower the coordination cost of illegal activities, significantly increasing the likelihood of committing fraud [7]. RPT refers to deals or arrangements between two companies that are joined by a previous business association or share common interests. RPTs, particularly those that go unchecked, carry the risk of financial fraud through various means such as illegal profit transmission [8, 9].
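As a rough illustration of the kind of pipeline described above, the sketch below pairs a minimal two-layer GCN with a two-stage training loop that warms up on all labels and then keeps only small-loss samples, treating high-loss ones as suspected hidden fraud. The class `SimpleGCN`, the small-loss filtering rule, and all hyperparameters are assumptions for illustration, not the paper's actual architecture.

```python
# Hedged sketch: a minimal two-layer GCN plus a two-stage loop that down-weights
# suspected noisy (hidden-fraud) labels. Illustrative assumptions throughout.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleGCN(nn.Module):
    def __init__(self, in_dim, hid_dim, n_classes=2):
        super().__init__()
        self.w1 = nn.Linear(in_dim, hid_dim)
        self.w2 = nn.Linear(hid_dim, n_classes)

    def forward(self, x, adj_norm):
        # adj_norm: symmetrically normalized adjacency D^-1/2 (A + I) D^-1/2
        h = F.relu(adj_norm @ self.w1(x))
        return self.w2(adj_norm @ h)

def normalize_adj(adj):
    adj = adj + torch.eye(adj.size(0))
    deg_inv_sqrt = adj.sum(1).pow(-0.5)
    return deg_inv_sqrt.unsqueeze(1) * adj * deg_inv_sqrt.unsqueeze(0)

def two_stage_train(model, x, adj, y, train_mask, epochs=(50, 100), keep_ratio=0.8):
    adj_norm = normalize_adj(adj)
    opt = torch.optim.Adam(model.parameters(), lr=1e-2)
    # Stage 1: warm-up on all labeled company nodes.
    for _ in range(epochs[0]):
        opt.zero_grad()
        loss = F.cross_entropy(model(x, adj_norm)[train_mask], y[train_mask])
        loss.backward()
        opt.step()
    # Stage 2: keep small-loss samples, treating high-loss ones as possible hidden fraud.
    with torch.no_grad():
        losses = F.cross_entropy(model(x, adj_norm)[train_mask], y[train_mask],
                                 reduction="none")
        kept = losses.argsort()[: int(keep_ratio * losses.numel())]
    idx = train_mask.nonzero(as_tuple=True)[0][kept]
    for _ in range(epochs[1]):
        opt.zero_grad()
        loss = F.cross_entropy(model(x, adj_norm)[idx], y[idx])
        loss.backward()
        opt.step()
```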
POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning
Cheng, Jiawei, Wang, Jingyuan, Zhang, Yichuan, Ji, Jiahao, Zhu, Yuanshao, Zhang, Zhibo, Zhao, Xiangyu
POI representation learning plays a crucial role in handling tasks related to user mobility data. Recent studies have shown that enriching POI representations with multimodal information can significantly enhance their task performance. Previously, the textual information incorporated into POI representations typically involved only POI categories or check-in content, leading to relatively weak textual features in existing methods. In contrast, large language models (LLMs) trained on extensive text data have been found to possess rich textual knowledge. However, leveraging such knowledge to enhance POI representation learning presents two key challenges: first, how to extract POI-related knowledge from LLMs effectively, and second, how to integrate the extracted information to enhance POI representations. To address these challenges, we propose POI-Enhancer, a portable framework that leverages LLMs to improve POI representations produced by classic POI learning models. We first design three specialized prompts to extract semantic information from LLMs efficiently. Then, the Dual Feature Alignment module enhances the quality of the extracted information, while the Semantic Feature Fusion module preserves its integrity. The Cross Attention Fusion module then adaptively integrates this high-quality information into POI representations, and Multi-View Contrastive Learning further injects human-understandable semantic information into these representations. Extensive experiments on three real-world datasets demonstrate the effectiveness of our framework, showing significant improvements across all baseline representations.
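The sketch below shows one plausible form of cross-attention fusion, where a pre-trained POI embedding attends over a handful of LLM-derived semantic features. The module name `CrossAttentionFusion`, the dimensions, and the residual design are assumptions for illustration, not POI-Enhancer's exact components.

```python
# Hedged sketch: fusing LLM-derived text features into a POI embedding via
# cross attention. Dimensions and the single residual block are assumptions.
import torch
import torch.nn as nn

class CrossAttentionFusion(nn.Module):
    def __init__(self, poi_dim=128, llm_dim=4096, n_heads=4):
        super().__init__()
        self.proj = nn.Linear(llm_dim, poi_dim)          # align LLM features to POI space
        self.attn = nn.MultiheadAttention(poi_dim, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(poi_dim)

    def forward(self, poi_emb, llm_feats):
        # poi_emb: (B, poi_dim) from a classic POI model
        # llm_feats: (B, K, llm_dim) semantic features extracted from LLM prompts
        q = poi_emb.unsqueeze(1)                         # POI embedding attends over LLM features
        kv = self.proj(llm_feats)
        fused, _ = self.attn(q, kv, kv)
        return self.norm(poi_emb + fused.squeeze(1))     # residual keeps the original signal

# Toy usage with random tensors standing in for real embeddings:
fusion = CrossAttentionFusion()
out = fusion(torch.randn(32, 128), torch.randn(32, 3, 4096))
print(out.shape)  # torch.Size([32, 128])
```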
Model-Editing-Based Jailbreak against Safety-aligned Large Language Models
Li, Yuxi, Zhang, Zhibo, Wang, Kailong, Shi, Ling, Wang, Haoyu
Large Language Models (LLMs) have transformed numerous fields by enabling advanced natural language interactions but remain susceptible to critical vulnerabilities, particularly jailbreak attacks. Current jailbreak techniques, while effective, often depend on input modifications, making them detectable and limiting their stealth and scalability. This paper presents Targeted Model Editing (TME), a novel white-box approach that bypasses safety filters by minimally altering internal model structures while preserving the model's intended functionalities. TME identifies and removes safety-critical transformations (SCTs) embedded in model matrices, enabling malicious queries to bypass restrictions without input modifications. By analyzing distinct activation patterns between safe and unsafe queries, TME isolates and approximates SCTs through an optimization process. Implemented in the D-LLM framework, our method achieves an average Attack Success Rate (ASR) of 84.86% on four mainstream open-source LLMs, maintaining high performance. Unlike existing methods, D-LLM eliminates the need for specific triggers or harmful response collections, offering a stealthier and more effective jailbreak strategy. This work reveals a covert and robust threat vector in LLM security and emphasizes the need for stronger safeguards in model safety alignment.
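The toy sketch below conveys the general flavor of editing a weight matrix against a "safety direction" estimated from activation differences between unsafe and safe prompts. The helpers `safety_direction` and `edit_weight` and the simple projection rule are illustrative assumptions; they are not the paper's TME optimization or the D-LLM implementation.

```python
# Hedged sketch: remove one estimated safety-critical direction from a weight
# matrix via a rank-1 projection. Toy approximation for illustration only.
import torch

def safety_direction(acts_unsafe, acts_safe):
    # acts_*: (N, d) hidden activations collected at one layer.
    diff = acts_unsafe.mean(0) - acts_safe.mean(0)
    return diff / diff.norm()

def edit_weight(weight, direction):
    # Project the output-side safety direction out of a (d_out, d_in) matrix,
    # so the layer can no longer write along that direction.
    d = direction.unsqueeze(1)                      # (d_out, 1)
    return weight - d @ (d.T @ weight)

# Toy usage with random stand-ins for real activations and weights:
w = torch.randn(64, 64)
v = safety_direction(torch.randn(100, 64), torch.randn(100, 64))
w_edited = edit_weight(w, v)
```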
Research on Older Adults' Interaction with E-Health Interface Based on Explainable Artificial Intelligence
Huang, Xueting, Zhang, Zhibo, Guo, Fusen, Wang, Xianghao, Chi, Kun, Wu, Kexin
This paper proposes a comprehensive mixed-methods framework applied to varied samples of older adults, combining user experience evaluation, usability assessments, and in-depth interviews with the integration of Explainable Artificial Intelligence (XAI) methods. Older adults' experience of interacting with the e-health interface is collected through interviews and transformed into an operable database, and XAI methods are then utilized to explain the collected interview results. The results show that XAI-infused e-health interfaces could play an important role in bridging the age-related digital divide by revealing older adults' preferences when interacting with e-health interfaces. The study also identifies important design factors, such as intuitive visualization and straightforward explanations, that are critical for creating effective Human-Computer Interaction (HCI) tools for older users. Furthermore, it highlights the transformative potential of XAI in e-health interfaces for older users, emphasizing the importance of transparency and understandability in HCI-driven healthcare solutions. The findings have far-reaching implications for the design and development of user-centric e-health technologies intended to increase the overall well-being of older adults.
Reputation-Based Federated Learning Defense to Mitigate Threats in EEG Signal Classification
Zhang, Zhibo, Li, Pengfei, Hammadi, Ahmed Y. Al, Guo, Fusen, Damiani, Ernesto, Yeun, Chan Yeob
This paper presents a reputation-based threat mitigation framework that defends against potential security threats in electroencephalogram (EEG) signal classification during the model aggregation phase of Federated Learning. While EEG signal analysis has attracted attention with the emergence of brain-computer interface (BCI) technology, it is difficult to create efficient learning models for EEG analysis because of the distributed nature of EEG data and the related privacy and security concerns. To address these challenges, the proposed defense framework leverages the Federated Learning paradigm to preserve privacy through collaborative model training with localized data from dispersed sources, and introduces a reputation-based mechanism to mitigate the influence of data poisoning attacks and identify compromised participants. To assess the efficiency of the proposed reputation-based federated learning defense framework, data poisoning attacks based on the risk level of the training data, derived with Explainable Artificial Intelligence (XAI) techniques, are conducted on both publicly available EEG signal datasets and a self-established EEG signal dataset. Experimental results on the poisoned datasets show that the proposed defense methodology performs well in EEG signal classification while reducing the risks associated with security threats.
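A minimal sketch of reputation-weighted aggregation follows: clients whose updates point away from the consensus direction accumulate lower reputation and thus less influence. The cosine-based scoring rule, the update rate, and the function `reputation_aggregate` are illustrative assumptions rather than the paper's exact mechanism.

```python
# Hedged sketch: reputation-weighted federated averaging. The scoring rule and
# exponential reputation update are assumptions for illustration.
import numpy as np

def reputation_aggregate(updates, reputations, lr=0.3):
    # updates: list of flattened client update vectors; reputations: running scores in [0, 1].
    U = np.stack(updates)
    mean_update = U.mean(axis=0)
    cos = U @ mean_update / (np.linalg.norm(U, axis=1) * np.linalg.norm(mean_update) + 1e-12)
    scores = np.clip(cos, 0.0, 1.0)                      # penalize updates pointing away from consensus
    reputations = (1 - lr) * reputations + lr * scores   # exponential reputation update
    weights = reputations / reputations.sum()
    return (weights[:, None] * U).sum(axis=0), reputations

# Toy usage: three honest clients and one poisoned (sign-flipped) client.
rng = np.random.default_rng(0)
honest = [rng.normal(size=10) + 1.0 for _ in range(3)]
updates = honest + [-honest[0]]
agg, rep = reputation_aggregate(updates, np.ones(4))
print(rep)  # the poisoned client's reputation drops below the others
```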
A Robust Adversary Detection-Deactivation Method for Metaverse-oriented Collaborative Deep Learning
Li, Pengfei, Zhang, Zhibo, Al-Sumaiti, Ameena S., Werghi, Naoufel, Yeun, Chan Yeob
The metaverse aims to create a digital environment that transfers the real world onto an online platform supported by large quantities of real-time interactions. Pre-trained Artificial Intelligence (AI) models are demonstrating an increasing capability to help the metaverse achieve excellent responsiveness with negligible delay, and nowadays many large models are collaboratively trained by various participants in a manner named collaborative deep learning (CDL). However, several security weaknesses can threaten the safety of the CDL training process, which might result in fatal attacks on either the pre-trained large model or the local sensitive datasets possessed by individual entities. In CDL, malicious participants can hide among the innocent majority and silently upload deceptive parameters to degrade the model performance, or they can abuse the downloaded parameters to construct a Generative Adversarial Network (GAN) and illegally acquire the private information of others. To address these vulnerabilities, this paper proposes an adversary detection-deactivation method that can limit and isolate the access of potentially malicious participants, and quarantine and disable GAN attacks or the harmful backpropagation of received threatening gradients. A detailed protection analysis has been conducted on a Multiview CDL case, and the results show that the protocol can effectively prevent harmful access through heuristic analysis and can protect the existing model by swiftly checking received gradients using only one low-cost branch with an embedded firewall.
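The sketch below illustrates one low-cost way such a gradient firewall could gate incoming updates, quarantining senders whose gradients are abnormally large or point against a trusted reference. The class `GradientFirewall` and its thresholds are assumptions for illustration, not the protocol analyzed in the paper.

```python
# Hedged sketch: a simple gradient check that quarantines suspicious participants
# before their updates reach the shared model. Thresholds are illustrative.
import numpy as np

class GradientFirewall:
    def __init__(self, norm_limit=10.0, cos_limit=-0.2):
        self.norm_limit = norm_limit
        self.cos_limit = cos_limit
        self.quarantined = set()

    def check(self, sender_id, grad, reference_grad):
        if sender_id in self.quarantined:
            return False                                  # deactivated sender: drop silently
        cos = grad @ reference_grad / (np.linalg.norm(grad) * np.linalg.norm(reference_grad) + 1e-12)
        if np.linalg.norm(grad) > self.norm_limit or cos < self.cos_limit:
            self.quarantined.add(sender_id)               # isolate the suspected adversary
            return False
        return True                                       # safe to apply / backpropagate

firewall = GradientFirewall()
ok = firewall.check("client-7", np.ones(5) * 100, np.ones(5))   # rejected: norm too large
print(ok, firewall.quarantined)
```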
A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail
Zhang, Zhibo, Damiani, Ernesto, Hamadi, Hussam Al, Yeun, Chan Yeob, Taher, Fatma
In recent years, spammers have tried to obfuscate their intent by introducing hybrid spam e-mail that combines image and text parts, which is more challenging to detect than e-mails containing text or images only. The motivation behind this research is to design an effective approach for filtering out hybrid spam e-mails, avoiding situations where traditional text-based or image-based filters fail to detect them. To the best of our knowledge, only a few studies have been conducted with the goal of detecting hybrid spam e-mails. Ordinarily, Optical Character Recognition (OCR) technology is used to eliminate the image parts of spam by transforming images into text. However, although OCR scanning is a very successful technique for processing text-and-image hybrid spam, it is not an effective solution for dealing with large volumes of e-mail because of the CPU power and execution time required to scan e-mail files, and OCR techniques are not always reliable in the transformation process. To address these problems, we propose new late multi-modal fusion training frameworks for a text-and-image hybrid spam e-mail filtering system, compared with classical early-fusion detection frameworks based on the OCR method. A Convolutional Neural Network (CNN) and Continuous Bag of Words (CBOW) were implemented to extract features from the image and text parts of hybrid spam, respectively, and the generated features were fed to a sigmoid layer and Machine Learning based classifiers, including Random Forest (RF), Decision Tree (DT), Naive Bayes (NB) and Support Vector Machine (SVM), to classify the e-mail as ham or spam.
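The sketch below shows one simple late-fusion scheme in this spirit: separate classifiers score pre-extracted image and text features, and their spam probabilities are averaged at decision level. The helper `late_fusion_predict`, the choice of a Random Forest and an SVM per modality, and the plain averaging rule are illustrative assumptions, not the paper's exact framework.

```python
# Hedged sketch: decision-level fusion of per-modality spam classifiers.
# The feature extractors (CNN / CBOW) are assumed to have run already.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

def late_fusion_predict(img_feats, txt_feats, labels, img_test, txt_test):
    img_clf = RandomForestClassifier(n_estimators=100).fit(img_feats, labels)
    txt_clf = SVC(probability=True).fit(txt_feats, labels)
    p_img = img_clf.predict_proba(img_test)[:, 1]
    p_txt = txt_clf.predict_proba(txt_test)[:, 1]
    p_spam = 0.5 * (p_img + p_txt)                 # average the two modalities' spam probabilities
    return (p_spam >= 0.5).astype(int)

# Toy usage with random features standing in for CNN / CBOW outputs:
rng = np.random.default_rng(1)
y = rng.integers(0, 2, 200)
pred = late_fusion_predict(rng.normal(size=(200, 64)), rng.normal(size=(200, 32)), y,
                           rng.normal(size=(20, 64)), rng.normal(size=(20, 32)))
```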
Explainable Label-flipping Attacks on Human Emotion Assessment System
Zhang, Zhibo, Hammadi, Ahmed Y. Al, Damiani, Ernesto, Yeun, Chan Yeob
This paper's main goal is to provide an attacker's perspective on label-flipping attacks against EEG signal-based human emotion assessment systems. To assess human emotions and actions, speech [1] and facial expression [2] data were traditionally employed. On the other hand, EEG signals [3] have been utilized in recent years to assess a person's emotional state and prevent potential industrial insider attacks, because people cannot conceal or manipulate their EEG signals. The dataset was compiled to look at potential applications of brainwave signals for spotting insider threats in the workplace; such insiders, as current or past employees, have inside knowledge of the business's security protocols, customer information, and computer systems. The Emotiv Insight 5-channel headset was the tool used to collect the data from 17 people who gave their consent. The four risk categories found in the risk matrix (High-Risk, Medium-Risk, Low-Risk, and Normal) were used to classify each signal recorded for a captured image, and each signal was then given the appropriate label.
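As a concrete illustration of the attack surface described above, the sketch below flips a fraction of labels in one risk category, which is one simple way a label-flipping poison could be realized on such a dataset. The function `flip_labels`, the targeting rule, and the flip rate are illustrative assumptions, not the paper's attack configuration.

```python
# Hedged sketch: flip a fraction of labels in a chosen risk category.
import numpy as np

def flip_labels(y, target_class, new_class, flip_rate=0.3, seed=0):
    rng = np.random.default_rng(seed)
    y_poisoned = y.copy()
    idx = np.where(y == target_class)[0]
    chosen = rng.choice(idx, size=int(flip_rate * len(idx)), replace=False)
    y_poisoned[chosen] = new_class                 # e.g. relabel High-Risk signals as Normal
    return y_poisoned

# Toy usage on integer risk labels (0 = Normal ... 3 = High-Risk):
y = np.random.default_rng(2).integers(0, 4, 100)
y_poisoned = flip_labels(y, target_class=3, new_class=0)
```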
Explainable Data Poison Attacks on Human Emotion Evaluation Systems based on EEG Signals
Zhang, Zhibo, Umar, Sani, Hammadi, Ahmed Y. Al, Yoon, Sangyoung, Damiani, Ernesto, Ardagna, Claudio Agostino, Bena, Nicola, Yeun, Chan Yeob
The major aim of this paper is to explain, from the attackers' perspective, data poisoning attacks that use label flipping during the training stage of electroencephalogram (EEG) signal-based human emotion evaluation systems deploying Machine Learning models. Human emotion evaluation using EEG signals has consistently attracted a lot of research attention. The identification of human emotional states based on EEG signals is effective for detecting potential internal threats caused by insider individuals. Nevertheless, EEG signal-based human emotion evaluation systems have shown several vulnerabilities to data poisoning attacks. The findings of the experiments demonstrate that the suggested data poisoning attacks succeed regardless of the model, although different models exhibit varying levels of resilience to the attacks. In addition, the data poisoning attacks on the EEG signal-based human emotion evaluation systems are explained with several Explainable Artificial Intelligence (XAI) methods, including Shapley Additive Explanations (SHAP) values, Local Interpretable Model-agnostic Explanations (LIME), and Generated Decision Trees. The code for this paper is publicly available on GitHub.
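The sketch below shows one way SHAP attributions could be compared between a clean and a label-flipped classifier to make the effect of poisoning visible, in the spirit of the XAI analysis above. The helper `mean_abs_shap`, the Random Forest stand-in model, and the synthetic data are illustrative assumptions, not the paper's setup.

```python
# Hedged sketch: compare SHAP feature attributions of clean vs. poisoned models.
import numpy as np
import shap
from sklearn.ensemble import RandomForestClassifier

def mean_abs_shap(model, X):
    values = shap.TreeExplainer(model).shap_values(X)
    if isinstance(values, list):          # older SHAP: one array per class
        values = values[1]
    elif values.ndim == 3:                # newer SHAP: (samples, features, classes)
        values = values[..., 1]
    return np.abs(values).mean(axis=0)    # per-feature importance for the positive class

rng = np.random.default_rng(3)
X, y = rng.normal(size=(300, 10)), rng.integers(0, 2, 300)
y_flipped = y.copy()
y_flipped[:60] = 1 - y_flipped[:60]       # simple label-flipping poison on 20% of samples

clean = RandomForestClassifier(n_estimators=50).fit(X, y)
poisoned = RandomForestClassifier(n_estimators=50).fit(X, y_flipped)
shift = mean_abs_shap(poisoned, X) - mean_abs_shap(clean, X)   # how attributions move under poisoning
print(shift.round(3))
```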
A new database of Houma Alliance Book ancient handwritten characters and classifier fusion approach
Yuan, Xiaoyu, Zhang, Zhibo, Sun, Yabo, Xue, Zekai, Shao, Xiuyan, Huang, Xiaohua
The Houma Alliance Book is one of the national treasures held in the Shanxi Museum in China and has great historical significance for researching ancient history. To date, research on the Houma Alliance Book has remained at the stage of identifying the paper documents, which is inefficient and makes the characters difficult to display, study and publicize. Therefore, digitizing the recognized ancient characters of the Houma Alliance Book can effectively improve the efficiency of recognizing ancient characters and provide more reliable technical support and text data. This paper proposes a new database of Houma Alliance Book ancient handwritten characters and a multi-modal fusion method to recognize them. In the database, 297 classes and 3,547 samples of Houma Alliance Book ancient handwritten characters are collected from the original book collection and from human imitative writing. Furthermore, a decision-level classifier fusion strategy is applied to fuse three well-known deep neural network architectures for ancient handwritten character recognition. Experiments are performed on the new database. The experimental results first provide a baseline for the research community and then demonstrate the effectiveness of the proposed method.
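A minimal sketch of decision-level fusion follows: three backbone networks score each character image and their softmax outputs are averaged. The specific torchvision backbones and the plain averaging rule are assumptions for illustration; the paper's chosen architectures and fusion details may differ.

```python
# Hedged sketch: decision-level fusion by averaging softmax outputs of three backbones.
import torch
import torchvision.models as models

def build_backbones(n_classes=297):
    return [models.resnet18(num_classes=n_classes),
            models.densenet121(num_classes=n_classes),
            models.mobilenet_v2(num_classes=n_classes)]

def fused_predict(nets, images):
    # images: (B, 3, H, W) character images; returns fused class predictions.
    probs = [torch.softmax(net(images), dim=1) for net in nets]
    return torch.stack(probs).mean(dim=0).argmax(dim=1)

# Toy usage with untrained backbones and random images:
nets = [net.eval() for net in build_backbones()]
with torch.no_grad():
    preds = fused_predict(nets, torch.randn(4, 3, 224, 224))
```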