AITopics | Zhao, Linxi

Collaborating Authors

Zhao, Linxi

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance

Zhao, Linxi, Deng, Yihe, Zhang, Weitong, Gu, Quanquan

arXiv.org Artificial IntelligenceFeb-13-2024

The advancement of Large Vision-Language Models (LVLMs) has increasingly highlighted the critical issue of their tendency to hallucinate non-existing objects in the images. To address this issue, previous works focused on using specially curated datasets or powerful LLMs (e.g., GPT-3.5) to rectify the outputs of LVLMs. However, these approaches require either expensive training/fine-tuning or API access to advanced LLMs to correct the model's output post-generation. In this paper, we tackle this challenge by introducing a framework called Mitigating hallucinAtion via classifieR-Free guIdaNcE (MARINE), which is both training-free and API-free, and can effectively and efficiently reduce object hallucinations during the generation process. Specifically, MARINE enriches the visual context of LVLMs by integrating existing open-source vision models, and employs classifier-free guidance to incorporate the additional object grounding features to improve the precision of LVLMs' generations. Through comprehensive evaluations across $6$ popular LVLMs with diverse evaluation metrics, we demonstrate the effectiveness of MARINE, which even outperforms existing fine-tuning-based methods. Remarkably, it not only reduces hallucinations but also improves the detailedness of LVLMs' generations, as assessed by GPT-4V.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2402.0868

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Comprehensive Dataset and Automated Pipeline for Nailfold Capillary Analysis

Zhao, Linxi, Tang, Jiankai, Chen, Dongyu, Liu, Xiaohong, Zhou, Yong, Wang, Guangyu, Wang, Yuntao

arXiv.org Artificial IntelligenceDec-10-2023

The introduction of machine learning marks a pivotal shift, presenting Nailfold capillaroscopy is a well-established method for automated medical image analysis as a promising alternative assessing health conditions, but the untapped potential of automated due to its higher accuracy compared to traditional image medical image analysis using machine learning remains processing algorithms[5]. Recent studies have attempted to despite recent advancements. In this groundbreaking use single deep-learning models for tasks such as nailfold study, we present a pioneering effort in constructing a comprehensive capillary segmentation[4, 8], measurement of capillary size dataset--321 images, 219 videos, 68 clinic reports, and density[5], and white cell counting[9]. Despite notable with expert annotations--that serves as a crucial resource achievements, the untapped potential of automated medical for training deep-learning models. Leveraging this image analysis persists due to the urgent need for annotated dataset, we propose an end-to-end nailfold capillary analysis and extensive datasets essential for effective training and pipeline capable of automatically detecting and measuring diverse fine-tuning deep neural networks.

artificial intelligence, capillaroscopy, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2312.0593

Country: Asia > China (0.31)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Diagnostic Medicine (0.71)
Health & Medicine > Consumer Health (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback