
Collaborating Authors: Zhao, Tian


Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions

arXiv.org Artificial Intelligence

Recent advances in deep learning have revolutionized clinical diagnosis and treatment, offering novel approaches that improve diagnostic precision and treatment efficacy across diverse clinical domains and driving the pursuit of precision medicine. The growing availability of multi-organ, multimodal datasets has accelerated the development of large-scale Medical Multimodal Foundation Models (MMFMs). These models, known for their strong generalization capabilities and rich representational power, are increasingly being adapted to a wide range of clinical tasks, from early diagnosis to personalized treatment strategies. This review offers a comprehensive analysis of recent developments in MMFMs, focusing on three key aspects: datasets, model architectures, and clinical applications. We also explore the challenges and opportunities in optimizing multimodal representations, and we discuss how these advances are shaping the future of healthcare by enabling improved patient outcomes and more efficient clinical workflows.
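
As a concrete illustration of the multimodal representations the review discusses, here is a minimal, hypothetical two-tower sketch in PyTorch (all class names, dimensions, and variables below are invented for illustration and are not taken from the review): image and report-text features are projected into a shared embedding space where paired examples can be compared, a common pattern behind many MMFMs.

```python
# Hypothetical two-tower multimodal encoder sketch (not the architecture
# surveyed in the review). Precomputed image and clinical-text features are
# projected into one shared space so paired examples can be aligned.
import torch
import torch.nn as nn

class TwoTowerMMFM(nn.Module):
    def __init__(self, img_dim=512, txt_dim=768, shared_dim=256):
        super().__init__()
        # Stand-ins for heads on top of pretrained encoders
        # (e.g., a ViT for images, a BERT-style model for reports).
        self.img_proj = nn.Linear(img_dim, shared_dim)
        self.txt_proj = nn.Linear(txt_dim, shared_dim)

    def forward(self, img_feat, txt_feat):
        # L2-normalize so that dot products are cosine similarities.
        z_img = nn.functional.normalize(self.img_proj(img_feat), dim=-1)
        z_txt = nn.functional.normalize(self.txt_proj(txt_feat), dim=-1)
        return z_img, z_txt

model = TwoTowerMMFM()
img = torch.randn(4, 512)   # batch of precomputed image features
txt = torch.randn(4, 768)   # batch of precomputed report features
z_img, z_txt = model(img, txt)
logits = z_img @ z_txt.t()  # pairwise image/text alignment scores
print(logits.shape)         # torch.Size([4, 4])
```

In the CLIP-style version of this pattern, the diagonal of the logits matrix holds the matched image/report pairs, and a contrastive loss pulls those scores up while pushing the off-diagonal mismatches down.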


GCtx-UNet: Efficient Network for Medical Image Segmentation

arXiv.org Artificial Intelligence

Automated medical image segmentation is critical to the prevention, diagnosis, progression monitoring, and prognosis of various diseases, as well as to quantitative pathology assessment. U-shaped deep neural networks, which comprise an encoder, a decoder, and skip connections, are now the most widely used methods for medical image segmentation. Although U-shaped networks have achieved state-of-the-art performance on numerous medical image segmentation tasks, they still have limitations. One primary limitation is the encoder's limited ability to effectively extract and integrate long-range and local features. Methods based on Convolutional Neural Networks (CNNs), such as UNet [26] and UNet++ [35], excel at capturing local features but struggle to model long-range dependencies within the data. Transformer-based methods such as Swin-UNet [6] can model long-range pixel relations, but they lack the spatial inductive bias needed to model local information, which leads to unsatisfactory results. Past research has explored CNN-Transformer hybrid architectures such as TransUnet [8] to capture both global and local information, but these models often significantly increase the number of parameters.
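
For readers unfamiliar with the U-shaped pattern this paragraph describes, the following is a deliberately tiny PyTorch sketch (far simpler than UNet or GCtx-UNet, and not taken from the paper): an encoder downsamples, a decoder upsamples, and a skip connection concatenates encoder features into the decoder so fine local detail survives alongside coarser context.

```python
# Minimal hypothetical U-shaped segmentation network illustrating the
# encoder-decoder-skip pattern; real UNet variants stack many more levels.
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    def __init__(self, in_ch=1, num_classes=2):
        super().__init__()
        self.enc1 = conv_block(in_ch, 16)
        self.enc2 = conv_block(16, 32)
        self.pool = nn.MaxPool2d(2)
        self.up   = nn.ConvTranspose2d(32, 16, 2, stride=2)
        self.dec1 = conv_block(32, 16)   # 32 = 16 (skip) + 16 (upsampled)
        self.head = nn.Conv2d(16, num_classes, 1)

    def forward(self, x):
        e1 = self.enc1(x)              # local features at full resolution
        e2 = self.enc2(self.pool(e1))  # coarser, wider-context features
        d1 = self.up(e2)               # upsample back to input resolution
        d1 = self.dec1(torch.cat([d1, e1], dim=1))  # skip connection
        return self.head(d1)           # per-pixel class logits

net = TinyUNet()
print(net(torch.randn(1, 1, 64, 64)).shape)  # torch.Size([1, 2, 64, 64])
```

The convolutions here are exactly the local-feature extractors the paragraph credits CNNs with; the long-range modeling that Transformer and hybrid variants add would replace or augment the bottleneck (`enc2`) with attention layers.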


Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark

arXiv.org Machine Learning

The deep learning community has proposed optimizations spanning hardware, software, and learning theory to improve the computational performance of deep learning workloads. While some of these optimizations perform the same operations faster (e.g., switching from an NVIDIA K80 to a P100), many modify the semantics of the training procedure (e.g., large-minibatch training, reduced precision), which can affect a model's generalization ability. Due to a lack of standard evaluation criteria that consider these trade-offs, it has become increasingly difficult to compare these different advances. To address this shortcoming, DAWNBench and the upcoming MLPerf benchmarks use time-to-accuracy as the primary metric for evaluation, with the accuracy threshold set close to the state of the art and measured on a held-out dataset not used in training; the goal is to train to this accuracy threshold as fast as possible. In DAWNBench, the winning entries improved time-to-accuracy on ImageNet by two orders of magnitude over the seed entries. Despite this progress, it is unclear how sensitive time-to-accuracy is to the chosen threshold, how much it varies between independent training runs, and how well models optimized for time-to-accuracy generalize. In this paper, we provide evidence that time-to-accuracy has a low coefficient of variation and that models tuned for it generalize nearly as well as pre-trained models. We additionally analyze the winning entries to understand the sources of these speedups and give recommendations for future benchmarking efforts.
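
The time-to-accuracy metric and the run-to-run variability claim are straightforward to state in code. Below is a small illustrative sketch (the epoch logs, threshold, and numbers are invented, not DAWNBench data): each run is scored by the first point at which its held-out accuracy crosses the threshold, and the coefficient of variation is computed across independent runs.

```python
# Illustrative computation of time-to-accuracy and its coefficient of
# variation across runs; all logs and values below are made up.
import statistics

def time_to_accuracy(epoch_log, threshold):
    """epoch_log: list of (elapsed_seconds, held_out_accuracy) per epoch.
    Returns the elapsed time at the first epoch reaching the threshold,
    or None if the run never gets there."""
    for elapsed, acc in epoch_log:
        if acc >= threshold:
            return elapsed
    return None

# Three hypothetical independent training runs of the same entry.
runs = [
    [(110, 0.80), (205, 0.91), (310, 0.94)],
    [(105, 0.82), (198, 0.91), (295, 0.95)],
    [( 98, 0.79), (201, 0.92), (305, 0.93)],
]

ttas = [time_to_accuracy(run, threshold=0.90) for run in runs]
cv = statistics.stdev(ttas) / statistics.mean(ttas)
print(ttas, f"CV = {cv:.3f}")  # [205, 198, 201] CV = 0.017
```

A low coefficient of variation like the one in this toy example is what makes time-to-accuracy usable as a benchmark metric: a single measured run is then a reasonable proxy for an entry's typical performance.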