AITopics | Lee, Jongho

Collaborating Authors

Lee, Jongho

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Multi-LLM Collaborative Caption Generation in Scientific Documents

Kim, Jaeyoung, Lee, Jongho, Choi, Hong-Jun, Hsu, Ting-Yao, Huang, Chieh-Yang, Kim, Sungchul, Rossi, Ryan, Yu, Tong, Giles, Clyde Lee, Huang, Ting-Hao 'Kenneth', Choi, Sungchul

arXiv.org Artificial IntelligenceJan-5-2025

Scientific figure captioning is a complex task that requires generating contextually appropriate descriptions of visual content. However, existing methods often fall short by utilizing incomplete information, treating the task solely as either an image-to-text or text summarization problem. This limitation hinders the generation of high-quality captions that fully capture the necessary details. Moreover, existing data sourced from arXiv papers contain low-quality captions, posing significant challenges for training large language models (LLMs). In this paper, we introduce a framework called Multi-LLM Collaborative Figure Caption Generation (MLBCAP) to address these challenges by leveraging specialized LLMs for distinct sub-tasks. Our approach unfolds in three key modules: (Quality Assessment) We utilize multimodal LLMs to assess the quality of training data, enabling the filtration of low-quality captions. (Diverse Caption Generation) We then employ a strategy of fine-tuning/prompting multiple LLMs on the captioning task to generate candidate captions. (Judgment) Lastly, we prompt a prominent LLM to select the highest quality caption from the candidates, followed by refining any remaining inaccuracies. Human evaluations demonstrate that informative captions produced by our approach rank better than human-written captions, highlighting its effectiveness. Our code is available at https://github.com/teamreboott/MLBCAP

caption, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2501.02552

Country:

North America > United States > Pennsylvania (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?

de Wynter, Adrian, Watts, Ishaan, Altıntoprak, Nektar Ege, Wongsangaroonsri, Tua, Zhang, Minghui, Farra, Noura, Baur, Lena, Claudet, Samantha, Gajdusek, Pavel, Gören, Can, Gu, Qilong, Kaminska, Anna, Kaminski, Tomasz, Kuo, Ruby, Kyuba, Akiko, Lee, Jongho, Mathur, Kartik, Merok, Petter, Milovanović, Ivana, Paananen, Nani, Paananen, Vesa-Matti, Pavlenko, Anna, Vidal, Bruno Pereira, Strika, Luciano, Tsao, Yueh, Turcato, Davide, Vakhno, Oleksandr, Velcsov, Judit, Vickers, Anna, Visser, Stéphanie, Widarmanto, Herdyan, Zaikin, Andrey, Chen, Si-Qing

arXiv.org Artificial IntelligenceApr-22-2024

Large language models (LLMs) and small language models (SLMs) are being adopted at remarkable speed, although their safety still remains a serious concern. With the advent of multilingual S/LLMs, the question now becomes a matter of scale: can we expand multilingual safety evaluations of these models with the same velocity at which they are deployed? To this end we introduce RTP-LX, a human-transcreated and human-annotated corpus of toxic prompts and outputs in 28 languages. RTP-LX follows participatory design practices, and a portion of the corpus is especially designed to detect culturally-specific toxic language. We evaluate seven S/LLMs on their ability to detect toxic content in a culturally-sensitive, multilingual scenario. We find that, although they typically score acceptably in terms of accuracy, they have low agreement with human judges when judging holistically the toxicity of a prompt, and have difficulty discerning harm in context-dependent scenarios, particularly with subtle-yet-harmful content (e.g. microagressions, bias). We release of this dataset to contribute to further reduce harmful uses of these models and improve their safe deployment.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2404.14397

Country:

Asia (0.93)
North America > United States (0.28)
Europe > Middle East > Malta (0.14)

Genre: Research Report (0.64)

Industry: Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

A Note on LoRA

Fomenko, Vlad, Yu, Han, Lee, Jongho, Hsieh, Stanley, Chen, Weizhu

arXiv.org Artificial IntelligenceApr-7-2024

LoRA (Low-Rank Adaptation) has emerged as a preferred method for efficiently adapting Large Language Models (LLMs) with remarkable simplicity and efficacy. This note extends the original LoRA paper by offering new perspectives that were not initially discussed and presents a series of insights for deploying LoRA at scale. Without introducing new experiments, we aim to improve the understanding and application of LoRA.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2404.05086

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Kim, Taesu, Lee, Jongho, Ahn, Daehyun, Kim, Sarang, Choi, Jiwoong, Kim, Minkyu, Kim, Hyungjun

arXiv.org Artificial IntelligenceFeb-15-2024

We introduce QUICK, a group of novel optimized CUDA kernels for the efficient inference of quantized Large Language Models (LLMs). QUICK addresses the shared memory bank-conflict problem of state-of-the-art mixed precision matrix multiplication kernels. Our method interleaves the quantized weight matrices of LLMs offline to skip the shared memory write-back after the dequantization. We demonstrate up to 1.91x speedup over existing kernels of AutoAWQ on larger batches and up to 1.94x throughput gain on representative LLM models on various NVIDIA GPU devices.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2402.10076

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

DeepRF: Deep Reinforcement Learning Designed RadioFrequency Waveform in MRI

Shin, Dongmyung, Kim, Younghoon, Oh, Chungseok, An, Hongjun, Park, Juhyung, Kim, Jiye, Lee, Jongho

arXiv.org Artificial IntelligenceMay-7-2021

A carefully engineered radiofrequency (RF) pulse plays a key role in a number of systems such as mobile phone, radar, and magnetic resonance imaging (MRI). The design of an RF waveform, however, is often posed as an inverse problem that has no general solution. As a result, various design methods each with a specific purpose have been developed based on the intuition of human experts. In this work, we propose an artificial intelligence-powered RF pulse design framework, DeepRF, which utilizes the self-learning characteristics of deep reinforcement learning (DRL) to generate a novel RF beyond human intuition. Additionally, the method can design various types of RF pulses via customized reward functions. The algorithm of DeepRF consists of two modules: the RF generation module, which utilizes DRL to explore new RF pulses, and the RF refinement module, which optimizes the seed RF pulses from the generation module via gradient ascent. The effectiveness of DeepRF is demonstrated using four exemplary RF pulses, slice-selective excitation pulse, slice-selective inversion pulse, B1-insensitive volume inversion pulse, and B1-insensitive selective inversion pulse, that are commonly used in MRI. The results show that the DeepRF-designed pulses successfully satisfy the design criteria while improving specific absorption rates when compared to those of the conventional RF pulses. Further analyses suggest that the DeepRF-designed pulses utilize new mechanisms of magnetization manipulation that are difficult to be explained by conventional theory, suggesting the potentials of DeepRF in discovering unseen design dimensions beyond human intuition. This work may lay the foundation for an emerging field of AI-driven RF waveform design.

deep learning, neural network, rf pulse, (21 more...)

arXiv.org Artificial Intelligence

2105.03061

Country: North America > United States (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DIFFnet: Diffusion parameter mapping network generalized for input diffusion gradient schemes and bvalues

Park, Juhung, Jung, Woojin, Choi, Eun-Jung, Oh, Se-Hong, Shin, Dongmyung, An, Hongjun, Lee, Jongho

arXiv.org Artificial IntelligenceFeb-4-2021

In MRI, deep neural networks have been proposed to reconstruct diffusion model parameters. However, the inputs of the networks were designed for a specific diffusion gradient scheme (i.e., diffusion gradient directions and numbers) and a specific b-value that are the same as the training data. In this study, a new deep neural network, referred to as DIFFnet, is developed to function as a generalized reconstruction tool of the diffusion-weighted signals for various gradient schemes and b-values. For generalization, diffusion signals are normalized in a q-space and then projected and quantized, producing a matrix (Qmatrix) as an input for the network. To demonstrate the validity of this approach, DIFFnet is evaluated for diffusion tensor imaging (DIFFnetDTI) and for neurite orientation dispersion and density imaging (DIFFnetNODDI). In each model, two datasets with different gradient schemes and b-values are tested. The results demonstrate accurate reconstruction of the diffusion parameters at substantially reduced processing time (approximately 8.7 times and 2240 times faster processing time than conventional methods in DTI and NODDI, respectively; less than 4% mean normalized root-mean-square errors (NRMSE) in DTI and less than 8% in NODDI). The generalization capability of the networks was further validated using reduced numbers of diffusion signals from the datasets. Different from previously proposed deep neural networks, DIFFnet does not require any specific gradient scheme and b-value for its input. As a result, it can be adopted as an online reconstruction tool for various complex diffusion imaging.

deep learning, neural network, noddi, (22 more...)

arXiv.org Artificial Intelligence

2102.02463

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (0.88)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

Add feedback

Nonlinear Dipole Inversion (NDI) enables Quantitative Susceptibility Mapping (QSM) without parameter tuning

Polak, Daniel, Chatnuntawech, Itthi, Yoon, Jaeyeon, Iyer, Siddharth Srinivasan, Lee, Jongho, Bachert, Peter, Adalsteinsson, Elfar, Setsompop, Kawin, Bilgic, Berkin

arXiv.org Machine LearningSep-30-2019

We propose Nonlinear Dipole Inversion (NDI) for high - quality Quantitative Susceptibility Mapping (QSM) without regularization tuning, while matching the image quality of state - of - the - art reconstruction techniques. In addition to avoiding over - smoothing that these techniques often suffer from, we also ob viate the need for parameter selection. NDI is flexible enough to allow for reconstruction from an arbitrary number of head orientations, and outperforms COSMOS even when using as few as 1 - direction data . This is made possible by a nonlinear forward - model that uses the magnitude as an effective prior, for which we derived a simple gradient descent update rule . We synergistically combine this physics - model with a Variational Network (VN) to leverage the power of d eep l earning in the VaNDI algorithm. This technique adopts the simple gradient descent rule from NDI and learns the network parameters during training, hence requires no additional parameter tuning. Further, we evaluate NDI at 7T using highly accelerated Wave - CAIPI acquisition s at 0.5 mm isotropic resolutio n and demonstrate high - quality QSM from as f e w as 2 - direction data .

neural network, neurology, reconstruction, (21 more...)

arXiv.org Machine Learning

1909.13692

Country:

Asia (0.46)
North America > United States > Massachusetts (0.29)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback