
 Belli, Davide


Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking

arXiv.org Artificial Intelligence

While mobile devices provide ever more compute power, improvements in DRAM bandwidth are much slower. This is unfortunate for large language model (LLM) token generation, which is heavily memory-bound. Previous work has proposed to leverage natural dynamic activation sparsity in ReLU-activated LLMs to reduce effective DRAM bandwidth per token. However, more recent LLMs use SwiGLU instead of ReLU, which results in little inherent sparsity. While SwiGLU activations can be pruned based on magnitude, the resulting sparsity patterns are difficult to predict, rendering previous approaches ineffective. To circumvent this issue, our work introduces Dynamic Input Pruning (DIP): a predictor-free dynamic sparsification approach, which preserves accuracy with minimal fine-tuning. DIP can further use lightweight LoRA adapters to regain some of the performance lost during sparsification. Lastly, we describe a novel cache-aware masking strategy, which considers the cache state and activation magnitude to further increase cache hit rate, improving LLM token rate on mobile devices. DIP outperforms other methods in terms of accuracy, memory, and throughput trade-offs across simulated hardware settings. On Phi-3-Medium, DIP achieves a 46% reduction in memory and a 40% increase in throughput with < 0.1 loss in perplexity.
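
To make the idea concrete, here is a minimal sketch of magnitude-based dynamic pruning in a SwiGLU MLP, plus a cache-biased variant of the mask selection. This illustrates the general technique as described in the abstract, not the paper's actual implementation; all names (`dip_swiglu_mlp`, `keep_fraction`, `cache_bonus`, etc.) are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def dip_swiglu_mlp(x, W_gate, W_up, W_down, keep_fraction=0.25):
    """Illustrative SwiGLU MLP with magnitude-based dynamic input pruning.

    Keeps only the largest-magnitude gated activations per token; the
    corresponding rows of W_down for masked neurons then never need to be
    fetched from DRAM. All names and defaults are assumptions.
    """
    # Standard SwiGLU: h = silu(x W_gate) * (x W_up)
    h = F.silu(x @ W_gate) * (x @ W_up)

    # Predictor-free pruning: rank intermediate activations by magnitude
    # and zero out everything below the per-token top-k threshold.
    k = max(1, int(keep_fraction * h.shape[-1]))
    threshold = h.abs().topk(k, dim=-1).values[..., -1:]
    h = h * (h.abs() >= threshold)

    # Only the surviving rows of W_down contribute; a real kernel would
    # gather just those rows from cache/DRAM instead of a dense matmul.
    return h @ W_down

def cache_aware_mask(magnitudes, in_cache, k, cache_bonus=1.0):
    """Cache-aware neuron selection (sketch): bias the top-k choice
    toward neurons whose weights are already resident in cache,
    trading a little activation magnitude for a higher hit rate.
    `cache_bonus` is an illustrative knob, not a value from the paper.
    """
    score = magnitudes + cache_bonus * in_cache.float()
    idx = score.topk(k, dim=-1).indices
    mask = torch.zeros_like(magnitudes, dtype=torch.bool)
    mask.scatter_(-1, idx, True)
    return mask
```

The key property is that the mask is computed directly from the current activations, so no separate sparsity predictor has to be trained or run, and biasing selection toward already-cached rows reduces DRAM traffic at the same sparsity level.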


GNSS Positioning using Cost Function Regulated Multilateration and Graph Neural Networks

arXiv.org Artificial Intelligence

He obtained his Ph.D. in Electrical Engineering from Eindhoven University of Technology in 2016. His research interests include applications of deep learning in positioning, navigation, and RF signal processing systems. Davide Belli received his M.S. degree in Artificial Intelligence from the University of Amsterdam in 2019. He is currently a Senior Machine Learning Researcher at Qualcomm AI Research. His research interests include deep learning for the visual and RF domains, model personalization, and graph representation learning. Bence Major is a Staff Engineer at Qualcomm AI Research, leading a research team in the use of artificial intelligence for RF sensing and positioning. His research work focuses on non-visual sensory data, such as radar, ultrasound, and wireless signals. He received his M.S. degree in Computer Science from the Budapest University of Technology and Economics. Songwon Jee received his M.S. degree in Electrical Engineering from Stanford University in 2016. He is currently a Senior Staff Engineer in the Location Technology Team at Qualcomm Technology Inc. His research interests include the application of deep learning for location technology involving GNSS, sensors, and wireless technologies. Himanshu Shah received his M.S. and Ph.D. degrees in Electrical Engineering from Arizona State University in 2004 and 2009, respectively.


Chest X-ray Inpainting with Deep Generative Models

arXiv.org Machine Learning

Generative adversarial networks have been successfully applied to inpainting in natural images. However, the current state-of-the-art models have not yet been widely adopted in the medical imaging domain. In this paper, we investigate the performance of three recently published deep learning based inpainting models: context encoders, semantic image inpainting, and the contextual attention model, applied to chest X-rays, as the chest exam is the most commonly performed radiological procedure. We train these generative models on 1.2M 128×128 patches from 60K healthy X-rays, and learn to predict the center 64×64 region in each patch. We test the models on both healthy and abnormal radiographs. We evaluate the results by visual inspection and by comparing PSNR scores. The outputs of the models are in most cases highly realistic. We show that the methods have the potential to enhance and detect abnormalities. In addition, we perform a 2AFC observer study and show that an experienced human observer performs poorly in detecting inpainted regions, particularly those generated by the contextual attention model.
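
The patch setup described above is straightforward to reproduce. Below is a minimal sketch of the data preparation (masking the center 64×64 region of a 128×128 patch) and the PSNR metric mentioned in the abstract; the function names and the [0, 1] intensity range are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def mask_center(patch, hole=64):
    """Zero out the central hole x hole region of a square grayscale patch.

    Returns (masked_input, target_center) for training an inpainting model
    to reconstruct the missing center. `patch` is an HxW array, e.g. 128x128,
    assumed normalized to [0, 1].
    """
    h, w = patch.shape
    top, left = (h - hole) // 2, (w - hole) // 2
    target = patch[top:top + hole, left:left + hole].copy()
    masked = patch.copy()
    masked[top:top + hole, left:left + hole] = 0.0
    return masked, target

def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio between a reconstruction and the truth."""
    mse = np.mean((pred - target) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(max_val ** 2 / mse)

# Example: masked, center = mask_center(patch); score = psnr(model_out, center)
```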