AITopics | Metel, Michael R.



Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression

Metel, Michael R., Chen, Boxing, Rezagholizadeh, Mehdi

arXiv.org Artificial Intelligence, Dec-7-2024

Several works have developed eviction policies to remove key-value (KV) pairs from the KV cache for more efficient inference. The focus has been on compressing the KV cache after the input prompt has been processed, for faster token generation. In settings with limited GPU memory, and when the input context is longer than the generation length, we show that by also compressing the KV cache during the input processing phase, larger batch sizes can be used, resulting in significantly higher throughput while still maintaining the original model's accuracy.
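The link between KV cache size and batch size can be illustrated with some rough arithmetic. The sketch below is not the paper's method: the model shape, the 16 GiB budget, and the function names are hypothetical, and it ignores memory used by weights and activations. It only shows that keeping fewer KV pairs per sequence lets more sequences fit in a fixed memory budget.

```python
def kv_cache_bytes_per_sequence(num_layers, num_kv_heads, head_dim,
                                cached_tokens, bytes_per_value=2):
    """Bytes needed to cache keys and values for one sequence.

    The leading factor of 2 accounts for storing both a key and a value
    per token, per head, per layer. bytes_per_value=2 assumes 16-bit storage.
    """
    return 2 * num_layers * num_kv_heads * head_dim * cached_tokens * bytes_per_value


def max_batch_size(memory_budget_bytes, per_sequence_bytes):
    """Largest number of sequences whose KV caches fit in the budget."""
    return memory_budget_bytes // per_sequence_bytes


# Hypothetical model shape and memory budget (assumptions for illustration).
budget = 16 * 1024**3  # 16 GiB reserved for the KV cache
full = kv_cache_bytes_per_sequence(32, 32, 128, cached_tokens=4096)
compressed = kv_cache_bytes_per_sequence(32, 32, 128, cached_tokens=1024)

print(max_batch_size(budget, full))        # batch size with the full cache
print(max_batch_size(budget, compressed))  # batch size keeping 1/4 of the KV pairs
```

Under these assumed numbers, keeping a quarter of the KV pairs allows a batch four times larger. Whether accuracy holds at that compression level is an empirical question that this arithmetic does not address.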

artificial intelligence, large language model, natural language, (18 more...)

arXiv:2412.05693

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)


Draft on the Fly: Adaptive Self-Speculative Decoding using Cosine Similarity

Metel, Michael R., Lu, Peng, Chen, Boxing, Rezagholizadeh, Mehdi, Kobyzev, Ivan

arXiv.org Artificial Intelligence, Oct-1-2024

We present a simple on-the-fly method for faster inference of large language models. Unlike other (self-)speculative decoding techniques, our method does not require fine-tuning or black-box optimization to generate a fixed draft model, relying instead on simple rules to generate varying draft models adapted to the input context. We show empirically that our lightweight algorithm is competitive with the current SOTA for self-speculative decoding, while being a truly plug-and-play method.
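The abstract does not spell out the rules used to build a draft model, so the following is only a guess at the general idea suggested by the title. It assumes that a layer whose output hidden state is nearly parallel to its input changes the representation little, and is therefore a candidate to skip when drafting. The function names and the "skip the most similar layers" rule are hypothetical, not taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two nonzero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def layers_to_skip(layer_inputs, layer_outputs, num_skip):
    """Return indices of the num_skip layers whose output is most similar
    to their input (assumed rule, for illustration only)."""
    sims = [cosine_similarity(x, y) for x, y in zip(layer_inputs, layer_outputs)]
    ranked = sorted(range(len(sims)), key=lambda i: sims[i], reverse=True)
    return sorted(ranked[:num_skip])


# Toy hidden states for three layers (made-up values).
inputs = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
outputs = [[1.0, 0.1], [0.0, 1.0], [1.0, 0.0]]
print(layers_to_skip(inputs, outputs, num_skip=2))
```

Because the similarities are computed from the current hidden states, the selected layers can differ from one input context to the next, which is the sense in which such a draft model would be adaptive.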

draft model, large language model, machine learning, (17 more...)

arXiv:2410.01028

Country: North America > Canada (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.43)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)


Variants of SGD for Lipschitz Continuous Loss Functions in Low-Precision Environments

Metel, Michael R.

arXiv.org Artificial Intelligence, Sep-27-2023

Motivated by neural network training in low-bit floating and fixed-point environments, this work studies the convergence of variants of SGD using adaptive step sizes with computational error. Considering a general stochastic Lipschitz continuous loss function, asymptotic convergence to a Clarke stationary point is proven, as well as non-asymptotic convergence to an approximate stationary point. It is assumed that only an approximation of the loss function's stochastic gradient can be computed, and that the SGD step itself is also computed with error. Different variants of SGD are tested empirically in a variety of low-precision arithmetic environments, where improved test set accuracy is observed compared to SGD for two image recognition tasks.
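To make the setting concrete, here is a minimal sketch of gradient steps in fixed-point arithmetic. It is not one of the paper's variants: it uses a constant step size rather than an adaptive one, a deterministic loss f(w) = |w| (Lipschitz continuous and nonsmooth) rather than a stochastic one, and round-to-nearest quantization. All names and parameter values are assumptions chosen for illustration.

```python
def quantize(x, frac_bits=8):
    """Round x to the nearest multiple of 2**-frac_bits (fixed-point grid)."""
    scale = 1 << frac_bits
    return round(x * scale) / scale


def subgradient_abs(w):
    """A subgradient of f(w) = |w|; 0 is a valid choice at w = 0."""
    if w > 0:
        return 1.0
    if w < 0:
        return -1.0
    return 0.0


def low_precision_sgd(w0, step_size, num_steps, frac_bits=8):
    """Subgradient steps where both the gradient and the update are
    rounded to the fixed-point grid after each operation."""
    w = quantize(w0, frac_bits)
    for _ in range(num_steps):
        g = quantize(subgradient_abs(w), frac_bits)   # error in the gradient
        w = quantize(w - step_size * g, frac_bits)    # error in the step itself
    return w


w_final = low_precision_sgd(w0=1.0, step_size=0.125, num_steps=20)
print(w_final)
```

In this toy case every intermediate value lies exactly on the grid, so rounding introduces no error. With a step size or gradient that is not a multiple of 2**-frac_bits, each call to `quantize` would perturb the iterate, which is the kind of computational error the abstract refers to.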

artificial intelligence, lipschitz continuous loss function, machine learning, (3 more...)

arXiv:2211.04655

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)


Mathematical Challenges in Deep Learning

Nia, Vahid Partovi, Zhang, Guojun, Kobyzev, Ivan, Metel, Michael R., Li, Xinlin, Sun, Ke, Hemati, Sobhan, Asgharian, Masoud, Kong, Linglong, Liu, Wulong, Chen, Boxing

arXiv.org Artificial Intelligence, Mar-24-2023

Deep models have dominated the artificial intelligence (AI) industry since the ImageNet challenge in 2012. The size of deep models has been increasing ever since, which brings new challenges to this field, with applications in cell phones, personal computers, autonomous cars, and wireless base stations. Here we list a set of problems spanning training, inference, generalization bounds, and optimization, with some formalism to communicate these challenges to mathematicians, statisticians, and theoretical computer scientists. This is a subjective view of the research questions in deep learning that benefit the tech industry in the long run.

artificial intelligence, machine learning, mathematical challenge, (16 more...)

arXiv:2303.15464

Country:

North America > United States (0.46)
North America > Canada > Quebec > Montreal (0.15)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)

Industry:

Transportation > Ground > Road (0.34)
Information Technology > Robotics & Automation (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
