AITopics | skim

Collaborating Authors

skim

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization

Bai, Runsheng, Liu, Bo, Liu, Qiang

arXiv.org Artificial IntelligenceDec-7-2024

Large Language Models (LLMs) exhibit impressive performance across various tasks, but deploying them for inference poses challenges. Their high resource demands often necessitate complex, costly multi-GPU pipelines, or the use of smaller, less capable models. While quantization offers a promising solution utilizing lower precision for model storage, existing methods frequently experience significant performance drops at lower precision levels. Additionally, they typically provide only a limited set of solutions at specific bit levels, many of which are extensively manually tuned. To address these challenges, we propose a new method called SKIM: Scaled K-means clustering wIth Mixed precision. Our approach introduces two novel techniques: 1. A greedy algorithm to solve approximately optimal bit allocation across weight channels, and 2. A trainable scaling vector for non-differentiable K-means clustering. These techniques substantially improve performance and can be adapted to any given bit. Notably, in terms of model perplexity, our method narrows the gap between 3-bit quantized LLaMA models and their full precision counterparts by 16.3% on average.

large language model, machine learning, quantization, (18 more...)

arXiv.org Artificial Intelligence

2412.0418

Genre: Research Report > Promising Solution (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Improved selective background Monte Carlo simulation at Belle II with graph attention networks and weighted events

Yu, Boyang, Hartmann, Nikolai, Schinnerl, Luca, Kuhr, Thomas

arXiv.org Artificial IntelligenceJul-12-2023

When measuring rare processes at Belle II, a huge luminosity is required, which means a large number of simulations are necessary to determine signal efficiencies and background contributions. However, this process demands high computation costs while most of the simulated data, in particular in case of background, are discarded by the event selection. Thus, filters using graph neural networks are introduced at an early stage to save the resources for the detector simulation and reconstruction of events discarded at analysis level. In our work, we improved the performance of the filters using graph attention and investigated statistical methods including sampling and reweighting to deal with the biases introduced by the filtering.

artificial intelligence, machine learning, node feature, (12 more...)

arXiv.org Artificial Intelligence

2307.06434

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.30)

Add feedback

Using GitHub as Artifactory for Machine Learning Model Artifacts · Omkar Prabhu

#artificialintelligenceSep-15-2022, 22:30:51 GMT

Note: This blog post is part of my ongoing work on experiments with model training, deployment and monitoring repository bitbeast. If you liked this blog post, please upvote on Hacker News. Last year, I launched Skim with my friends. It is a platform to find, manage and read research papers. The platform is powered by machine learning models for use cases like finding related papers and classifying research areas/tasks of the paper.

artifactory, gaama, github, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

ESOMAR Fusion 2019 - Can machines be emotional? SKIM

#artificialintelligenceNov-8-2019, 15:38:13 GMT

We're looking forward to ESOMAR Fusion 2019 where we'll share our journey with Audeering – a German start-up that develops machine learning to detect emotions in voice – in analyzing'how' people communicate their needs, attitudes and interest. We all know the importance of identifying both rational and emotional consumer needs and drivers of decision-making and this is particularly the case in new product development. However, whilst we have techniques to uncover emotions qualitatively, what about when we need to size the unmet need or opportunity for a new product innovation? Together with Audeering, we had a goal to access their underlying emotions and explore an opportunity or evaluate a new product with greater validity by understanding their emotions in voice.

emotion, esomar fusion 2019, skim

#artificialintelligence

Country:

North America > Dominican Republic > Distrito Nacional > Santo Domingo (0.11)
Europe > Spain > Galicia > Madrid (0.11)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

Søren Hauberg @ CogSys, DTU Compute

#artificialintelligenceJan-18-2017, 05:35:13 GMT

You will be part of the section for Cognitive Systems, which is Denmark's leading group for machine learning research. The group aims for the highest quality research was e.g.

artificial intelligence, dtu compute, ren hauberg, (7 more...)

#artificialintelligence

Country: Europe > Denmark > Capital Region > Copenhagen (0.09)

Technology: Information Technology > Artificial Intelligence (0.59)

Add feedback