AITopics | amla

Collaborating Authors

amla

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AMLA: MUL by ADD in FlashAttention Rescaling

Liao, Qichen, Hu, Chengqiu, Miao, Fangzheng, Li, Bao, Liu, Yiyang, Lyu, Junlong, Jiang, Lirui, Wang, Jun, Zheng, Lingchao, Li, Jun, Fan, Yuwei

arXiv.org Artificial IntelligenceOct-23-2025

Multi-head Latent Attention (MLA) significantly reduces KVCache memory usage in Large Language Models while introducing substantial computational overhead and intermediate variable expansion. This poses challenges for efficient hardware implementation -- especially during the decode phase. This paper introduces Ascend MLA (AMLA), a high-performance kernel specifically optimized for Huawei's Ascend NPUs. AMLA is built on two core innovations: (1) A novel FlashAttention-based algorithm that replaces floating-point multiplications with integer additions for output block rescaling, leveraging binary correspondence between FP32 and INT32 representations; (2) A Preload Pipeline strategy with hierarchical tiling that maximizes FLOPS utilization: the Preload Pipeline achieves Cube-bound performance, while hierarchical tiling overlaps data movement and computation within the Cube core. Experiments show that on Ascend 910 NPUs (integrated in CloudMatrix384), AMLA achieves up to 614 TFLOPS, reaching 86.8% of the theoretical maximum FLOPS, outperforming the state-of-the-art open-source FlashMLA implementation, whose FLOPS utilization is up to 66.7% on NVIDIA H800 SXM5. The AMLA kernel has been integrated into Huawei's CANN and will be released soon.

dependency chain, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2509.25224

Genre: Research Report (0.40)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Review of the AMLAS Methodology for Application in Healthcare

Laher, Shakir, Brackstone, Carla, Reis, Sara, Nguyen, An, White, Sean, Habli, Ibrahim

arXiv.org Artificial IntelligenceSep-1-2022

In recent years, the number of machine learning (ML) technologies gaining regulatory approval for healthcare has increased significantly allowing them to be placed on the market. However, the regulatory frameworks applied to them were originally devised for traditional software, which has largely rule-based behaviour, compared to the data-driven and learnt behaviour of ML. As the frameworks are in the process of reformation, there is a need to proactively assure the safety of ML to prevent patient safety being compromised. The Assurance of Machine Learning for use in Autonomous Systems (AMLAS) methodology was developed by the Assuring Autonomy International Programme based on well-established concepts in system safety. This review has appraised the methodology by consulting ML manufacturers to understand if it converges or diverges from their current safety assurance practices, whether there are gaps and limitations in its structure and if it is fit for purpose when applied to the healthcare domain. Through this work we offer the view that there is clear utility for AMLAS as a safety assurance methodology when applied to healthcare machine learning technologies, although development of healthcare specific supplementary guidance would benefit those implementing the methodology.

manufacturer, requirement, safety requirement, (14 more...)

arXiv.org Artificial Intelligence

2209.00421

Country:

North America > United States (0.94)
Europe > United Kingdom > England (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Government Relations & Public Policy (0.96)
Government > Regional Government > North America Government > United States Government > FDA (0.47)
Government > Regional Government > Europe Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Graduate Software Engineer - AMLA

#artificialintelligenceApr-8-2022, 00:16:03 GMT

We are looking for intellectually curious engineers to help build our AMLA tool. AMLA quantifies the risk of potential money laundering through a "follow the money" approach. Its cutting-edge models identify the key red flags of money laundering, such as uneconomic trading and sudden changes in behaviour. Using a proprietary algorithm, AMLA also reveals hidden connections between our clients to uncover suspicious networks. Experience writing C# is desirable, though demonstrable project experience using any object-oriented programming language would be useful as a platform for success in this role.

amla, graduate software engineer, money laundering

#artificialintelligence

Technology:

Information Technology > Software > Programming Languages (0.64)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.64)

Add feedback