gmlp
Pay Attention to MLPs
Transformers have become one of the most important architectural innovations in deep learning and have enabled many breakthroughs over the past few years. Here we propose a simple network architecture, gMLP, based solely on MLPs with gating, and show that it can perform as well as Transformers in key language and vision applications. Our comparisons show that self-attention is not critical for Vision Transformers, as gMLP can achieve the same accuracy. For BERT, our model achieves parity with Transformers on pretraining perplexity and is better on some downstream NLP tasks. On finetuning tasks where gMLP performs worse, making the gMLP model substantially larger can close the gap with Transformers. In general, our experiments show that gMLP can scale as well as Transformers over increased data and compute.
Good Machine Learning Practice for Medical Device Development: Guiding Principles
The U.S. Food and Drug Administration (FDA), Health Canada, and the United Kingdom's Medicines and Healthcare products Regulatory Agency (MHRA) have jointly identified 10 guiding principles that can inform the development of Good Machine Learning Practice (GMLP). These guiding principles will help promote safe, effective, and high-quality medical devices that use artificial intelligence and machine learning (AI/ML). Artificial intelligence and machine learning technologies have the potential to transform health care by deriving new and important insights from the vast amount of data generated during the delivery of health care every day. They use software algorithms to learn from real-world use and in some situations may use this information to improve the product's performance. But they also present unique considerations due to their complexity and the iterative and data-driven nature of their development.
gMLP: What it is and how to use it in practice with Tensorflow and Keras?
It demonstrates near state-of-the-art results on NLP and computer vision tasks while using far fewer trainable parameters than corresponding Transformer models. The most important component of state-of-the-art Transformer architectures is the attention mechanism, which identifies which relationships between data items matter to the neural network. To appreciate the innovation of gMLP, let's first understand what the terms static parameterization and spatial projection mean. As described above, attention mechanisms change dynamically depending on the inputs; gMLP instead mixes information across tokens with a static, learned projection over the spatial (token) dimension.
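The contrast between dynamic attention and static spatial projection can be made concrete with a minimal NumPy sketch of gMLP's Spatial Gating Unit. This is an illustrative simplification, not the paper's exact implementation: the weight names are made up, and layer normalization is reduced to a plain standardization with no learned scale or shift.

```python
import numpy as np

def spatial_gating_unit(x, W_spatial, b_spatial):
    """Sketch of gMLP's Spatial Gating Unit (SGU).

    x:         (n_tokens, d_ffn) activations after the first channel projection
    W_spatial: (n_tokens, n_tokens) -- learned but *input-independent*; unlike
               attention weights, this matrix does not change with the input
    b_spatial: (n_tokens, 1) bias; the paper initializes it near 1 so the
               unit starts out close to an identity mapping
    """
    u, v = np.split(x, 2, axis=-1)           # split channels into two halves
    v = v - v.mean(axis=-1, keepdims=True)   # simplified layer norm
    v = v / (v.std(axis=-1, keepdims=True) + 1e-6)
    gate = W_spatial @ v + b_spatial         # static mixing across tokens
    return u * gate                          # element-wise gating
```

With `W_spatial` at zero and the bias at one (the near-identity initialization), the gate is all ones and the unit simply passes through the first half of the channels.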
From Regulation to Data Aggregation: Three Machine Learning Trends to Watch
For over a decade, we've discussed the potential of machine learning (ML) in clinical research to objectively gather and analyze data, optimize trial design, and accelerate drug development. While the opportunities of these technologies get a lot of buzz, there is still a long way to go when it comes to proving they can deliver on their promise and ensuring their development is sustainable long-term. We now find ourselves at a crossroads: improving confidence in ML among pharmaceutical sponsors and clinicians, while finding alternative ways to keep pace with the data-hungry nature of these algorithms. Three key trends will direct the future of ML: regulatory guidance, an emphasis on model traceability as a means to build trust, and new data aggregation and analysis approaches that may help make ML innovation more practical and cost-effective. Until recently, federal oversight of ML's development has been limited, with developers defining best practices based on their own experience.
gMLP: Winning over Transformers?
Alright, we all know that transformers are cool. At least in NLP, these architectures are considered state-of-the-art (SOTA) for language modelling and perform beautifully on various downstream tasks, such as named entity recognition (NER), question answering (QA), and part-of-speech (POS) tagging. But in this tutorial, we will dive into another architecture, the Gated Multilayer Perceptron (gMLP), proposed by the Google Research team. As mentioned above, transformer architectures are very powerful, and if you want to achieve really high performance on your particular task, you should consider using a pre-trained transformer. You can usually find them on Huggingface.
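Before diving in, here is a rough sketch of one full gMLP block as described in the paper: a channel projection, a GELU nonlinearity, spatial gating across tokens, and a second channel projection, all wrapped in a residual connection. Everything here is a simplified NumPy illustration under assumed shapes; the weight names are hypothetical, and layer norm omits the learned scale and shift.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1 + np.tanh(np.sqrt(2 / np.pi) * (x + 0.044715 * x**3)))

def layer_norm(x, eps=1e-6):
    # simplified: standardize over the channel axis, no learned scale/shift
    mu = x.mean(axis=-1, keepdims=True)
    return (x - mu) / (x.std(axis=-1, keepdims=True) + eps)

def gmlp_block(x, W1, W_spatial, b_spatial, W2):
    """One gMLP block (sketch).

    x:         (n_tokens, d_model)
    W1:        (d_model, d_ffn)       channel expansion
    W_spatial: (n_tokens, n_tokens)   static token-mixing weights
    b_spatial: (n_tokens, 1)
    W2:        (d_ffn // 2, d_model)  channel projection back down
    """
    z = layer_norm(x)
    z = gelu(z @ W1)                          # expand channels
    u, v = np.split(z, 2, axis=-1)            # spatial gating unit:
    gate = W_spatial @ layer_norm(v) + b_spatial  # mix across tokens
    z = (u * gate) @ W2                       # project back to model dim
    return x + z                              # residual connection
```

Stacking blocks like this, with no self-attention anywhere, is the whole gMLP recipe; the only place tokens interact is the static `W_spatial` multiplication.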