AITopics | matcha

matcha

Information about AI from the News, Publications, and Conferences

Throughput-Optimal Topology Design for Cross-Silo Federated Learning

Neural Information Processing Systems | Feb-10-2026, 20:01:20 GMT

Federated learning (FL) "involves training statistical models over remote devices or siloed data centers, such as mobile phones or hospitals, while keeping data localized" [56] because of privacy concerns or limited communication resources. Hence, clients only communicate with a potentially far-away (e.g., in another continent) orchestrator and do not [...] Recent experimental and theoretical work suggests that, in practice, the first effect has been over-estimated by classic worst-case convergence bounds.

artificial intelligence, machine learning, topology, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

e29b722e35040b88678e25a1ec032a21-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-10-2026, 20:01:09 GMT

Whenever MST and δ-MBST have throughput close to RING, they achieve faster training, as they have better spectral properties. Comparison with MATCHA (Review #2): The reviewer is right that MATCHA [99] selects the important links more frequently.

matcha, natural language, table1, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.33)

Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

Lai, Zhengzhao, Zheng, Youbin, Cai, Zhenyang, Lyu, Haonan, Yang, Jinpu, Liang, Hongqing, Hu, Yan, Wang, Benyou

arXiv.org Artificial Intelligence | Sep-12-2025

Materials characterization is fundamental to acquiring materials information, revealing the processing-microstructure-property relationships that guide material design and optimization. While multimodal large language models (MLLMs) have recently shown promise in generative and predictive tasks within materials science, their capacity to understand real-world characterization imaging data remains underexplored. To bridge this gap, we present MatCha, the first benchmark for materials characterization image understanding, comprising 1,500 questions that demand expert-level domain expertise. MatCha encompasses four key stages of materials research comprising 21 distinct tasks, each designed to reflect authentic challenges faced by materials scientists. Our evaluation of state-of-the-art MLLMs on MatCha reveals a significant performance gap compared to human experts. These models exhibit degradation when addressing questions requiring higher-level expertise and sophisticated visual perception. Simple few-shot and chain-of-thought prompting struggle to alleviate these limitations. These findings highlight that existing MLLMs still exhibit limited adaptability to real-world materials characterization scenarios. We hope MatCha will facilitate future research in areas such as new material discovery and autonomous scientific agents. MatCha is available at https://github.com/FreedomIntelligence/MatCha.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2509.09307

Country: Asia > China (0.46)

Genre: Research Report > New Finding (0.92)

Industry: Materials (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

e29b722e35040b88678e25a1ec032a21-Paper.pdf

Neural Information Processing Systems | Aug-16-2025, 23:20:13 GMT

artificial intelligence, machine learning, overlay, (19 more...)

Neural Information Processing Systems

Country:

Europe > France > Provence-Alpes-Côte d'Azur (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.94)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

e29b722e35040b88678e25a1ec032a21-AuthorFeedback.pdf

Neural Information Processing Systems | Aug-16-2025, 23:20:02 GMT

convergence, matcha, topology, (14 more...)

Neural Information Processing Systems

Country: North America (0.07)

Technology: Information Technology > Artificial Intelligence (0.97)

Adjacent Leader Decentralized Stochastic Gradient Descent

He, Haoze, Wang, Jing, Choromanska, Anna

arXiv.org Artificial Intelligence | May-18-2024

This work focuses on the decentralized deep learning optimization framework. We propose Adjacent Leader Decentralized Gradient Descent (AL-DSGD) for improving final model performance, accelerating convergence, and reducing the communication overhead of decentralized deep learning optimizers. AL-DSGD relies on two main ideas. First, to increase the influence of the strongest learners on the learning system, it assigns weights to neighboring workers according to both their performance and their degree when averaging among them, and it applies a corrective force on the workers dictated by both the currently best-performing neighbor and the neighbor with the maximal degree. Second, to alleviate the deterioration of convergence speed and performance at nodes with lower degrees, AL-DSGD relies on dynamic communication graphs, which effectively allow the workers to communicate with more nodes while keeping node degrees low. Experiments demonstrate that AL-DSGD accelerates the convergence of state-of-the-art decentralized techniques and improves their test performance, especially in communication-constrained environments. We also theoretically prove the convergence of the proposed scheme. Finally, we release to the community a highly general and concise PyTorch-based library for distributed training of deep learning models that supports easy implementation of any distributed deep learning approach ((a)synchronous, (de)centralized).

al-dsgd, algorithm, communication graph, (15 more...)

arXiv.org Artificial Intelligence

2405.11389

Country: North America > United States > New York (0.04)

Genre: Research Report (0.83)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Accelerating Parallel Stochastic Gradient Descent via Non-blocking Mini-batches

He, Haoze, Dube, Parijat

arXiv.org Artificial Intelligence | Nov-9-2022

SOTA decentralized SGD algorithms can overcome the bandwidth bottleneck at the parameter server by using communication collectives like Ring All-Reduce for synchronization. While the parameter updates in distributed SGD may happen asynchronously, there is still a synchronization barrier to make sure that the local training epoch at every learner is complete before the learners can advance to the next epoch. The delay in waiting for the slowest learners (stragglers) remains a problem in the synchronization steps of these state-of-the-art decentralized frameworks. In this paper, we propose the (de)centralized Non-blocking SGD (Non-blocking SGD), which can address the straggler problem in a heterogeneous environment. The main idea of Non-blocking SGD is to split the original batch into mini-batches, then accumulate the gradients and update the model based on the finished mini-batches. The non-blocking idea can be implemented using decentralized algorithms including Ring All-Reduce, D-PSGD, and MATCHA to solve the straggler problem. Moreover, using gradient accumulation to update the model also guarantees convergence and avoids gradient staleness. Run-time analysis with random straggler delays and computational efficiency/throughput of devices is also presented to show the advantage of Non-blocking SGD. Experiments on a suite of datasets and deep learning networks validate the theoretical analyses and demonstrate that Non-blocking SGD speeds up training and hastens convergence. Compared with state-of-the-art decentralized asynchronous algorithms like D-PSGD and MATCHA, Non-blocking SGD takes up to 2x less time to reach the same training loss in a heterogeneous environment.
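
The core idea in the abstract above (split a batch into mini-batches, accumulate gradients, and update from whichever mini-batches have finished) can be illustrated with a minimal single-learner sketch. It is not the authors' implementation: the 1-D least-squares model, the `finished` counter standing in for a synchronization deadline, the averaging rule, and all function names are assumptions made for illustration only.

```python
def minibatch_gradient(w, batch):
    """Gradient of the mean squared error for the 1-D model y = w * x."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)


def split_into_minibatches(batch, size):
    """Cut a batch into consecutive mini-batches of at most `size` samples."""
    return [batch[i:i + size] for i in range(0, len(batch), size)]


def non_blocking_step(w, batch, minibatch_size, finished, lr):
    """Update w using only the first `finished` mini-batches.

    `finished` is a hypothetical stand-in for however many mini-batches
    a learner completed before the synchronization point.
    """
    minibatches = split_into_minibatches(batch, minibatch_size)
    done = minibatches[:finished]
    if not done:
        return w  # nothing finished in time: skip the update
    # Accumulate gradients over the finished mini-batches, then average.
    accumulated = sum(minibatch_gradient(w, mb) for mb in done)
    return w - lr * accumulated / len(done)


# Hypothetical data drawn from y = 3x.
data = [(x, 3.0 * x) for x in (1.0, 2.0, 3.0, 4.0)]
# A fast learner finishes both mini-batches; a straggler finishes only one.
w_fast = non_blocking_step(0.0, data, minibatch_size=2, finished=2, lr=0.01)
w_slow = non_blocking_step(0.0, data, minibatch_size=2, finished=1, lr=0.01)
```

In this sketch the straggler still contributes an update from the work it completed instead of holding up the step, which is the behavior the abstract describes at a high level.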

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2211.00889

Country: North America > United States > New York (0.04)

Genre: Research Report (0.40)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling

Wang, Jianyu, Sahu, Anit Kumar, Yang, Zhouyi, Joshi, Gauri, Kar, Soummya

arXiv.org Machine Learning | May-22-2019

The trade-off between convergence error and communication delays in decentralized stochastic gradient descent (SGD) is dictated by the sparsity of the inter-worker communication graph. In this paper, we propose MATCHA, a decentralized SGD method where we use matching decomposition sampling of the base graph to parallelize inter-worker information exchange so as to significantly reduce communication delay. At the same time, under standard assumptions for any general topology, in spite of the significant reduction of the communication delay, MATCHA maintains the same convergence rate as that of the state-of-the-art in terms of epochs. Experiments on a suite of datasets and deep neural networks validate the theoretical analysis and demonstrate the effectiveness of the proposed scheme as far as reducing communication delays is concerned.
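
As a rough illustration of "matching decomposition sampling", the sketch below splits a base graph into matchings (edge sets in which no worker appears twice, so all links in one matching can communicate in parallel) and then activates a random subset of matchings for one round. The greedy decomposition, the single fixed `activation_prob`, the example graph, and the function names are assumptions of this sketch; the abstract does not specify how the paper builds the decomposition or chooses activation probabilities.

```python
import random


def matching_decomposition(edges):
    """Greedily split an undirected edge list into disjoint matchings."""
    matchings = []  # each entry: (edges in the matching, workers already used)
    for u, v in edges:
        for matched_edges, busy in matchings:
            if u not in busy and v not in busy:
                matched_edges.append((u, v))
                busy.update((u, v))
                break
        else:
            # No existing matching can take this edge: open a new one.
            matchings.append(([(u, v)], {u, v}))
    return [matched_edges for matched_edges, _ in matchings]


def sample_topology(matchings, activation_prob, rng):
    """Independently keep each matching with probability `activation_prob`."""
    active = []
    for matching in matchings:
        if rng.random() < activation_prob:
            active.extend(matching)
    return active


# Hypothetical 4-worker base graph: a ring plus one chord.
base_edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]
matchings = matching_decomposition(base_edges)
rng = random.Random(0)
# Links that exchange parameters in this communication round.
round_edges = sample_topology(matchings, activation_prob=0.5, rng=rng)
```

Because only some matchings are active in a given round, each round uses a sparser graph than the base topology, which is where the reduction in communication delay described above would come from.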

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Machine Learning

1905.09435

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)

France 2.0: understanding Matcha, the wine advisor chatbot - Recast.AI Blog

#artificialintelligence | Mar-16-2018, 03:33:58 GMT

"The first thing we did when working on the conception was focusing on natural language processing. It was mandatory for us that the bot could provide the same experience – in terms of interaction and natural conversation – that a human expert would deliver." Recast.AI's NLP helped us understand the thousands of ways to ask for "a light red wine for aperitif, around 10€", while allowing us to build a quick and adaptable flow. Our API then allows us to translate the data into wine-compliant information and to answer the request accordingly.

artificial intelligence, natural language, wine advisor chatbot, (4 more...)

#artificialintelligence

Country: Europe > France (0.40)

Industry: Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (0.78)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
