Gregor, Michal
Overshoot: Taking advantage of future gradients in momentum-based stochastic optimization
Kopal, Jakub, Gregor, Michal, de Leon-Martinez, Santiago, Simko, Jakub
Overshoot is a novel, momentum-based stochastic gradient descent optimization method designed to enhance performance beyond standard and Nesterov's momentum. In conventional momentum methods, gradients from previous steps are aggregated with the gradient at the current model weights before taking a step and updating the model. Rather than calculating the gradient at the current model weights, Overshoot calculates the gradient at model weights shifted in the direction of the current momentum. This sacrifices the immediate benefit of using the gradient w.r.t. the exact current model weights in favor of evaluating it at a point that will likely be more relevant for future updates. We show that incorporating this principle into momentum-based optimizers (SGD with momentum and Adam) results in faster convergence (saving on average at least 15% of steps). Overshoot consistently outperforms both standard and Nesterov's momentum across a wide range of tasks and integrates into popular momentum-based optimizers with zero memory overhead and only a small computational overhead.
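The core idea can be illustrated with a minimal SGD-with-momentum loop in which the gradient is evaluated at weights shifted along the momentum direction. This is a sketch only: the function names and the overshoot factor `gamma` are our illustrative choices, not the paper's exact formulation or hyperparameters.

```python
# Sketch of the Overshoot idea on top of SGD with momentum.
# `gamma` controls how far past the current weights the gradient is
# evaluated (gamma = 0 recovers standard momentum); all names are ours.

def overshoot_sgd(grad, w, steps=100, lr=0.1, mu=0.9, gamma=3.0):
    """Minimize a 1-D loss; gradients are taken at momentum-shifted weights."""
    m = 0.0
    for _ in range(steps):
        w_shifted = w - gamma * lr * m  # look ahead along the momentum
        g = grad(w_shifted)             # gradient at the shifted point
        m = mu * m + g                  # aggregate with past gradients
        w = w - lr * m                  # usual momentum update
    return w

# Example: minimize f(w) = (w - 2)^2, whose gradient is 2 * (w - 2).
w_star = overshoot_sgd(lambda w: 2.0 * (w - 2.0), w=0.0)
# w_star is close to the minimizer 2.0
```

The same shift can be applied inside Adam's gradient evaluation; only the point at which the gradient is computed changes, so no extra state needs to be stored.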
ExU: AI Models for Examining Multilingual Disinformation Narratives and Understanding their Spread
Vasilakes, Jake, Zhao, Zhixue, Vykopal, Ivan, Gregor, Michal, Hyben, Martin, Scarton, Carolina
Addressing online disinformation requires analysing narratives across languages to help fact-checkers and journalists sift through large amounts of data. The ExU project focuses on developing AI-based models for multilingual disinformation analysis, addressing the tasks of rumour stance classification and claim retrieval. We describe the ExU project proposal and summarise the results of a user requirements survey regarding the design of tools to support fact-checking.
Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking
Beňová, Ivana, Košecká, Jana, Gregor, Michal, Tamajka, Martin, Veselý, Marcel, Šimko, Marián
The dominant probing approaches rely on the zero-shot performance of image-text matching tasks to gain a finer-grained understanding of the representations learned by recent multimodal image-language transformer models. The evaluation is carried out on carefully curated datasets focusing on counting, relations, attributes, and others. This work introduces an alternative probing strategy called guided masking. The proposed approach ablates different modalities using masking and assesses the model's ability to predict the masked word with high accuracy. We focus on studying multimodal models that consider regions of interest (ROI) features obtained by object detectors as input tokens. We probe the understanding of verbs using guided masking on ViLBERT, LXMERT, UNITER, and VisualBERT and show that these models can predict the correct verb with high accuracy.

Figure 1: Image from the SVO-Probes dataset (Hendricks and Nematzadeh, 2021). It consists of image-caption pairs, where the sentence either correctly describes the image (positive example) or one aspect of the sentence (subject, verb, or object) does not match the image (negative example). These pairs are used to probe models through zero-shot image-text matching. Example of a positive caption: A person walking on a trail.
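The evaluation protocol behind guided masking can be sketched as a small harness: mask the verb in each caption, ask the model to fill the masked position, and score how often the original verb is recovered. The stub predictor below stands in for a real multimodal transformer (which would also consume the image's ROI features); all function names here are our own illustrative choices, not the paper's code.

```python
# Illustrative guided-masking harness for verb probing. A real run would
# replace `predict_masked` with a query to a multimodal model such as
# ViLBERT, conditioned on both the masked caption and the image features.

def mask_verb(caption, verb, mask_token="[MASK]"):
    """Replace the verb in the caption with a mask token."""
    return caption.replace(verb, mask_token)

def guided_masking_accuracy(examples, predict_masked):
    """Fraction of (caption, verb) pairs where the masked verb is recovered.

    `predict_masked(masked_caption)` returns the model's top token
    for the masked position.
    """
    hits = sum(predict_masked(mask_verb(c, v)) == v for c, v in examples)
    return hits / len(examples)

# Toy stand-in predictor that always answers "walking".
examples = [("A person walking on a trail.", "walking"),
            ("A dog running in the park.", "running")]
acc = guided_masking_accuracy(examples, lambda s: "walking")
# acc == 0.5
```

Masking the text token (or, symmetrically, image regions) isolates one modality's contribution, which is what distinguishes this probe from zero-shot image-text matching.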