AITopics | Instructional Material

Collaborating Authors

Instructional Material

Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Han, Soyeon Caren, Cao, Feiqi, Poon, Josiah, Navigli, Roberto

arXiv.org Artificial IntelligenceOct-7-2024

This tutorial explores recent advancements in multimodal pretrained and large models, capable of integrating and processing diverse data forms such as text, images, audio, and video. Participants will gain an understanding of the foundational concepts of multimodality, the evolution of multimodal research, and the key technical challenges addressed by these models. We will cover the latest multimodal datasets and pretrained models, including those beyond vision and language. Additionally, the tutorial will delve into the intricacies of multimodal large models and instruction tuning strategies to optimise performance for specific tasks. Hands-on laboratories will offer practical experience with state-of-the-art multimodal models, demonstrating real-world applications like visual storytelling and visual question answering. This tutorial aims to equip researchers, practitioners, and newcomers with the knowledge and skills to leverage multimodal AI. ACM Multimedia 2024 is the ideal venue for this tutorial, aligning perfectly with our goal of understanding multimodal pretrained and large language models, and their tuning mechanisms.

language model, tuning, tutorial, (13 more...)

arXiv.org Artificial Intelligence

2410.05608

Country:

Oceania > Australia > Victoria > Melbourne (0.06)
Oceania > Australia > New South Wales > Sydney (0.05)
North America > United States > New York > New York County > New York City (0.05)
(2 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

Add feedback

DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback

Xiong, Guojun, Dinesha, Ujwal, Mukherjee, Debajoy, Li, Jian, Shakkottai, Srinivas

arXiv.org Machine LearningOct-7-2024

Restless multi-armed bandits (RMAB) has been widely used to model constrained sequential decision making problems, where the state of each restless arm evolves according to a Markov chain and each state transition generates a scalar reward. However, the success of RMAB crucially relies on the availability and quality of reward signals. Unfortunately, specifying an exact reward function in practice can be challenging and even infeasible. In this paper, we introduce Pref-RMAB, a new RMAB model in the presence of preference signals, where the decision maker only observes pairwise preference feedback rather than scalar reward from the activated arms at each decision epoch. Preference feedback, however, arguably contains less information than the scalar reward, which makes Pref-RMAB seemingly more difficult. To address this challenge, we present a direct online preference learning (DOPL) algorithm for Pref-RMAB to efficiently explore the unknown environments, adaptively collect preference data in an online manner, and directly leverage the preference feedback for decision-makings. We prove that DOPL yields a sublinear regret. To our best knowledge, this is the first algorithm to ensure $\tilde{\mathcal{O}}(\sqrt{T\ln T})$ regret for RMAB with preference feedback. Experimental results further demonstrate the effectiveness of DOPL.

algorithm, preference feedback, ref -rmab, (16 more...)

arXiv.org Machine Learning

2410.05527

Country:

North America > United States > New York > Suffolk County > Stony Brook (0.04)
Oceania > New Zealand (0.04)
North America > United States > Texas (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.61)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)

Add feedback

Sequential Transfer in Multi-armed Bandit with Finite Set of Models

Mohammad Gheshlaghi azar, Alessandro Lazaric, Emma Brunskill

Neural Information Processing SystemsOct-6-2024, 10:56:40 GMT

Learning from prior tasks and transferring that experience to improve future performance is critical for building lifelong learning agents. Although results in supervised and reinforcement learning show that transfer may significantly improve the learning performance, most of the literature on transfer is focused on batch learning tasks. In this paper we study the problem of sequential transfer in online learning, notably in the multi-armed bandit framework, where the objective is to minimize the total regret over a sequence of tasks by transferring knowledge from prior tasks. We introduce a novel bandit algorithm based on a method-of-moments approach for estimating the possible tasks and derive regret bounds for it.

algorithm, knowledge, umucb, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Workflow (0.48)
Research Report (0.46)
Instructional Material (0.34)

Industry: Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Take charge of AI before it takes over your job

PCWorldOct-6-2024, 08:00:00 GMT

TL;DR: Get hands-on with AI tools like ChatGPT, Gemini, and GPT-4, and more with these courses on sale for 24.97 through October 27. AI is taking over the world -- but don't worry, you can still be in charge. With this ChatGPT and Gemini AI course bundle for 25, you'll learn to harness the power of AI. This course dives into the generative AI fundamentals, giving you the skills to use tools like ChatGPT, Gemini AI, GPT-4, and even DALL-E 2 for text, image, video, and audio creation. Learn how to leverage AI for productivity gains by automating everything from inbox management to content production.

deep learning, machine learning, take charge, (4 more...)

PCWorld

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (0.98)
Education > Educational Technology > Educational Software > Computer Based Training (0.41)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

VPI-Mlogs: A web-based machine learning solution for applications in petrophysics

Nguyen, Anh Tuan

arXiv.org Artificial IntelligenceOct-6-2024

Machine learning is an important part of the data science field. In petrophysics, machine learning algorithms and applications have been widely approached. In this context, Vietnam Petroleum Institute (VPI) has researched and deployed several effective prediction models, namely missing log prediction, fracture zone and fracture density forecast, etc. As one of our solutions, VPI-MLogs is a web-based deployment platform which integrates data preprocessing, exploratory data analysis, visualisation and model execution. Using the most popular data analysis programming language, Python, this approach gives users a powerful tool to deal with the petrophysical logs section. The solution helps to narrow the gap between common knowledge and petrophysics insights. This article will focus on the web-based application which integrates many solutions to grasp petrophysical data.

application, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.47800/PVJ.2022.10-06

2410.05332

Country: Asia > Vietnam (0.37)

Genre:

Research Report (0.50)
Instructional Material > Online (0.40)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Reasoning with Natural Language Explanations

Valentino, Marco, Freitas, André

arXiv.org Artificial IntelligenceOct-5-2024

Explanation constitutes an archetypal feature of human rationality, underpinning learning and generalisation, and representing one of the media supporting scientific discovery and communication. Due to the importance of explanations in human reasoning, an increasing amount of research in Natural Language Inference (NLI) has started reconsidering the role that explanations play in learning and inference, attempting to build explanation-based NLI models that can effectively encode and use natural language explanations on downstream tasks. Research in explanation-based NLI, however, presents specific challenges and opportunities, as explanatory reasoning reflects aspects of both material and formal inference, making it a particularly rich setting to model and deliver complex reasoning. In this tutorial, we provide a comprehensive introduction to the field of explanation-based NLI, grounding this discussion on the epistemological-linguistic foundations of explanations, systematically describing the main architectural trends and evaluation methodologies that can be used to build systems capable of explanatory reasoning.

explanation, large language model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2410.04148

Country:

Asia > Thailand > Bangkok > Bangkok (0.05)
North America > Mexico > Mexico City > Mexico City (0.04)
Europe > United Kingdom > England > Greater Manchester > Manchester (0.04)
(9 more...)

Genre:

Instructional Material (1.00)
Overview (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Abductive Reasoning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.46)
(2 more...)

Add feedback

Engadget Podcast: Why the Windows 11 2024 update is all about Copilot AI

EngadgetOct-4-2024, 11:30:26 GMT

This week, Microsoft started rolling out the Windows 11 2024 update, but it quickly became clear that the company was far more eager to unveil new features for its Copilot AI and Copilot AI PCs. In this episode, Devindra and Cherlynn chat about Microsoft's current AI priorities, and what it means for people with older PCs. Also, we discuss the death of HoloLens and Microsoft giving up on AR as Meta, Apple and even Snap build for an augmented reality future. Listen below or subscribe on your podcast app of choice. If you've got suggestions or topics you'd like covered on the show, be sure to email us or drop a note in the comments! And be sure to check out our other podcast, Engadget News! Tech debt led to Sonos' disastrous app relaunch, will they be able to win users back? Google is making Gmail summaries more useful and adding a "happening soon" tab to your inbox – 41:11 Harvard students hack together facial recognition for Meta's smart glasses that instantly doxes strangers – 44:00 ...

artificial intelligence, cherlynn, social media, (19 more...)

Engadget

Country:

North America > United States > New York (0.04)
North America > United States > Minnesota (0.04)

Genre:

Personal > Interview (1.00)
Instructional Material (0.93)

Industry:

Media > Film (1.00)
Information Technology > Services (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.34)

Add feedback

Neural Expectation Maximization

Klaus Greff, Sjoerd van Steenkiste, Jürgen Schmidhuber

Neural Information Processing SystemsOct-4-2024, 06:51:02 GMT

Many real world tasks such as reasoning and physical interaction require identification and manipulation of conceptual entities. A first step towards solving these tasks is the automated discovery of distributed symbol-like representations. In this paper, we explicitly formalize this problem as inference in a spatial mixture model where each component is parametrized by a neural network. Based on the Expectation Maximization framework we then derive a differentiable clustering method that simultaneously learns how to group and represent individual entities. We evaluate our method on the (sequential) perceptual grouping task and find that it is able to accurately recover the constituent objects. We demonstrate that the learned representations are useful for next-step prediction.

neural network, representation, rnn-em, (13 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Instructional Material (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

On Uncertainty In Natural Language Processing

Ulmer, Dennis

arXiv.org Artificial IntelligenceOct-4-2024

The last decade in deep learning has brought on increasingly capable systems that are deployed on a wide variety of applications. In natural language processing, the field has been transformed by a number of breakthroughs including large language models, which are used in increasingly many user-facing applications. In order to reap the benefits of this technology and reduce potential harms, it is important to quantify the reliability of model predictions and the uncertainties that shroud their development. This thesis studies how uncertainty in natural language processing can be characterized from a linguistic, statistical and neural perspective, and how it can be reduced and quantified through the design of the experimental pipeline. We further explore uncertainty quantification in modeling by theoretically and empirically investigating the effect of inductive model biases in text classification tasks. The corresponding experiments include data for three different languages (Danish, English and Finnish) and tasks as well as a large set of different uncertainty quantification approaches. Additionally, we propose a method for calibrated sampling in natural language generation based on non-exchangeable conformal prediction, which provides tighter token sets with better coverage of the actual continuation. Lastly, we develop an approach to quantify confidence in large black-box language models using auxiliary predictors, where the confidence is predicted from the input to and generated output text of the target model alone.

artificial intelligence, chatbot, large language model, (24 more...)

arXiv.org Artificial Intelligence

2410.03446

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > New York > New York County > New York City (0.13)
Europe > Denmark > Capital Region > Copenhagen (0.13)
(71 more...)

Genre:

Summary/Review (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
(3 more...)

Industry:

Transportation (1.00)
Law (1.00)
Information Technology (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(11 more...)

Add feedback

Latent Action Priors From a Single Gait Cycle Demonstration for Online Imitation Learning

Hausdörfer, Oliver, von Rohr, Alexander, Lefort, Éric, Schoellig, Angela

arXiv.org Artificial IntelligenceOct-4-2024

Deep Reinforcement Learning (DRL) in simulation often results in brittle and unrealistic learning outcomes. To push the agent towards more desirable solutions, prior information can be injected in the learning process through, for instance, reward shaping, expert data, or motion primitives. We propose an additional inductive bias for robot learning: latent actions learned from expert demonstration as priors in the action space. We show that these action priors can be learned from only a single open-loop gait cycle using a simple autoencoder. Using these latent action priors combined with established style rewards for imitation in DRL achieves above expert demonstration level of performance and leads to more desirable gaits. Further, action priors substantially improve the performance on transfer tasks, even leading to gait transitions for higher target speeds. Videos and code are available at https://sites.google.com/view/latent-action-priors.

demonstration, expert demonstration, style reward, (15 more...)

arXiv.org Artificial Intelligence

2410.03246

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > Belgium > Flanders (0.04)

Genre:

Instructional Material > Online (0.40)
Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)

Add feedback