AITopics | Markov Models

Collaborating Authors

Markov Models

News Overviews Instructional Materials AI-Alerts Classics

Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning

Chen, Zaiwei, Clarke, John Paul, Maguluri, Siva Theja

arXiv.org Machine LearningMay-3-2022

The Deep Q -Network (Mnih et al., 2015), as a typical example of Q -learning with function approximation, is one of the most successful algorithms to solve the reinforcement learning (RL) problem, and hence is viewed as a milestone in the development of modern RL. On the other hand, the behavior of Q -learning with function approximation is theoretically not well understood, and was identified in Sutton (1999) as one of four most important theoretical open problems. In fact, the infamous deadly triad (Sutton, 2015) is present in Q -learning with function approximation, and hence even in the basic setting where linear function approximation is used, the algorithm was shown to be unstable in general (Baird, 1995). While theoretically unclear, it was empirically evident from Mnih et al. (2015) that the following three ingredients: experience replay, target network, and truncation together overcome the divergence of Q - learning with function approximation. In this work, we focus on Q -learning with linear function approximation for infinite horizon discounted Markov decision processes (MDPs), and show theoretically that target network together with truncation is sufficient to provably stabilize Q -learning. The main contributions of this work are summarized in the following.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Machine Learning

2203.02628

Country:

North America > Canada > Alberta (0.14)
North America > United States > Texas > Travis County > Austin (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)

Add feedback

Sequential Decision Making - an overview

#artificialintelligenceMay-2-2022, 17:26:22 GMT

Central to many formulations of sequence recognition are problems in sequential decision-making. Typically, a sequence of events is observed through a transformation that introduces uncertainty into the observations, and based on these observations, the recognition process produces a hypothesis of the underlying events. The events in the underlying process are constrained to follow a certain loose order, for example by a grammar, so that decisions made early in the recognition process restrict or narrow the choices that can be made later. This problem is well known and leads to the use of dynamic programming (DP) algorithms [Bel57] so that unalterable decisions can be avoided until all available information has been processed. DP strategies are central to hidden Markov model (HMM) recognizers [LMS84,Lev85,Rab89,RBH86] and have also been widely used in systems based on neural networks (e.g., [SIY 89,Bur88,BW89,SL92,BM90,FLW90]) to transform static pattern classifiers into sequence recognizers.

formulation, recognition, sequence recognition, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Creative Problem Solving in Artificially Intelligent Agents: A Survey and Framework

Gizzi, Evana, Nair, Lakshmi, Chernova, Sonia, Sinapov, Jivko

arXiv.org Artificial IntelligenceApr-21-2022

Creative Problem Solving (CPS) is a sub-area within Artificial Intelligence (AI) that focuses on methods for solving off-nominal, or anomalous problems in autonomous systems. Despite many advancements in planning and learning, resolving novel problems or adapting existing knowledge to a new context, especially in cases where the environment may change in unpredictable ways post deployment, remains a limiting factor in the safe and useful integration of intelligent systems. The emergence of increasingly autonomous systems dictates the necessity for AI agents to deal with environmental uncertainty through creativity. To stimulate further research in CPS, we present a definition and a framework of CPS, which we adopt to categorize existing AI methods in this field. Our framework consists of four main components of a CPS problem, namely, 1) problem formulation, 2) knowledge representation, 3) method of knowledge manipulation, and 4) method of evaluation. We conclude our survey with open research questions, and suggested directions for the future.

artificial intelligence, machine learning, representation, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1613/jair.1.13864

2204.10358

Country:

North America > United States > Massachusetts > Middlesex County > Medford (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > New York (0.04)

Genre: Research Report > Promising Solution (0.67)

Industry:

Education (1.00)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.45)

Add feedback

Deep Learning: Recurrent Neural Networks in Python

#artificialintelligenceApr-20-2022, 15:34:12 GMT

The Recurrent Neural Network (RNN) has been used to obtain state-of-the-art results in sequence modeling. This includes time series analysis, forecasting and natural language processing (NLP). Learn about why RNNs beat old-school machine learning algorithms like Hidden Markov Models. All of the materials required for this course can be downloaded and installed for FREE. We will do most of our work in Numpy, Matplotlib, and Tensorflow.

deep learning, python, recurrent neural network, (1 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

A System for Interactive Examination of Learned Security Policies

Hammar, Kim, Stadler, Rolf

arXiv.org Artificial IntelligenceApr-20-2022

We present a system for interactive examination of learned security policies. It allows a user to traverse episodes of Markov decision processes in a controlled manner and to track the actions triggered by security policies. Similar to a software debugger, a user can continue or or halt an episode at any time step and inspect parameters and probability distributions of interest. The system enables insight into the structure of a given policy and in the behavior of a policy in edge cases. We demonstrate the system with a network intrusion use case. We examine the evolution of an IT infrastructure's state and the actions prescribed by security policies while an attack occurs. The policies for the demonstration have been obtained through a reinforcement learning approach that includes a simulation system where policies are incrementally learned and an emulation system that produces statistics that drive the simulation runs.

attacker, infrastructure, security policy, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/NOMS54207.2022.9789707

2204.01126

Country:

Europe > Sweden (0.05)
Asia > Middle East > Republic of Türkiye > İzmir Province > İzmir (0.05)

Genre: Research Report (0.41)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.41)

Add feedback

Director, Artificial Intelligence (AI) & Machine Learning (ML)

#artificialintelligenceApr-14-2022, 20:44:52 GMT

This Director of AI & ML will be responsible for developing new models and systems to support Key Capture Energy's (KCE) battery storage facilities, as well as work closely with our software development and market operations analytics team to deploy models to production systems and utilize large-scale datasets for model development and optimization. Prior roles should include significant hands-on experience with typical AI/ML tasks such as feature engineering, feature selection, and hyperparameter tuning.

artificial intelligence, machine learning, model and system, (11 more...)

#artificialintelligence

Country:

North America > United States > Utah > Salt Lake County > Salt Lake City (0.07)
North America > United States > Texas > Harris County > Houston (0.07)

Industry: Energy (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.39)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.36)

Add feedback

Program Analysis of Probabilistic Programs

Gorinova, Maria I.

arXiv.org Machine LearningApr-14-2022

Probabilistic programming is a growing area that strives to make statistical analysis more accessible, by separating probabilistic modelling from probabilistic inference. In practice this decoupling is difficult. No single inference algorithm can be used as a probabilistic programming back-end that is simultaneously reliable, efficient, black-box, and general. Probabilistic programming languages often choose a single algorithm to apply to a given problem, thus inheriting its limitations. While substantial work has been done both to formalise probabilistic programming and to improve efficiency of inference, there has been little work that makes use of the available program structure, by formally analysing it, to better utilise the underlying inference algorithm. This dissertation presents three novel techniques (both static and dynamic), which aim to improve probabilistic programming using program analysis. The techniques analyse a probabilistic program and adapt it to make inference more efficient, sometimes in a way that would have been tedious or impossible to do by hand.

data mining, logic & formal reasoning, natural language, (24 more...)

arXiv.org Machine Learning

2204.06868

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > California > San Francisco County > San Francisco (0.13)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Government (0.92)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Software Engineering (1.00)
Information Technology > Security & Privacy (1.00)
(7 more...)

Add feedback

Text Generation with Markov Decision Processes

#artificialintelligenceApr-13-2022, 14:32:45 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. It's free, we don't spam, and we never share your email address.

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)
Information Technology > Artificial Intelligence > Natural Language (0.40)

Add feedback

A quantum generative model for multi-dimensional time series using Hamiltonian learning

Horowitz, Haim, Rao, Pooja, Radha, Santosh Kumar

arXiv.org Machine LearningApr-12-2022

Synthetic data generation has proven to be a promising solution for addressing data availability issues in various domains. Even more challenging is the generation of synthetic time series data, where one has to preserve temporal dynamics, i.e., the generated time series must respect the original relationships between variables across time. Recently proposed techniques such as generative adversarial networks (GANs) and quantum-GANs lack the ability to attend to the time series specific temporal correlations adequately. We propose using the inherent nature of quantum computers to simulate quantum dynamics as a technique to encode such features. We start by assuming that a given time series can be generated by a quantum process, after which we proceed to learn that quantum process using quantum machine learning. We then use the learned model to generate out-of-sample time series and show that it captures unique and complex features of the learned time series. We also study the class of time series that can be modeled using this technique. Finally, we experimentally demonstrate the proposed algorithm on an 11-qubit trapped-ion quantum machine.

artificial intelligence, machine learning, time sery, (18 more...)

arXiv.org Machine Learning

2204.0615

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

A Comprehensive Review of Sign Language Recognition: Different Types, Modalities, and Datasets

Madhiarasan, M., Roy, Partha Pratim

arXiv.org Artificial IntelligenceApr-7-2022

A machine can understand human activities, and the meaning of signs can help overcome the communication barriers between the inaudible and ordinary people. Sign Language Recognition (SLR) is a fascinating research area and a crucial task concerning computer vision and pattern recognition. Recently, SLR usage has increased in many applications, but the environment, background image resolution, modalities, and datasets affect the performance a lot. Many researchers have been striving to carry out generic real-time SLR models. This review paper facilitates a comprehensive overview of SLR and discusses the needs, challenges, and problems associated with SLR. We study related works about manual and non-manual, various modalities, and datasets. Research progress and existing state-of-the-art SLR models over the past decade have been reviewed. Finally, we find the research gap and limitations in this domain and suggest future directions. This review paper will be helpful for readers and researchers to get complete guidance about SLR and the progressive design of the state-of-the-art SLR model

machine learning, natural language, recognition, (19 more...)

arXiv.org Artificial Intelligence

2204.03328

Country:

Asia > India > Uttarakhand > Roorkee (0.04)
Europe > Germany (0.04)
Europe > Greece (0.04)
(11 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.46)

Industry:

Information Technology (1.00)
Health & Medicine (1.00)
Education > Curriculum > Subject-Specific Education (0.79)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
(7 more...)

Add feedback