AITopics | Undirected Networks

Collaborating Authors

Undirected Networks

News Overviews Instructional Materials AI-Alerts Classics

Clustering With Pairwise Relationships: A Generative Approach

Yu, Yen-Yun, Elhabian, Shireen Y., Whitaker, Ross T.

arXiv.org Machine LearningMay-6-2018

Semi-supervised learning (SSL) has become important in current data analysis applications, where the amount of unlabeled data is growing exponentially and user input remains limited by logistics and expense. Constrained clustering, as a subclass of SSL, makes use of user input in the form of relationships between data points (e.g., pairs of data points belonging to the same class or different classes) and can remarkably improve the performance of unsupervised clustering in order to reflect user-defined knowledge of the relationships between particular data points. Existing algorithms incorporate such user input, heuristically, as either hard constraints or soft penalties, which are separate from any generative or statistical aspect of the clustering model; this results in formulations that are suboptimal and not sufficiently general. In this paper, we propose a principled, generative approach to probabilistically model, without ad hoc penalties, the joint distribution given by user-defined pairwise relations. The proposed model accounts for general underlying distributions without assuming a specific form and relies on expectation-maximization for model fitting. For distributions in a standard form, the proposed approach results in a closed-form solution for updated parameters.

artificial intelligence, machine learning, relation, (17 more...)

arXiv.org Machine Learning

1805.02285

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)

Add feedback

5 Must-Haves On Your Machine Learning Resume - Great Learning

#artificialintelligenceMay-4-2018, 09:01:43 GMT

Companies are today hard-pressed to find good machine learning talent, What they want from the pool of candidates, is one who already comes to the table equipped with the skill-sets, theories and coding ability needed for the task. The skill requirement is not only restricted to the knowledge of machine learning algorithms and when to apply what, but also how to integrate and interface. The core skills required are technical, with a good understanding of mathematics, analytical thinking and problem-solving. The theories of probability are the mainstays of most machine learning algorithms. If you are familiar with probability, you are equipped to deal with the uncertainty of data.

algorithm, artificial intelligence, machine learning, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

Add feedback

Open Loop Execution of Tree-Search Algorithms

Lecarpentier, Erwan, Infantes, Guillaume, Lesire, Charles, Rachelson, Emmanuel

arXiv.org Machine LearningMay-3-2018

In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a distinction between node-wise optimality and state-wise optimality. Finally, we empirically demonstrate that our method achieves a compromise between loss of performance and computational gain.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

1805.01367

Country: Europe > France (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.87)

Add feedback

Approximate Temporal Difference Learning is a Gradient Descent for Reversible Policies

Ollivier, Yann

arXiv.org Machine LearningMay-2-2018

In reinforcement learning, temporal difference (TD) is the most direct algorithm to learn the value function of a policy. For large or infinite state spaces, exact representations of the value function are usually not available, and it must be approximated by a function in some parametric family. However, with \emph{nonlinear} parametric approximations (such as neural networks), TD is not guaranteed to converge to a good approximation of the true value function within the family, and is known to diverge even in relatively simple cases. TD lacks an interpretation as a stochastic gradient descent of an error between the true and approximate value functions, which would provide such guarantees. We prove that approximate TD is a gradient descent provided the current policy is \emph{reversible}. This holds even with nonlinear approximations. A policy with transition probabilities $P(s,s')$ between states is reversible if there exists a function $\mu$ over states such that $\frac{P(s,s')}{P(s',s)}=\frac{\mu(s')}{\mu(s)}$. In particular, every move can be undone with some probability. This condition is restrictive; it is satisfied, for instance, for a navigation problem in any unoriented graph. In this case, approximate TD is exactly a gradient descent of the \emph{Dirichlet norm}, the norm of the difference of \emph{gradients} between the true and approximate value functions. The Dirichlet norm also controls the bias of approximate policy gradient. These results hold even with no decay factor ($\gamma=1$) and do not rely on contractivity of the Bellman operator, thus proving stability of TD even with $\gamma=1$ for reversible policies.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Machine Learning

1805.00869

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

Add feedback

Markov Chain Neural Networks

Awiszus, Maren, Rosenhahn, Bodo

arXiv.org Machine LearningMay-2-2018

In this work we present a modified neural network model which is capable to simulate Markov Chains. We show how to express and train such a network, how to ensure given statistical properties reflected in the training data and we demonstrate several applications where the network produces non-deterministic outcomes. One example is a random walker model, e.g.

artificial intelligence, machine learning, neural network, (15 more...)

arXiv.org Machine Learning

1805.00784

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

The Structure and Function of Complex Networks SIAM Review Vol. 45, No. 2

@machinelearnbotApr-30-2018, 00:50:12 GMT

Journal of Parallel and Distributed Computing 104, 19-35.

constraint-based reasoning, information technology and artificial intelligence conference, vascular disease, (76 more...)

@machinelearnbot

Country:

Asia > China (1.00)
Asia > Middle East (0.45)
Oceania > Australia (0.27)
(15 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (1.00)
Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Industry:

Water & Waste Management > Water Management (1.00)
Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)
(48 more...)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Software Engineering (1.00)
Information Technology > Software (1.00)
(45 more...)

Add feedback

Expectation Optimization with Probabilistic Guarantees in POMDPs with Discounted-sum Objectives

Chatterjee, Krishnendu, Elgyütt, Adrián, Novotný, Petr, Rouillé, Owen

arXiv.org Artificial IntelligenceApr-30-2018

Partially-observable Markov decision processes (POMDPs) with discounted-sum payoff are a standard framework to model a wide range of problems related to decision making under uncertainty. Traditionally, the goal has been to obtain policies that optimize the expectation of the discounted-sum payoff. A key drawback of the expectation measure is that even low probability events with extreme payoff can significantly affect the expectation, and thus the obtained policies are not necessarily risk-averse. An alternate approach is to optimize the probability that the payoff is above a certain threshold, which allows obtaining risk-averse policies, but ignores optimization of the expectation. We consider the expectation optimization with probabilistic guarantee (EOPG) problem, where the goal is to optimize the expectation ensuring that the payoff is above a given threshold with at least a specified probability. We present several results on the EOPG problem, including the first algorithm to solve it.

artificial intelligence, machine learning, probability, (19 more...)

arXiv.org Artificial Intelligence

1804.10601

Country:

Europe > France > Brittany > Ille-et-Vilaine > Rennes (0.04)
Europe > Austria > Vienna (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

From Feature To Paradigm: Deep Learning In Machine Translation

Costa-jussà, Marta R.

Journal of Artificial Intelligence ResearchApr-30-2018

In the last years, deep learning algorithms have highly revolutionized several areas including speech, image and natural language processing. The specific field of Machine Translation (MT) has not remained invariant. Integration of deep learning in MT varies from re-modeling existing features into standard statistical systems to the development of a new architecture. Among the different neural networks, research works use feedforward neural networks, recurrent neural networks and the encoder-decoder schema. These architectures are able to tackle challenges as having low-resources or morphology variations. This manuscript focuses on describing how these neural networks have been integrated to enhance different aspects and models from statistical MT, including language modeling, word alignment, translation, reordering, and rescoring. Then, we report the new neural MT approach together with a description of the foundational related works and recent approaches on using subword, characters and training with multilingual languages, among others. Finally, we include an analysis of the corresponding challenges and future work in using deep learning in MT.

computational linguistic, machine translation, translation, (11 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.11198

AI Access Foundation

11198

Journal of Artificial Intelligence Research

Country:

North America > United States > Maryland > Baltimore (0.14)
Europe > Germany > Berlin (0.05)
Asia > China > Beijing > Beijing (0.05)
(21 more...)

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Memory-augmented Dialogue Management for Task-oriented Dialogue Systems

Zhang, Zheng, Huang, Minlie, Zhao, Zhongzhou, Ji, Feng, Chen, Haiqing, Zhu, Xiaoyan

arXiv.org Artificial IntelligenceApr-30-2018

Dialogue management (DM) decides the next action of a dialogue system according to the current dialogue state, and thus plays a central role in task-oriented dialogue systems. Since dialogue management requires to have access to not only local utterances, but also the global semantics of the entire dialogue session, modeling the long-range history information is a critical issue. To this end, we propose a novel Memory-Augmented Dialogue management model (MAD) which employs a memory controller and two additional memory structures, i.e., a slot-value memory and an external memory. The slot-value memory tracks the dialogue state by memorizing and updating the values of semantic slots (for instance, cuisine, price, and location), and the external memory augments the representation of hidden states of traditional recurrent neural networks through storing more context information. To update the dialogue state efficiently, we also propose slot-level attention on user utterances to extract specific semantic information for each slot. Experiments show that our model can obtain state-of-the-art performance and outperforms existing baselines.

machine learning, natural language, utterance, (18 more...)

arXiv.org Artificial Intelligence

1805.0015

Country:

Asia > China (0.28)
Europe > Spain (0.28)
North America > United States (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Crawling in Rogue's dungeons with (partitioned) A3C

Asperti, Andrea, Cortesi, Daniele, Sovrano, Francesco

arXiv.org Machine LearningApr-29-2018

Rogue is a famous dungeon-crawling video-game of the 80ies, the ancestor of its gender. Rogue-like games are known for the necessity to explore partially observable and always different randomly-generated labyrinths, preventing any form of level replay. As such, they serve as a very natural and challenging task for reinforcement learning, requiring the acquisition of complex, non-reactive behaviors involving memory and planning. In this article we show how, exploiting a version of A3C partitioned on different situations, the agent is able to reach the stairs and descend to the next level in 98% of cases.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

1804.08685

Country: North America > Canada (0.68)

Genre: Research Report (0.84)

Industry: Leisure & Entertainment > Games > Computer Games (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Add feedback