Goto

Collaborating Authors

 Overview



The Math of Machine Learning - Berkeley University Textbook

#artificialintelligence

This document is an attempt to provide a summary of the mathematical background needed for an introductory class in machine learning, which at UC Berkeley is known as CS 189/289A. Our assumption is that the reader is already familiar with the basic concepts of multivariable calculus and linear algebra (at the level of UCB Math 53/54). We emphasize that this document is not a replacement for the prerequisite classes. Most subjects presented here are covered rather minimally; we intend to give an overview and point the interested reader to more comprehensive treatments for further details. Note that this document concerns math background for machine learning, not machine learning itself.


The State of Machine Learning in 2019 Analytics Insight

#artificialintelligence

Big changes are happening in the business world and one of these great shifts is directly owing to the contribution of machine learning (ML). The grid of algorithms and statistical models is a revolutionary application of AI. The technology has the ability to learn automatically and bring about changes and improvements from experiences. The self-learning capacity of ML makes it an important ingredient of businesses nowadays. It is used to solve varied problems in an organization and veers it towards the high-paced world of transformation. Machine learning drives the innovative phenomenon of a company to make it excel in an arena of hyper-converged data, mediums, content, and technology.


Machine learning Engineer - TSR Consulting Services, Inc. - New York, NY Dice.com

#artificialintelligence

Department Profile Global Banking Technology (GBT) is a dynamic and fast-paced area within the Firm's WM Technology Division. We are responsible for creating innovative technology solutions for the Private Banking Group (Client), one of the strategic growth areas of the Firm, providing cash management and lending products and services to our WM clients. This includes state-of-the-art technology for a nationwide network of Private Bankers and product specialists who work with Financial Advisors to provide access to products and services such as online banking, cards, deposit products, residential mortgages, securities-based loans and tailored lending. If you are an exceptional individual who is interested in solving complex problems and building sophisticated solutions in a dynamic team environment, GBT is the place for you. Commercial Real Estate Finance (CREF) is a Lending Product offered by *** Bank launched just months ago.


A Survey on Temporal Reasoning for Temporal Information Extraction from Text

Journal of Artificial Intelligence Research

Time is deeply woven into how people perceive, and communicate about the world. Almost unconsciously, we provide our language utterances with temporal cues, like verb tenses, and we can hardly produce sentences without such cues. Extracting temporal cues from text, and constructing a global temporal view about the order of described events is a major challenge of automatic natural language understanding. Temporal reasoning, the process of combining different temporal cues into a coherent temporal view, plays a central role in temporal information extraction. This article presents a comprehensive survey of the research from the past decades on temporal reasoning for automatic temporal information extraction from text, providing a case study on how combining symbolic reasoning with machine learning-based information extraction systems can improve performance. It gives a clear overview of the used methodologies for temporal reasoning, and explains how temporal reasoning can be, and has been successfully integrated into temporal information extraction systems. Based on the distillation of existing work, this survey also suggests currently unexplored research areas. We argue that the level of temporal reasoning that current systems use is still incomplete for the full task of temporal information extraction, and that a deeper understanding of how the various types of temporal information can be integrated into temporal reasoning is required to drive future research in this area.


Q-Search Trees: An Information-Theoretic Approach Towards Hierarchical Abstractions for Agents with Computational Limitations

arXiv.org Artificial Intelligence

In this paper, we develop a framework to obtain graph abstractions for decision-making by an agent where the abstractions emerge as a function of the agent's limited computational resources. We discuss the connection of the proposed approach with information-theoretic signal compression, and formulate a novel optimization problem to obtain tree-based abstractions as a function of the agent's computational resources. The structural properties of the new problem are discussed in detail, and two algorithmic approaches are proposed to obtain solutions to this optimization problem. We discuss the quality of, and prove relationships between, solutions obtained by the two proposed algorithms. The framework is demonstrated to generate a hierarchy of abstractions for a non-trivial environment.


Robust Knowledge Discovery via Low-rank Modeling

arXiv.org Artificial Intelligence

It is always an attractive task to discover knowledge for various learning problems; however, this knowledge discovery and maintenance process usually suffers from noise, incompleteness or knowledge domain mismatch. Thus, robust knowledge discovery by removing the noisy features or samples, complementing incomplete data, and mitigating the distribution difference becomes the key. Along this line of research, low-rank modeling is widely-used to solve these challenges. This survey covers the topic of: (1) robust knowledge recovery, (2) robust knowledge transfer, (3) robust knowledge fusion, centered around several major applications. First of all, we deliver a unified formulation for robust knowledge discovery based on a given dataset. Second, we discuss robust knowledge transfer and fusion given multiple datasets with different knowledge flows, followed by practical challenges, model variations, and remarks. Finally, we highlight future research of robust knowledge discovery for incomplete, unbalance, large-scale data analysis. This would benefit AI community from literature review to future direction.


Admiring the Great Mountain: A Celebration Special Issue in Honor of Stephen Grossbergs 80th Birthday

arXiv.org Artificial Intelligence

This editorial summarizes selected key contributions of Prof. Stephen Grossberg and describes the papers in this 80th birthday special issue in his honor. His productivity, creativity, and vision would each be enough to mark a scientist of the first caliber. In combination, they have resulted in contributions that have changed the entire discipline of neural networks. Grossberg has been tremendously influential in engineering, dynamical systems, and artificial intelligence as well. Indeed, he has been one of the most important mentors and role models in my career, and has done so with extraordinary generosity and encouragement. All authors in this special issue have taken great pleasure in hereby commemorating his extraordinary career and contributions.


A Survey of Machine Learning Applied to Computer Architecture Design

arXiv.org Artificial Intelligence

Machine learning has enabled significant benefits in diverse fields, but, with a few exceptions, has had limited impact on computer architecture. Recent work, however, has explored broader applicability for design, optimization, and simulation. Notably, machine learning based strategies often surpass prior state-of-the-art analytical, heuristic, and human-expert approaches. This paper reviews machine learning applied system-wide to simulation and run-time optimization, and in many individual components, including memory systems, branch predictors, networks-on-chip, and GPUs. The paper further analyzes current practice to highlight useful design strategies and identify areas for future work, based on optimized implementation strategies, opportune extensions to existing work, and ambitious long term possibilities. Taken together, these strategies and techniques present a promising future for increasingly automated architectural design.


Demystifying active inference

arXiv.org Artificial Intelligence

Active inference is a first (Bayesian) principles account of how autonomous agents might operate in dynamic, non-stationary environments. The optimization of congruent formulations of the free energy functional (variational and expected), in active inference, enables agents to make inferences about the environment and select optimal behaviors. The agent achieves this by evaluating (sensory) evidence in relation to its internal generative model that entails beliefs about future (hidden) states and sequence of actions that it can choose. In contrast to analogous frameworks $-$ by operating in a pure belief-based setting (free energy functional of beliefs about states) $-$ active inference agents can carry out epistemic exploration and naturally account for uncertainty about their environment. Through this review, we disambiguate these properties, by providing a condensed overview of the theory underpinning active inference. A T-maze simulation is used to demonstrate how these behaviors emerge naturally, as the agent makes inferences about the observed outcomes and optimizes its generative model (via belief updating). Additionally, the discrete state-space and time formulation presented provides an accessible guide on how to derive the (variational and expected) free energy equations and belief updating rules. We conclude by noting that this formalism can be applied in other engineering applications; e.g., robotic arm movement, playing Atari games, etc., if appropriate underlying probability distributions (i.e. generative model) can be formulated.