Industry
Super-Mixed Multiple Attribute Group Decision Making Method Based on Hybrid Fuzzy Grey Relation Approach Degree
A multiple attribute decision making (MADM), in which attributes are real number, interval real number, linguistic and uncertain linguistic value, has been already applied in practice such as the evaluation of enterprise effect, the selection of investment project, the selection of person, the research of military equipment scheme, the evaluation of strategy effect, the reliability assessment and the maintainability assessment, etc (Yongqi Xia, 2004, Dang Luo, Sifeng Liu, 2005, Yongqing Wei, Peide Liu, 2009). Extended TOPSIS Method with Interval-Valued Intuitionistic Fuzzy Numbers for Virtual Enterprise Partner Selection has been researched by Fei Ye(2010). Chuanming Ding (2007,a) defined a new similarity degree for various types of attribute and normalized the calculation of similarity degree of the attribute value of each type in unified metric space. Also, by this similarity degree, the comparison of each plan with ideal plan was performed and decision making method was given. Chuanming (2007,b), based on the TOPSIS (Technique for Order Preference by Similarity to Ideal Solution), transformed the attribute value of plan into four-dimensional attribute value, unified various types of attribute value, defined a fourdimensional approach degree, and by this approach degree, solved the multiple attribute mixed-type decision-making problem associated with real number, interval real number, linguistic and uncertain linguistic value. Yongqi Xia (2004) studied a method considering insufficiency degree of information and preference to danger on the basis of the grey-fuzzy comprehensive evaluation method of interval value preference. In the method, they represent the weight and the attribute value by two interval number pair by considering membership and grey degree at the same time. Sifeng Liu, Yaoguo Dang, Jiangling Wang, Zhengpeng Wu (2009), based on the definitions of entropy, proposed a method of getting weight that considers the character of grey cluster decision-making and 2-tuple linguistic assessment, and proposed the method of 2-tuple linguistic assessment based on grey cluster. Zhen Zhang, Chonghui Guo (2012) transformed uncertain linguistic evaluation information of each decision maker to trapezoidal fuzzy numbers, and then denoted, by solving two optimization models, the collective evaluation of the alternatives by trapezoidal fuzzy numbers.
Higher-Order Partial Least Squares (HOPLS): A Generalized Multi-Linear Regression Method
Zhao, Qibin, Caiafa, Cesar F., Mandic, Danilo P., Chao, Zenas C., Nagasaka, Yasuo, Fujii, Naotaka, Zhang, Liqing, Cichocki, Andrzej
A new generalized multilinear regression model, termed the Higher-Order Partial Least Squares (HOPLS), is introduced with the aim to predict a tensor (multiway array) $\tensor{Y}$ from a tensor $\tensor{X}$ through projecting the data onto the latent space and performing regression on the corresponding latent variables. HOPLS differs substantially from other regression models in that it explains the data by a sum of orthogonal Tucker tensors, while the number of orthogonal loadings serves as a parameter to control model complexity and prevent overfitting. The low dimensional latent space is optimized sequentially via a deflation operation, yielding the best joint subspace approximation for both $\tensor{X}$ and $\tensor{Y}$. Instead of decomposing $\tensor{X}$ and $\tensor{Y}$ individually, higher order singular value decomposition on a newly defined generalized cross-covariance tensor is employed to optimize the orthogonal loadings. A systematic comparison on both synthetic data and real-world decoding of 3D movement trajectories from electrocorticogram (ECoG) signals demonstrate the advantages of HOPLS over the existing methods in terms of better predictive ability, suitability to handle small sample sizes, and robustness to noise.
The DLR Hierarchy of Approximate Inference
Rosen-Zvi, Michal, Jordan, Michael I., Yuille, Alan
We propose a hierarchy for approximate inference based on the Dobrushin, Lanford, Ruelle (DLR) equations. This hierarchy includes existing algorithms, such as belief propagation, and also motivates novel algorithms such as factorized neighbors (FN) algorithms and variants of mean field (MF) algorithms. In particular, we show that extrema of the Bethe free energy correspond to approximate solutions of the DLR equations. In addition, we demonstrate a close connection between these approximate algorithms and Gibbs sampling. Finally, we compare and contrast various of the algorithms in the DLR hierarchy on spin-glass problems. The experiments show that algorithms higher up in the hierarchy give more accurate results when they converge but tend to be less stable.
Ordering-Based Search: A Simple and Effective Algorithm for Learning Bayesian Networks
Teyssier, Marc, Koller, Daphne
One of the basic tasks for Bayesian networks (BNs) is that of learning a network structure from data. The BN-learning problem is NP-hard, so the standard solution is heuristic search. Many approaches have been proposed for this task, but only a very small number outperform the baseline of greedy hill-climbing with tabu lists; moreover, many of the proposed algorithms are quite complex and hard to implement. In this paper, we propose a very simple and easy-to-implement method for addressing this task. Our approach is based on the well-known fact that the best network (of bounded in-degree) consistent with a given node ordering can be found very efficiently. We therefore propose a search not over the space of structures, but over the space of orderings, selecting for each ordering the best network consistent with it. This search space is much smaller, makes more global search steps, has a lower branching factor, and avoids costly acyclicity checks. We present results for this algorithm on both synthetic and real data sets, evaluating both the score of the network found and in the running time. We show that ordering-based search outperforms the standard baseline, and is competitive with recent algorithms that are much harder to implement.
Obtaining Calibrated Probabilities from Boosting
Niculescu-Mizil, Alexandru, Caruana, Richard A.
Boosted decision trees typically yield good accuracy, precision, and ROC area. However, because the outputs from boosting are not well calibrated posterior probabilities, boosting yields poor squared error and cross-entropy. We empirically demonstrate why AdaBoost predicts distorted probabilities and examine three calibration methods for correcting this distortion: Platt Scaling, Isotonic Regression, and Logistic Correction. We also experiment with boosting using log-loss instead of the usual exponential loss. Experiments show that Logistic Correction and boosting with log-loss work well when boosting weak models such as decision stumps, but yield poor performance when boosting more complex models such as full decision trees. Platt Scaling and Isotonic Regression, however, significantly improve the probabilities predicted by
Learning Bayesian Network Parameters with Prior Knowledge about Context-Specific Qualitative Influences
Feelders, Ad, van der Gaag, Linda C.
We present a method for learning the parameters of a Bayesian network with prior knowledge about the signs of influences between variables. Our method accommodates not just the standard signs, but provides for context-specific signs as well. We show how the various signs translate into order constraints on the network parameters and how isotonic regression can be used to compute order-constrained estimates from the available data. Our experimental results show that taking prior knowledge about the signs of influences into account leads to an improved fit of the true distribution, especially when only a small sample of data is available. Moreover, the computed estimates are guaranteed to be consistent with the specified signs, thereby resulting in a network that is more likely to be accepted by experts in its domain of application.
Maximum Margin Bayesian Networks
Guo, Yuhong, Wilkinson, Dana, Schuurmans, Dale
We consider the problem of learning Bayesian network classifiers that maximize the margin over a set of classification variables. We find that this problem is harder for Bayesian networks than for undirected graphical models like maximum margin Markov networks. The main difficulty is that the parameters in a Bayesian network must satisfy additional normalization constraints that an undirected graphical model need not respect. These additional constraints complicate the optimization task. Nevertheless, we derive an effective training algorithm that solves the maximum margin training problem for a range of Bayesian network topologies, and converges to an approximate solution for arbitrary network topologies. Experimental results show that the method can demonstrate improved generalization performance over Markov networks when the directed graphical structure encodes relevant knowledge. In practice, the training technique allows one to combine prior knowledge expressed as a directed (causal) model with state of the art discriminative learning methods.
Learning from Sparse Data by Exploiting Monotonicity Constraints
Altendorf, Eric E., Restificar, Angelo C., Dietterich, Thomas G.
When training data is sparse, more domain knowledge must be incorporated into the learning algorithm in order to reduce the effective size of the hypothesis space. This paper builds on previous work in which knowledge about qualitative monotonicities was formally represented and incorporated into learning algorithms (e.g., Clark & Matwin's work with the CN2 rule learning algorithm). We show how to interpret knowledge of qualitative influences, and in particular of monotonicities, as constraints on probability distributions, and to incorporate this knowledge into Bayesian network learning algorithms. We show that this yields improved accuracy, particularly with very small training sets (e.g. less than 10 examples).
Inferring land use from mobile phone activity
Toole, Jameson L., Ulm, Michael, Bauer, Dietmar, Gonzalez, Marta C.
Understanding the spatiotemporal distribution of people within a city is crucial to many planning applications. Obtaining data to create required knowledge, currently involves costly survey methods. At the same time ubiquitous mobile sensors from personal GPS devices to mobile phones are collecting massive amounts of data on urban systems. The locations, communications, and activities of millions of people are recorded and stored by new information technologies. This work utilizes novel dynamic data, generated by mobile phone users, to measure spatiotemporal changes in population. In the process, we identify the relationship between land use and dynamic population over the course of a typical week. A machine learning classification algorithm is used to identify clusters of locations with similar zoned uses and mobile phone activity patterns. It is shown that the mobile phone data is capable of delivering useful information on actual land use that supplements zoning regulations.
Agnostic System Identification for Model-Based Reinforcement Learning
Ross, Stephane, Bagnell, J. Andrew
A fundamental problem in control is to learn a model of a system from observations that is useful for controller synthesis. To provide good performance guarantees, existing methods must assume that the real system is in the class of models considered during learning. We present an iterative method with strong guarantees even in the agnostic case where the system is not in the class. In particular, we show that any no-regret online learning algorithm can be used to obtain a near-optimal policy, provided some model achieves low training error and access to a good exploration distribution. Our approach applies to both discrete and continuous domains. We demonstrate its efficacy and scalability on a challenging helicopter domain from the literature.