Africa
MACHINE INTELLIGENCE 11
Machine Intelligence 1 (1967) (eds N. Collins and D. Michie) Oliver & Boyd, Edinburgh Machine Intelligence 2 (1968) (eds E. Dale and D. Michie) Oliver & Boyd, Edinburgh (1 and 2 published as one volume in 1971 by Edinburgh University Press) (eds N. Collins, E. Dale, and D. Michie). CLARENDON PRESS OXFORD 1988 Oxford University Press, Walton Street, Oxford 0X2 6DP Oxford New York Toronto Delhi Bombay Calcutta Madras Karachi Petaling Jaya Singapore Hong Kong Tokyo Nairobi Dar es Salaam Cape Town Melbourne Auckland and associated companies in Berlin lbadan Oxford is a trade mark of Oxford University Press Published in the United States by Oxford University Press, New York J. E. Hayes, D. Michie, and J. Richards 1988 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior permission of Oxford University Press British Library Cataloguing in Publication Data Machine Intelligence. Richard J. 006.3 ISBN 0-19-853718-2 Library of Congress Cataloging in Publication Data Data available Typeset and printed in Northern Ireland at The Universities Press (Belfast) Ltd. Held at intervals in Scotland, the first seven International Machine Intelligence Workshops spanning the period of 1965-71 were involved in developing the new subject internationally--in those early days mainly as a mid-Atlantic phenomenon.
LOGLISP: an alternative to PROLOG
Seven years or so after it was first proposed (Kowalski 1974), the technique of'logic programming' today has an enthusiastic band of users and an increasingly impressive record of applications. For most of these people, logic progamming means PROLOG, the system defined and originally implemented by the Marseille group (Roussel 1975). PROLOG has since been implemented in several other places, most notably at Edinburgh (Warren et al. 1977). Much of the rapid success of logic progamming is due to these implementations of PROLOG (as well as to the inspired missionary work of Kowalski, van Emden, Clark and others). The Edinburgh PROLOG system is in particular a superb piece of software engineering which allows the logic progammer to compile assertions into DEC-10 machine code and thus run logic programs with an efficiency which compares favourably with that of compiled LISP. All other implementations of logic programming (including our own, which we describe in this paper) are based on interpreters.
A New Efficient Method for Calculating Similarity Between Web Services
Rachad, T., Boutahar, J., ghazi, S. El
Web services allow communication between heterogeneous systems in a distributed environment. Their enormous success and their increased use led to the fact that thousands of Web services are present on the Internet. This significant number of Web services which not cease to increase has led to problems of the difficulty in locating and classifying web services, these problems are encountered mainly during the operations of web services discovery and substitution . T raditional ways of search based on keywords are not successful in this context, their results do not support the structure of Web services and they consider in their search only the identifiers of the web service description language ( WSDL) interface elements. The methods based on semantics (WSDLS, OWLS, SAWSDL…) which increase the WSDL description of a Web service with a semantic description allow raising partially this problem, but their complexity and difficulty delays their adoption in real case s . Measuring the similarity between the web services interfaces is the most suitable solution for this kind of problems, it will classify available web services so as to know those that best match the searched profile and those that do not match. Thus, the main goal of this work is to study the degree of similarity between any two web services by offering a new method that is more effective than existing work s .
Learning a Concept Hierarchy from Multi-labeled Documents
Nguyen, Viet-An, Ying, Jordan L., Resnik, Philip, Chang, Jonathan
While topic models can discover patterns of word usage in large corpora, it is difficult to meld this unsupervised structure with noisy, human-provided labels, especially when the label space is large. In this paper, we present a model-Label to Hierarchy (L2H)-that can induce a hierarchy of user-generated labels and the topics associated with those labels from a set of multi-labeled documents. The model is robust enough to account for missing labels from untrained, disparate annotators and provide an interpretable summary of an otherwise unwieldy label set. We show empirically the effectiveness of L2H in predicting held-out words and labels for unseen documents.
A statistical model for tensor PCA
Richard, Emile, Montanari, Andrea
We consider the Principal Component Analysis problem for large tensors of arbitrary order k under a single-spike (or rank-one plus noise) model. On the one hand, we use information theory, and recent results in probability theory to establish necessary and sufficient conditions under which the principal component can be estimated using unbounded computational resources. It turns out that this is possible as soon as the signal-to-noise ratio beta becomes larger than C\sqrt{k log k} (and in particular beta can remain bounded has the problem dimensions increase). On the other hand, we analyze several polynomial-time estimation algorithms, based on tensor unfolding, power iteration and message passing ideas from graphical models. We show that, unless the signal-to-noise ratio diverges in the system dimensions, none of these approaches succeeds. This is possibly related to a fundamental limitation of computationally tractable estimators for this problem. For moderate dimensions, we propose an hybrid approach that uses unfolding together with power iteration, and show that it outperforms significantly baseline methods. Finally, we consider the case in which additional side information is available about the unknown signal. We characterize the amount of side information that allow the iterative algorithms to converge to a good estimate.
Using Convolutional Neural Networks to Recognize Rhythm Stimuli from Electroencephalography Recordings
Stober, Sebastian, Cameron, Daniel J., Grahn, Jessica A.
Electroencephalography (EEG) recordings of rhythm perception might contain enough information to distinguish different rhythm types/genres or even identify the rhythms themselves. We apply convolutional neural networks (CNNs) to analyze and classify EEG data recorded within a rhythm perception study in Kigali, Rwanda which comprises 12 East African and 12 Western rhythmic stimuli - each presented in a loop for 32 seconds to 13 participants. We investigate the impact of the data representation and the pre-processing steps for this classification tasks and compare different network structures. Using CNNs, we are able to recognize individual rhythms from the EEG with a mean classification accuracy of 24.4% (chance level 4.17%) over all subjects by looking at less than three seconds from a single channel. Aggregating predictions for multiple channels, a mean accuracy of up to 50% can be achieved for individual subjects.
Correlation of Data Reconstruction Error and Shrinkages in Pair-wise Distances under Principal Component Analysis (PCA)
Ibraheem, Abdulrahman Oladipupo
In this on-going work, I explore certain theoretical and empirical implications of data transformations under the PCA. In particular, I state and prove three theorems about PCA, which I paraphrase as follows: 1). PCA without discarding eigenvector rows is injective, but looses this injectivity when eigenvector rows are discarded 2). PCA without discarding eigen- vector rows preserves pair-wise distances, but tends to cause pair-wise distances to shrink when eigenvector rows are discarded. 3). For any pair of points, the shrinkage in pair-wise distance is bounded above by an L1 norm reconstruction error associated with the points. Clearly, 3). suggests that there might exist some correlation between shrinkages in pair-wise distances and mean square reconstruction error which is defined as the sum of those eigenvalues associated with the discarded eigenvectors. I therefore decided to perform numerical experiments to obtain the corre- lation between the sum of those eigenvalues and shrinkages in pair-wise distances. In addition, I have also performed some experiments to check respectively the effect of the sum of those eigenvalues and the effect of the shrinkages on classification accuracies under the PCA map. So far, I have obtained the following results on some publicly available data from the UCI Machine Learning Repository: 1). There seems to be a strong cor- relation between the sum of those eigenvalues associated with discarded eigenvectors and shrinkages in pair-wise distances. 2). Neither the sum of those eigenvalues nor pair-wise distances have any strong correlations with classification accuracies. 1
A New Approach of Learning Hierarchy Construction Based on Fuzzy Logic
Robert Gagne (1968) defined a learning hierarchy as a set of specified intellectual capabilities or intellectual skills. The capabilities in the hierarchy have an ordered relationship to each other and the hierarchy, as a whole, bears some relation to a plan for effective instruction. The hierarchy is built in a manner to reflect that a lower level skill must be acquired or mastered before an upper-level one, that is, lower level capabilities are prerequisites for upper level ones. Intellectual capabilities or skills are the nodes of the hierarchy. Gagne (1968) defines them as cognitive strategies that denote capabilities for action. Additionally, they also depict a learning route, a path, from simple skills to a final complex capability. Learning hierarchies not only serve to represent effective instruction plans in terms of skills or capabilities, but also, they serve as diagnosis instruments for providing individual or personalized remediation to students. However, for classrooms with a large number of students, the application of learning hierarchies for individualized (remedial) instruction is a highly time consuming task. Learning hierarchies belong to the behaviorist view on cognition and www.ijera.com
Feedback Solution to Optimal Switching Problems with Switching Cost
Many real-world control problems can be classified as switching problems in the sense that the system subject to control is comprised of several different modes (sometimes called subsystems) and at each instant only one of the modes can be active. A basic example of such a system is a plant equipped with on-off actuators [1]. The solution to such problems includes a switching schedule which determines the number of switching, the switching instants, and the order of the active subsystems. The developments in the field of optimal switching can be divided into different categories, two of which are nonlinear programming based methods and discretization based methods. Nonlinear programming based methods utilize the gradient of the cost with respect to the switching instants to calculate local optimal switching times using nonlinear programming [2]-[9]. In these methods, the sequence of active subsystems, known as the mode sequence, is typically selected a priori. The problem is then simplified to determining the switching instants between the modes. Discretization based methods, however, discretize the state and input space to end up with a finite number of choices [10], [11]. Among the intelligent approaches to the problem, genetic algorithm and neural networks were used in Refs.