Oceania
From Frequency to Meaning: Vector Space Models of Semantics
Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are beginning to address these limits. This paper surveys the use of VSMs for semantic processing of text. We organize the literature on VSMs according to the structure of the matrix in a VSM. There are currently three broad classes of VSMs, based on term-document, word-context, and pair-pattern matrices, yielding three classes of applications. We survey a broad range of applications in these three categories and we take a detailed look at a specific open source project in each category. Our goal in this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs for those who are already familiar with the area, and to provide pointers into the literature for those who are less familiar with the field.
A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning
Yu, Jin, Vishwanathan, S. V. N., Guenter, Simon, Schraudolph, Nicol N.
We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: the local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We prove that under some technical conditions, the resulting subBFGS algorithm is globally convergent in objective function value. We apply its memory-limited variant (subLBFGS) to L_2-regularized risk minimization with the binary hinge loss. To extend our algorithm to the multiclass and multilabel settings, we develop a new, efficient, exact line search algorithm. We prove its worst-case time complexity bounds, and show that our line search can also be used to extend a recently developed bundle method to the multiclass and multilabel settings. We also apply the direction-finding component of our algorithm to L_1-regularized risk minimization with logistic loss. In all these contexts our methods perform comparable to or better than specialized state-of-the-art solvers on a number of publicly available datasets. An open source implementation of our algorithms is freely available.
Using CODEQ to Train Feed-forward Neural Networks
Omran, Mahamed G. H., al-Adwani, Faisal
CODEQ is a new, population-based meta-heuristic algorithm that is a hybrid of concepts from chaotic search, opposition-based learning, differential evolution and quantum mechanics. CODEQ has successfully been used to solve different types of problems (e.g. constrained, integer-programming, engineering) with excellent results. In this paper, CODEQ is used to train feed-forward neural networks. The proposed method is compared with particle swarm optimization and differential evolution algorithms on three data sets with encouraging results.
Random Indexing K-tree
De Vries, Christopher M., De Vine, Lance, Geva, Shlomo
The purpose of this paper is to present and analyse the combination of Random Indexing (RI) with the K-tree algorithm. Both RI and K-tree adapt to changing data and decrease the cost of computationally intensive vector based applications. This combination is particularly suitable to the representation and clustering of very large document collections. Documents are typically represented in vector space as very sparse high dimensional vectors. RI can reduce the dimensionality and sparsity of this representation. In turn, the condensed representation is highly effective when working with K-tree. The paper is focused on determining the effectiveness of using RI with K-tree through experiments and comparative analysis of results. Sections 2 to 6 discuss K-tree, Random Indexing, Document Representation, Experimental Setup and Experimental results respectively. The paper ends with a conclusion in Section 7.
K-tree: Large Scale Document Clustering
De Vries, Christopher M., Geva, Shlomo
We introduce K-tree in an information retrieval context. It is an efficient approximation of the k-means clustering algorithm. Unlike k-means it forms a hierarchy of clusters. It has been extended to address issues with sparse representations. We compare performance and quality to CLUTO using document collections. The K-tree has a low time complexity that is suitable for large document collections. This tree structure allows for efficient disk based implementations where space requirements exceed that of main memory.
Document Clustering with K-tree
De Vries, Christopher M., Geva, Shlomo
This paper describes the approach taken to the XML Mining track at INEX 2008 by a group at the Queensland University of Technology. We introduce the K-tree clustering algorithm in an Information Retrieval context by adapting it for document clustering. Many large scale problems exist in document clustering. K-tree scales well with large inputs due to its low complexity. It offers promising results both in terms of efficiency and quality. Document classification was completed using Support Vector Machines.
Robotics: Science and Systems IV
Brock, Oliver (University of Massachusetts) | Trinkle, Jeff (Rensselaer Polytechnic Institute) | Ramos, Fabio (Australian Centre for Field Robotics)
The conference Robotics: Science and Systems was held at the Swiss Federal Institute of Technology (ETH) in Zurich Switzerland, from June 25 to June 28, 2008. More than 280 international researchers attended this single track conference to learn about the most exciting robotics research and most advanced robotic systems. The program committee, led by sixteen area chairs, selected 40 papers out of 163 submissions. The program also included seven invited talks and two early career spotlight presentations. The plenary presentations were complemented by thirteen workshops.
The Fifth International Conference on Intelligent Environments (IE 09): A Report
Callaghan, Vic (University of Essex) | Kameas, Achilles (Hellenic Open University) | Royo, Dolors (Technical University of Catalonia) | Reyes, Angelica (Technical University of Catalonia) | Navarro, Leandro (Technical University of Catalonia)
The development of intelligent environments is considered an important step toward the realization of the ambient intelligence vision. Greece, served as program chairs. The previous four editions of the IE conference have been held at the University of Essex, UK (in 2005), at the National Technical University of Athens, Greece (in 2006), at the University of Ulm, Germany (in 2007), and at the University of Washington campus in Seattle, Washington, USA (in 2008). The development of intelligent environments is About 120 delegates attended the workshops considered the first and primary step toward the and the conference. These included representatives realization of the ambient intelligence vision.
Hierarchical Semi-Markov Conditional Random Fields for Recursive Sequential Data
Truyen, Tran T., Phung, Dinh, Bui, Hung, Venkatesh, Svetha
Inspired by the hierarchical hidden Markov models (HHMM), we present the hierarchical semi-Markov conditional random field (HSCRF), a generalisation of embedded undirected Markov chains to model complex hierarchical, nested Markov processes. It is parameterised in a discriminative framework and has polynomial time algorithms for learning and inference. Importantly, we develop efficient algorithms for learning and constrained inference in a partially-supervised setting, which is important issue in practice where labels can only be obtained sparsely. We demonstrate the HSCRF in two applications: (i) recognising human activities of daily living (ADLs) from indoor surveillance cameras, and (ii) noun-phrase chunking. We show that the HSCRF is capable of learning rich hierarchical models with reasonable accuracy in both fully and partially observed data cases.
Hierarchical Semi-Markov Conditional Random Fields for Recursive Sequential Data
Truyen, Tran T., Phung, Dinh, Bui, Hung, Venkatesh, Svetha
Inspired by the hierarchical hidden Markov models (HHMM), we present the hierarchical semi-Markov conditional random field (HSCRF), a generalisation of embedded undirected Markov chains to model complex hierarchical, nested Markov processes. It is parameterised in a discriminative framework and has polynomial time algorithms for learning and inference. Importantly, we develop efficient algorithms for learning and constrained inference in a partially-supervised setting, which is important issue in practice where labels can only be obtained sparsely. We demonstrate the HSCRF in two applications: (i) recognising human activities of daily living (ADLs) from indoor surveillance cameras, and (ii) noun-phrase chunking. We show that the HSCRF is capable of learning rich hierarchical models with reasonable accuracy in both fully and partially observed data cases.