Goto

Collaborating Authors

 SPE


Automatic Language Identification in Texts: A Survey

Journal of Artificial Intelligence Research

Language identification ("LI") is the problem of determining the natural language that a document or part thereof is written in. Automatic LI has been extensively researched for over fifty years. Today, LI is a key part of many text processing pipelines, as text processing techniques generally assume that the language of the input text is known. Research in this area has recently been especially active. This article provides a brief history of LI research, and an extensive survey of the features and methods used in the LI literature. We describe the features and methods using a unified notation, to make the relationships between methods clearer. We discuss evaluation methods, applications of LI, as well as off-the-shelf LI systems that do not require training by the end user. Finally, we identify open issues, survey the work to date on each issue, and propose future directions for research in LI.


Analyzing Cyber-Physical Systems from the Perspective of Artificial Intelligence

arXiv.org Artificial Intelligence

The notion of cyber-physical systems (CPS) describes the co mbination of Information and Communication Technology (ICT) and software (the "cyber" part) with physical compone nts. A CPS can emerge from embedded systems by internetworking them. The first big research program focusi ng on CPS has been started by the US National Science Foundation in 2006, where the term CPS is defined in as such tha t it "refers to the tight conjoining of and coordination between computational and physical resources," stating "[ w]e envision that the cyber-physical systems of tomorrow will far exceed those of today in terms of adaptability, auto nomy, efficiency, functionality, reliability, safety, and usability" [1]. While the notion of CPS by the U.S. National Science Foundati on, as outlined above, includes ICT, it does not explicitly name Artificial Intelligence (AI) as a necessary component to raise an embedded system to the status of a CPS. Y et, the availability of sensory data together with a co mmunications system and the ability to exert actions upon the physical world that have been planned for the whole compo und of embedded systems components readily suggests that issues of planning, the increase of reflectivity, effici ency, and lowering resource usage is achieved by increasing the "intelligence" of the overall system. As such, research ers in the domain of AI have found numerous application domains. However, the two worlds of CPS and AI usually operate on diffe rent terms: CPS require operation within well-defined boundaries, i.e., as far as possible deterministic behavio r within well-known, strictly enforced margins of error. In contrast, many AI techniques--Artificial Neural Networks (A NNs) foremost--are firmly rooted in the domain of statistics, which is probably very well seen in the ANN train ing process.


Reinforcement Learning Applications

arXiv.org Artificial Intelligence

We start with a brief introduction to reinforcement learning (RL), about its successful stories, basics, an example, issues, the ICML 2019 Workshop on RL for Real Life, how to use it, study material and an outlook. Then we discuss a selection of RL applications, including recommender systems, computer systems, energy, finance, healthcare, robotics, and transportation.


A Machine Learning Approach for Smartphone-based Sensing of Roads and Driving Style

arXiv.org Machine Learning

Road transportation is of critical importance for a nation, having profound effects in the economy, the health and life style of its people. With the growth of cities and populations come bigger demands for mobility and safety, creating new problems and magnifying those of the past. New tools are needed to face the challenge, to keep roads in good conditions, their users safe, and minimize the impact on the environment. This dissertation is concerned with road quality assessment and aggressive driving, two important problems in road transportation, approached in the context of Intelligent Transportation Systems by using Machine Learning techniques to analyze acceleration time series acquired with smartphone-based opportunistic sensing to automatically detect, classify, and characterize events of interest. Two aspects of road quality assessment are addressed: the detection and the characterization of road anomalies. For the first, the most widely cited works in the literature are compared and proposals capable of equal or better performance are presented, removing the reliance on threshold values and reducing the computational cost and dimensionality of previous proposals. For the second, new approaches for the estimation of pothole depth and the functional condition of speed reducers are showed. The new problem of pothole depth ranking is introduced, using a learning-to-rank approach to sort acceleration signals by the depth of the potholes that they reflect. The classification of aggressive driving maneuvers is done with automatic feature extraction, finding characteristically shaped subsequences in the signals as more effective discriminants than conventional descriptors calculated over time windows. Finally, all the previously mentioned tasks are combined to produce a robust road transport evaluation platform.


Towards automated symptoms assessment in mental health

arXiv.org Machine Learning

Activity and motion analysis has the potential to be used as a diagnostic tool for mental disorders. However, to-date, little work has been performed in turning stratification measures of activity into useful symptom markers. The research presented in this thesis has focused on the identification of objective activity and behaviour metrics that could be useful for the analysis of mental health symptoms in the above mentioned dimensions. Particular attention is given to the analysis of objective differences between disorders, as well as identification of clinical episodes of mania and depression in bipolar patients, and deterioration in borderline personality disorder patients. A principled framework is proposed for mHealth monitoring of psychiatric patients, based on measurable changes in behaviour, represented in physical activity time series, collected via mobile and wearable devices. The framework defines methods for direct computational analysis of symptoms in disorganisation and psychomotor dimensions, as well as measures for indirect assessment of mood, using patterns of physical activity, sleep and circadian rhythms. The approach of computational behaviour analysis, proposed in this thesis, has the potential for early identification of clinical deterioration in ambulatory patients, and allows for the specification of distinct and measurable behavioural phenotypes, thus enabling better understanding and treatment of mental disorders.


Reasoning-Driven Question-Answering for Natural Language Understanding

arXiv.org Artificial Intelligence

Natural language understanding (NLU) of text is a fundamental challenge in AI, and it has received significant attention throughout the history of NLP research. This primary goal has been studied under different tasks, such as Question Answering (QA) and Textual Entailment (TE). In this thesis, we investigate the NLU problem through the QA task and focus on the aspects that make it a challenge for the current state-of-the-art technology. This thesis is organized into three main parts: In the first part, we explore multiple formalisms to improve existing machine comprehension systems. We propose a formulation for abductive reasoning in natural language and show its effectiveness, especially in domains with limited training data. Additionally, to help reasoning systems cope with irrelevant or redundant information, we create a supervised approach to learn and detect the essential terms in questions. In the second part, we propose two new challenge datasets. In particular, we create two datasets of natural language questions where (i) the first one requires reasoning over multiple sentences; (ii) the second one requires temporal common sense reasoning. We hope that the two proposed datasets will motivate the field to address more complex problems. In the final part, we present the first formal framework for multi-step reasoning algorithms, in the presence of a few important properties of language use, such as incompleteness, ambiguity, etc. We apply this framework to prove fundamental limitations for reasoning algorithms. These theoretical results provide extra intuition into the existing empirical evidence in the field.


A 20-Year Community Roadmap for Artificial Intelligence Research in the US

arXiv.org Artificial Intelligence

Decades of research in artificial intelligence (AI) have produced formidable technologies that are providing immense benefit to industry, government, and society. AI systems can now translate across multiple languages, identify objects in images and video, streamline manufacturing processes, and control cars. The deployment of AI systems has not only created a trillion-dollar industry that is projected to quadruple in three years, but has also exposed the need to make AI systems fair, explainable, trustworthy, and secure. Future AI systems will rightfully be expected to reason effectively about the world in which they (and people) operate, handling complex tasks and responsibilities effectively and ethically, engaging in meaningful communication, and improving their awareness through experience. Achieving the full potential of AI technologies poses research challenges that require a radical transformation of the AI research enterprise, facilitated by significant and sustained investment. These are the major recommendations of a recent community effort coordinated by the Computing Community Consortium and the Association for the Advancement of Artificial Intelligence to formulate a Roadmap for AI research and development over the next two decades.


Automatic Fact-Checking Using Context and Discourse Information

arXiv.org Artificial Intelligence

We study the problem of automatic fact-checking, paying special attention to the impact of contextual and discourse information. We address two related tasks: (i) detecting check-worthy claims, and (ii) fact-checking claims. We develop supervised systems based on neural networks, kernel-based support vector machines, and combinations thereof, which make use of rich input representations in terms of discourse cues and contextual features. For the check-worthiness estimation task, we focus on political debates, and we model the target claim in the context of the full intervention of a participant and the previous and the following turns in the debate, taking into account contextual meta information. For the fact-checking task, we focus on answer verification in a community forum, and we model the veracity of the answer with respect to the entire question--answer thread in which it occurs as well as with respect to other related posts from the entire forum. We develop annotated datasets for both tasks and we run extensive experimental evaluation, confirming that both types of information ---but especially contextual features--- play an important role.


ICATT hosts business forum on artificial intelligence

#artificialintelligence

The Institute of Chartered Accountants of Trinidad and Tobago (ICATT), earlier this month, hosted a business forum comprising an audience of financial executives from various sectors including energy, banking and finance at the KPMG Headquarters in Port of Spain. The event themed "Artificial Intelligence (AI) – the Future of Accounting" exposed professional accountants to global developments, good practice guidance and knowledge-sharing that will enhance their roles and domain across the economy. In delivering the opening remarks, ICATT's president, Stacy-Ann Golding, praised the ICATT Professional Accountants in Business (PAIB) Committee for organising the forum, the topic of which, she noted, was critical to improving the readiness of today's accounting professionals to deal with AI and its implications. Bring a depth of insight and experience were featured speakers Nigel Romano, managing director and chief executive officer, JMMB Bank and Leslie Lee Fook, director of Artificial Intelligence, Automation and Analytics at Incus Services Ltd. Romano spoke on the use of AI, "I can recall the now obsolete, clunky computerised systems used in accounting during the 1970s and how they helped speed up work processes at that time. Today a similar shift is happening as current systems will soon be overshadowed by those powered by self-learning / machine learning capabilities."


AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

arXiv.org Machine Learning

The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions. The goal of the Challenge is to provide a common benchmark test set for multimodal information processing and to bring together the health and emotion recognition communities, as well as the audiovisual processing communities, to compare the relative merits of various approaches to health and emotion recognition from real-life data. This paper presents the major novelties introduced this year, the challenge guidelines, the data used, and the performance of the baseline systems on the three proposed tasks: state-of-mind recognition, depression assessment with AI, and cross-cultural affect sensing, respectively.