Government
Graph-Based Anomaly Detection Applied to Homeland Security Cargo Screening
Eberle, William (Tennessee Technological University) | Holder, Lawrence (Washington State University) | Massengill, Beverly (Tennessee Technological University)
Protecting our nation’s ports is a critical challenge for homeland security and requires the research, development and deployment of new technologies that will allow for the efficient securing of shipments entering this country. Most approaches look only at statistical irregularities in the attributes of the cargo, and not at the relationships of this cargo to others. However, anomalies detected in these relationships could add to the suspicion of the cargo, and therefore improve the accuracy with which we detect suspicious cargo. This paper proposes an improvement in our ability to detect suspicious cargo bound for the U.S. through a graph-based anomaly detection approach. Using anonymized data received from the Department of Homeland Security, we demonstrate the effectiveness of our approach and its usefulness to a homeland security analyst who is tasked with uncovering illegal and potentially dangerous cargo shipments.
Automatic Coherence Profile in Public Speeches of Three Latin American Heads-of-State
Venegas, René (Universidad Catolica de Valparaiso)
Different studies provide evidence that the computational psycholinguistic algorithm called Latent Semantic Analysis (LSA) allows measuring local and global coherence in texts similarly to human evaluation (Foltz, Kintsch, Landauer 1998; McNamara, Cai & Louwerse 2007; McCarthy, Briner, Rus, & McNamara, 2007; McNamara, Louwerse & Jeuniaux 2009; Louwerse, McCarthy & Graesser 2010). The texts used in all these studies are written in English and correspond to scientific and literary texts. In Spanish, there are some studies using LSA that measure the semantic similarity between texts in automatic summary assessment (Pérez, Alfonseca, Rodríguez, Gliozzo, Strapparava & Magnini 2005; León, Olmos, Escudero, Cañas & Salmerón 2006; Venegas 2007, 2009, 2011); however, automatic measurement of coherence in Spanish has not yet been sufficiently investigated. The present study aimed at identifying a global and local coherence profile in a corpus of speeches in Spanish of three Latin American Heads-of-States (Perón, Castro and Pinochet), using Latent Semantic Analysis. Local coherence is calculated through the measurement of implicit semantic similarity between adjacent sentences and global coherence through the measurement of the similarity among the semantic content of the paragraphs. The corpus under analysis corresponds to a sample of 107 speeches. The semantic space was built using a multi-register corpus and it is available through the “Interface for the measurement of lexical-semantic similarity” in the El Grial interface (www.elgrial.cl). Results showed a systematic difference between the speeches of the Heads-of-State in terms of both local and global coherence. The Bonferroni analysis established an effect that distinguishes Perón’s speeches from Pinochet’s and Castro’s speeches. This results show that Perón’s speeches are more topically related than the other leaders’, probably due to a discourse strategy to persuade voters. The identification of a profile of coherence might be relevant to predict cues of government discourse styles.
The Devil Is in the Details: New Directions in Deception Analysis
McCarthy, Philip Michael (The University of Memphis ) | Duran, Nicholas D. (University of California Merced) | Booker, Lucille M. (The University of Memphis)
In this study, we use the computational textual analysis tool, the Gramulator, to identify and examine the distinctive linguistic features of deceptive and truthful discourse. The theme of the study is abortion rights and the deceptive texts are derived from a Devil’s Advocate approach, conducted to suppress personal beliefs and values. Our study takes the form of a contrastive corpus analysis, and produces systematic differences between truthful and deceptive personal accounts. Results suggest that deceivers employ a distancing strategy that is often associated with deceptive linguistic behavior. Ultimately, these deceivers struggle to adopt a truth perspective. Perhaps of most importance, our results indicate issues of concern with current deception detection theory and methodology. From a theoretical standpoint, our results question whether deceivers are deceiving at all or whether they are merely poorly expressing a rhetorical position, caused by being forced to speculate on a perceived proto-typical position. From a methodological standpoint, our results cause us to question the validity of deception corpora. Consequently, we propose new rigorous standards so as to better understand the subject matter of the deception field. Finally, we question the prevailing approach of abstract data measurement and call for future assessment to consider contextual lexical features. We conclude by suggesting a prudent approach to future research for fear that our eagerness to analyze and theorize may cause us to misidentify deception. After-all, successful deception, which is the kind we seek to detect, is likely to be an elusive and fickle prey.
Forecasting Conflicts Using N-Grams Models
Besse, Camille (Laval University) | Bakhtiari, Alireza (Laval University) | Lamontagne, Luc (Laval University)
Analyzing international political behavior based on similar precedent circumstances is one of the basic techniques that policymakers use to monitor and assess current situations. Our goal is to investigate how to analyze geopolitical conflicts as sequences of events and to determine what probabilistic models are suitable to perform these analyses. In this paper, we evaluate the performance of N-grams models on the problem of forecasting political conflicts from sequences of events. For the current phase of the project, we focused on event data collected from the Balkans war in the 1990's. Our experimental results indicate that N-gram models have impressive results when applied to this data set, with accuracies above 90\% for most configurations.
Robustness and Accuracy Tradeoffs for Recommender Systems Under Attack
Seminario, Carlos E. (University of North Carolina at Charlotte) | Wilson, David C. (University of North Carolina at Charlotte)
Recommender systems assist users in the daunting task of sifting through large amounts of data in order to select relevant information or items. Common examples include consumer products and services, such as for songs, books, articles, etc. Unfortunately, such systems may be subject to attack by malicious users who want to manipulate the system’s recommendations to suit their needs: to promote their own (or demote a competitor’s) product/service, or to cause disruption in the recommender system. Attacks can cause the recommender system to become unreliable and untrustworthy, resulting in user dissatisfaction. Developers already face tradeoffs in system efficiency and accuracy, and designing for robustness adds an additional dimension for consideration. In this paper, we show how the underlying implementation choices for item-based and user-based Collaborative Filtering recommender systems can affect the accuracy and robustness of recommender systems. We also show how accuracy and robustness can change over a system’s lifetime by analyzing a set of temporal snapshots from system usage over time. Results provide insight into some of the tradeoffs between robustness and accuracy that operators may need to consider in development and evaluation.
Modeling the Interaction Between Mixed Teams of Humans and Robots and Local Population for a Market Patrol Task
Khan, Saad Ahmad (University of Central Florida) | Bhatia, Taranjeet Singh (University of Central Florida) | Parker, Shane (University of Central Florida) | Boloni, Ladislau (University of Central Florida)
We consider a cross-cultural interaction scenario where a group of soldiers assisted by robots interact with local vendors in a market place. We develop a model to quantify, analyze and predict the perception of the actions of the soldiers and the robot by the local population. The model assumes that humans are considering collections of concrete and intangible values which are not, in general, directly and linearly convertible into each other. We argue that satisfactory modeling accuracy can be achieved by restricting the considered intangibles to a small set of {\em culture sanctioned social values}. For these values, the culture provides a name, calculation methods, as well as associated rules of conduct. We validate our model by comparing the predicted values with the judgment of a large group of human observers cognizant of the modeled culture. We use the model to evaluate the tradeoffs between several long term strategies to maintain security as well as to increase the trust and goodwill of the local population.
Real-Time Filtering for Pulsing Public Opinion in Social Media
Finn, Samantha (Wellesley College) | Mustafaraj, Eni (Wellesley College)
When analysing social media conversations, in search of the public opinion about an unfolding event that is be- ing discussed in real-time (e.g., presidential debates, major speeches, etc.), it is important to distinguish between two groups of participants: opinion-makers and opinion-holders. To address this problem, we propose a supervised machine-learning approach, which uses inexpensively acquired labeled data from monothematic Twitter accounts to learn a binary classifier for the labels “political account” (opinion-makers) and “non-political account” (opinion-holders). While the classifier has a 83% accuracy on individual tweets, when applied to the last 200 tweets from accounts of a set of 1000 Twitter users, it classifies accounts with a 97% accuracy. This high accuracy derives from our decision to incorporate information about classifier probability into the classification. Our work demonstrates that machine learning algorithms can play a critical role in improving the quality of social media analytics and understanding, whose importance is increasing as social media adoption becomes widespread.
Maritime Threat Detection Using Probabilistic Graphical Models
Auslander, Bryan (Knexus Research Corporation) | Gupta, Kalyan Moy (Knexus Research Corporation) | Aha, David William (Naval Research Laboratory)
Maritime threat detection is a challenging problem because maritime environments can involve a complex combination of concurrent vessel activities, and only a small fraction of these may be irregular, suspicious, or threatening. Previous work on this task has been limited to analyses of single vessels using simple rule-based models that alert watchstanders when a proximity threshold is breached. We claim that Probabilistic Graphical Models (PGMs) can be used to more effectively model complex maritime situations. In this paper, we study the performance of PGMs for detecting (small boat) maritime attacks. We describe three types of PGMs that vary in their representational expressiveness and evaluate them on a threat recognition task using track data obtained from force protection naval exercises involving unmanned sea surface vehicles. We found that the best-performing PGMs can outperform the deployed rule-based approach on these tasks, though some PGMs require substantial engineering and are computationally expensive.
Invited Talks
Youngblood, Michael (University of North Carolina Charlotte)
Bill Swartout Introduced by Alan Kay at XEROX PARC in the 1970's, the desktop metaphor, which was later adopted in the Macintosh and Windows operating systems, has become the primary way we think about interacting with computers. Over the last decade, we have been developing sophisticated virtual humans at the USC Institute for Creative Technologies.