Texas A&M University
Helping Novices Avoid the Hazards of Data: Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources
Janpuangtong, Sasin (Texas A&M University) | Shell, Dylan A. (Texas A&M University)
This article describes an end-to-end learning framework that allows a novice to create models from data easily by helping structure the model building process and capturing extended aspects of domain knowledge. By treating the whole modeling process interactively and exploiting high-level knowledge in the form of an ontology, the framework is able to aid the user in a number of ways, including in helping to avoid pitfalls such as data dredging. We describe how the framework automatically exploits structured knowledge in an ontology to identify relevant concepts, and how a data extraction component can make use of online data sources to find measurements of those concepts so that their relevance can be evaluated. Prediction error on unseen examples of these models show that our framework, making use of the ontology, helps to improve model generalization.
Helping Novices Avoid the Hazards of Data: Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources
Janpuangtong, Sasin (Texas A&M University) | Shell, Dylan A. (Texas A&M University)
The infrastructure and tools necessary for large-scale data analytics, formerly the exclusive purview of experts, are increasingly available. Whereas a knowledgeable data-miner or domain expert can rightly be expected to exercise caution when required (for example, around fallacious conclusions supposedly supported by the data), the nonexpert may benefit from some judicious assistance. This article describes an end-to-end learning framework that allows a novice to create models from data easily by helping structure the model building process and capturing extended aspects of domain knowledge. By treating the whole modeling process interactively and exploiting high-level knowledge in the form of an ontology, the framework is able to aid the user in a number of ways, including in helping to avoid pitfalls such as data dredging. Prudence must be exercised to avoid these hazards as certain conclusions may only be supported if, for example, there is extra knowledge which gives reason to trust a narrower set of hypotheses. This article adopts the solution of using higher-level knowledge to allow this sort of domain knowledge to be used automatically, selecting relevant input attributes, and thence constraining the hypothesis space. We describe how the framework automatically exploits structured knowledge in an ontology to identify relevant concepts, and how a data extraction component can make use of online data sources to find measurements of those concepts so that their relevance can be evaluated. To validate our approach, models of four different problem domains were built using our implementation of the framework. Prediction error on unseen examples of these models show that our framework, making use of the ontology, helps to improve model generalization.
Mechanix: A Sketch-Based Tutoring and Grading System for Free-Body Diagrams
Valentine, Stephanie (Texas A&M University) | Vides, Francisco (Texas A&M University) | Lucchese, George (Texas A&M University) | Turner, David (Texas A&M University) | Kim, Hong-hoe (Texas A&M University) | Li, Wenzhe (Texas A&M University) | Linsey, Julie (Texas A&M University) | Hammond, Tracy (Texas A&M University)
Introductory engineering courses within large universities often have annual enrollments which can reach up to a thousand students. It is very challenging to achieve differentiated instruction in classrooms with class sizes and student diversity of such great magnitude. Professors can only assess whether students have mastered a concept by using multiple choice questions, while detailed homework assignments, such as planar truss diagrams, are rarely assigned because professors and teaching assistants would be too overburdened with grading to return assignments with valuable feedback in a timely manner. In this paper, we introduce Mechanix, a sketch-based deployed tutoring system for engineering students enrolled in statics courses. Our system not only allows students to enter planar truss and free body diagrams into the system just as they would with pencil and paper, but our system checks the student's work against a hand-drawn answer entered by the instructor, and then returns immediate and detailed feedback to the student. Students are allowed to correct any errors in their work and resubmit until the entire content is correct and thus all of the objectives are learned. Since Mechanix facilitates the grading and feedback processes, instructors are now able to assign free response questions, increasing teacher's knowledge of student comprehension. Furthermore, the iterative correction process allows students to learn during a test, rather than simply displaying memorized information.
Reports of the AAAI 2011 Spring Symposia
Buller, Mark (Brown University) | Cuddihy, Paul (General Electric Research) | Davis, Ernest (New York University) | Doherty, Patrick (Linkoping University) | Doshi-Velez, Finale (Massachusetts Institute of Technology) | Erdem, Esra (Sabanci University) | Fisher, Douglas (Vanderbilt University) | Green, Nancy (University of North Carolina, Greensboro) | Hinkelmann, Knut (University of Applied Sciences Northwestern Switzerland FHNW) | Maher, Mary Lou (University of Maryland) | McLurkin, James (Rice University) | Maheswaran, Rajiv (University of Southern California) | Rubinelli, Sara (University of Lucerne) | Schurr, Nathan (Aptima, Inc.) | Scott, Donia (University of Sussex) | Shell, Dylan (Texas A&M University) | Szekely, Pedro (University of Southern California) | Thönssen, Barbara (University of Applied Sciences Northwestern Switzerland FHNW) | Urken, Arnold B. (University of Arizona)
The Association for the Advancement of Artificial Intelligence, in cooperation with Stanford University's Department of Computer Science, presented the 2011 Spring Symposium Series Monday through Wednesday, March 21–23, 2011 at Stanford University. The titles of the eight symposia were AI and Health Communication, Artificial Intelligence and Sustainable Design, AI for Business Agility, Computational Physiology, Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, Logical Formalizations of Commonsense Reasoning, Multirobot Systems and Physical Data Structures, and Modeling Complex Adaptive Systems As If They Were Voting Processes.
Reports of the AAAI 2011 Spring Symposia
Buller, Mark (Brown University) | Cuddihy, Paul (General Electric Research) | Davis, Ernest (New York University) | Doherty, Patrick (Linkoping University) | Doshi-Velez, Finale (Massachusetts Institute of Technology) | Erdem, Esra (Sabanci University) | Fisher, Douglas (Vanderbilt University) | Green, Nancy (University of North Carolina, Greensboro) | Hinkelmann, Knut (University of Applied Sciences Northwestern Switzerland FHNW) | Maher, Mary Lou (University of Maryland) | McLurkin, James (Rice University) | Maheswaran, Rajiv (University of Southern California) | Rubinelli, Sara (University of Lucerne) | Schurr, Nathan (Aptima, Inc.) | Scott, Donia (University of Sussex) | Shell, Dylan (Texas A&M University) | Szekely, Pedro (University of Southern California) | Thönssen, Barbara (University of Applied Sciences Northwestern Switzerland FHNW) | Urken, Arnold B. (University of Arizona)
The titles of the eight symposia were Artificial Intelligence and Health Communication, Artificial Intelligence and Sustainable Design, Artificial Intelligence for Business Agility, Computational Physiology, Help Me Help You: Bridging the Gaps in Human-Agent Collaboration, Logical Formalizations of Commonsense Reasoning, Multirobot Systems and Physical Data Structures, and Modeling Complex Adaptive Systems As If They Were Voting Processes. The goal of the Artificial Intelligence and Health Communication symposium was to advance the conceptual design of automated systems that provide health services to patients and consumers through interdisciplinary insight from artificial intelligence, health communication and related areas of communication studies, discourse studies, public health, and psychology. There is a large and growing interest in the development of automated systems to provide health services to patients and consumers. In the last two decades, applications informed by research in health communication have been developed, for example, for promoting healthy behavior and for managing chronic diseases. While the value that these types of applications can offer to the community in terms of cost, access, and convenience is clear, there are still major challenges facing design of effective health communication systems. Overall, the participants found the format of the symposium engaging and constructive, and they The symposium was organized around five main expressed the desire to continue this initiative in concepts: (1) Patient empowerment and education further events.
A Sketch Recognition System for Recognizing Free-Hand Course of Action Diagrams
Hammond, Tracy Anne (Texas A&M University) | Logsdon, Drew (Texas A&M University) | Paulson, Brandon (Texas A&M University) | Johnston, Joshua (Texas A&M University) | Peschel, Joshua (Texas A&M University) | Wolin, Aaron (Texas A&M University) | Taele, Paul (Texas A&M University)
Military course-of-action (COA) diagrams are used to depict battle scenarios and include thousands of unique symbols, complete with additional textual and designator modifiers. We have created a real-time sketch recognition interface that recognizes 485 freely-drawn military course-of-action sym- bols. When the variations (not allowable by other systems) are factored in, our system is several orders of magnitude larger than the next biggest system. On 5,900 hand-drawn symbols, the system achieves an accuracy of 90% when con- sidering the top 3 interpretations and requiring every aspect of the shape (variations, text, symbol, location, orientation) to be correct.
A Low False Negative Filter for Detecting Rare Bird Species from Short Video Segments using a Probable Observation Data Set-based EKF Method
Song, Dezhen (Texas A&M University) | Xu, Yiliang (Texas A&M University)
We report a new filter for assisting the search for rare bird species. Since a rare bird only appears in front of the camera with very low occurrence (e.g. less than ten times per year) for very short duration (e.g. less than a fraction of a second), our algorithm must have very low false negative rate. We verify the bird body axis information with the known bird flying dynamics from the short video segment. Since a regular extended Kalman filter (EKF) cannot converge due to high measurement error and limited data, we develop a novel Probable Observation Data Set (PODS)-based EKF method. The new PODS-EKF searches the measurement error range for all probable observation data that ensures the convergence of the corresponding EKF in short time frame. The algorithm has been extensively tested in experiments. The results show that the algorithm achieves 95.0% area under ROC curve in physical experiment with close to zero false negative rate.
Error Aware Monocular Visual Odometry using Vertical Line Pairs for Small Robots in Urban Areas
Zhang, Ji (Texas A&M University) | Song, Dezhen (Texas A&M University)
We report a new error-aware monocular visual odometry method that only uses vertical lines, such as vertical edges of buildings and poles in urban areas as landmarks. Since vertical lines are easy to extract, insensitive to lighting conditions/ shadows, and sensitive to robot movements on the ground plane, they are robust features if compared with regular point features or line features. We derive a recursive visual odometry method based on the vertical line pairs. We analyze how errors are propagated and introduced in the continuous odometry process by deriving the closed form representation of covariance matrix. We formulate the minimum variance ego-motion estimation problem and present a method that outputs weights for different vertical line pairs. The resulting visual odometry method is tested in physical experiments and compared with two existing methods that are based on point features and line features, respectively. The experiment results show that our method outperforms its two counterparts in robustness, accuracy, and speed. The relative errors of our method are less than 2% in experiments.
Hashigo: A Next-Generation Sketch Interactive System for Japanese Kanji
Taele, Paul (Texas A&M University) | Hammond, Tracy (Texas A&M University)
Language students can increase their effectiveness in learning written Japanese by mastering the visual structure and written technique of Japanese kanji. Yet, existing kanji handwriting recognition systems do not assess the written technique sufficiently enough to discourage students from developing bad learning habits. In this paper, we describe our work on Hashigo, a kanji sketch interactive system which achieves human instructor-level critique and feedback on both the visual structure and written technique of students’ sketched kanji. This type of automated critique and feedback allows students to target and correct specific deficiencies in their sketches that, if left untreated, are detrimental to effective long-term kanji learning.
A Coh-Metrix Analysis of Variation among Biomedical Abstracts
Duncan, Benjamin (Texas A&M University) | Hall, Charles (University of Memphis)
Using the already validated Coh-Metrix tool, this study examines whether there are significant linguistic and discourse differences between biomedical abstracts for American and Korean English. Also, the current study accounts for variation among journals’ countries of origin, distinguishing between biomedical journals published in the United States from biomedical journals published in South Korea. The significance of these studies regards the growing number of second language (L2) biomedical researchers attempting to publish their research in national and international English-language journals, but who find themselves locked out of the discussion because of differences in linguistic and discourse conventions. The present study aims to provide a more thorough and quantitative understanding of the prototypical linguistic components in biomedical rhetoric, and to suggest how word-, sentence-, and discourse-level structures can be researched, taught, and developed into materials. This improved understanding is expected to provide a powerful apparatus for the promotion of L2 English writers in the biomedical field.