Goto

Collaborating Authors

 Overview


Exploiting Rich Textual User-Product Context for Improving Sentiment Analysis

arXiv.org Artificial Intelligence

User and product information associated with a review is useful for sentiment polarity prediction. Typical approaches incorporating such information focus on modeling users and products as implicitly learned representation vectors. Most do not exploit the potential of historical reviews, or those that currently do require unnecessary modifications to model architecture or do not make full use of user/product associations. The contribution of this work is twofold: i) a method to explicitly employ historical reviews belonging to the same user/product to initialize representations, and ii) efficient incorporation of textual associations between users and products via a user-product cross-context module. Experiments on IMDb, Yelp-2013 and Yelp-2014 benchmarks show that our approach substantially outperforms previous state-of-the-art. Since we employ BERT-base as the encoder, we additionally provide experiments in which our approach performs well with Span-BERT and Longformer. Furthermore, experiments where the reviews of each user/product in the training data are downsampled demonstrate the effectiveness of our approach under a low-resource setting.


Selected Trends in Artificial Intelligence for Space Applications

arXiv.org Artificial Intelligence

The development and adoption of artificial intelligence (AI) technologies in space applications is growing quickly as the consensus increases on the potential benefits introduced. As more and more aerospace engineers are becoming aware of new trends in AI, traditional approaches are revisited to consider the applications of emerging AI technologies. Already at the time of writing, the scope of AI-related activities across academia, the aerospace industry and space agencies is so wide that an in-depth review would not fit in these pages. In this chapter we focus instead on two main emerging trends we believe capture the most relevant and exciting activities in the field: differentiable intelligence and on-board machine learning. Differentiable intelligence, in a nutshell, refers to works making extensive use of automatic differentiation frameworks to learn the parameters of machine learning or related models. Onboard machine learning considers the problem of moving inference as well as learning of machine learning models onboard. Within these fields, we discuss a few selected projects originating from the European Space Agency's (ESA) Advanced Concepts Team (ACT), giving priority to advanced topics going beyond the transposition of established AI techniques and practices to the space domain, thus necessarily leaving out interesting activities with a possibly higher technology readiness level. We start with the topic of differentiable intelligence by introducing Guidance and Control Networks (G&CNets), Eclipse Networks (EclipseNETs), Neural Density Fields (geodesyNets) as well as the use of implicit representations to learn differentiable models for the shapes of asteroids and comets from LiDAR data.


OpenRAN Gym: AI/ML Development, Data Collection, and Testing for O-RAN on PAWR Platforms

arXiv.org Artificial Intelligence

Open Radio Access Network (RAN) architectures will enable interoperability, openness and programmable data-driven control in next generation cellular networks. However, developing and testing efficient solutions that generalize across heterogeneous cellular deployments and scales, and that optimize network performance in such diverse environments is a complex task that is still largely unexplored. In this paper we present OpenRAN Gym, a unified, open, and O-RAN-compliant experimental toolbox for data collection, design, prototyping and testing of end-to-end data-driven control solutions for next generation Open RAN systems. OpenRAN Gym extends and combines into a unique solution several software frameworks for data collection of RAN statistics and RAN control, and a lightweight O-RAN near-real-time RAN Intelligent Controller (RIC) tailored to run on experimental wireless platforms. We first provide an overview of the various architectural components of OpenRAN Gym and describe how it is used to collect data and design, train and test artificial intelligence and machine learning O-RAN-compliant applications (xApps) at scale. We then describe in detail how to test the developed xApps on softwarized RANs and provide an example of two xApps developed with OpenRAN Gym that are used to control a network with 7 base stations and 42 users deployed on the Colosseum testbed. Finally, we show how solutions developed with OpenRAN Gym on Colosseum can be exported to real-world, heterogeneous wireless platforms, such as the Arena testbed and the POWDER and COSMOS platforms of the PAWR program. OpenRAN Gym and its software components are open-source and publicly-available to the research community. By guiding the readers through running experiments with OpenRAN Gym, we aim at providing a key reference for researchers and practitioners working on experimental Open RAN systems.


Startup Shield AI lands $60M to build artificial intelligence 'pilots' for military aircraft

#artificialintelligence

Shield AI, a San Diego startup that's building artificial intelligence "pilots" for military aircraft and drones, has pulled in an additional $60 million in venture capital funding. The money is follow-on investment to a financing that Shield AI announced in June. It brings the total amount raised in the Series E round to $225 million -- made up of $150 million in equity and $75 million in debt. The additional capital came from the U.S. Innovative Technology Fund. Founded in 2015, Shield AI has raised just under $575 million since inception.


Fine-grained Czech News Article Dataset: An Interdisciplinary Approach to Trustworthiness Analysis

arXiv.org Artificial Intelligence

We present the Verifee Dataset: a novel dataset of news articles with fine-grained trustworthiness annotations. We develop a detailed methodology that assesses the texts based on their parameters encompassing editorial transparency, journalist conventions, and objective reporting while penalizing manipulative techniques. We bring aboard a diverse set of researchers from social, media, and computer sciences to overcome barriers and limited framing of this interdisciplinary problem. We collect over $10,000$ unique articles from almost $60$ Czech online news sources. These are categorized into one of the $4$ classes across the credibility spectrum we propose, raging from entirely trustworthy articles all the way to the manipulative ones. We produce detailed statistics and study trends emerging throughout the set. Lastly, we fine-tune multiple popular sequence-to-sequence language models using our dataset on the trustworthiness classification task and report the best testing F-1 score of $0.52$. We open-source the dataset, annotation methodology, and annotators' instructions in full length at https://verifee.ai/research to enable easy build-up work. We believe similar methods can help prevent disinformation and educate in the realm of media literacy.


Counterfactual Explanations for Misclassified Images: How Human and Machine Explanations Differ

arXiv.org Artificial Intelligence

Counterfactual explanations have emerged as a popular solution for the eXplainable AI (XAI) problem of elucidating the predictions of black-box deep-learning systems due to their psychological validity, flexibility across problem domains and proposed legal compliance. While over 100 counterfactual methods exist, claiming to generate plausible explanations akin to those preferred by people, few have actually been tested on users ($\sim7\%$). So, the psychological validity of these counterfactual algorithms for effective XAI for image data is not established. This issue is addressed here using a novel methodology that (i) gathers ground truth human-generated counterfactual explanations for misclassified images, in two user studies and, then, (ii) compares these human-generated ground-truth explanations to computationally-generated explanations for the same misclassifications. Results indicate that humans do not "minimally edit" images when generating counterfactual explanations. Instead, they make larger, "meaningful" edits that better approximate prototypes in the counterfactual class.


Machine Learning Strategies to Improve Generalization in EEG-based Emotion Assessment: \\a Systematic Review

arXiv.org Artificial Intelligence

Emotions are our internal compass and play a primary role in learning, reasoning, decision-making processes, and communication between individuals. The Information and Communication Technology (ICT) sector's interest in emotions has grown tremendously in recent years, shaping the concept of affective computing, an emerging field aimed at monitoring and predicting emotions in order to improve human-computer interaction Cambria et al. (2017); for instance, the introduction of affective loops makes it possible to implement increasingly adaptive human-machine interfaces and virtual assistants tailored to users Saganowski et al. (2020), or the outputs of emotion monitoring systems, in the healthcare context, can be useful in the treatment of psychological disorders based on emotional deficits, in autism Feng et al. (2018), in the improvement of wellbeing Healy et al. (2018), and in stress containment Saganowski (2022). In particular, in this context, there is a growing interest in the literature for Brain-Computer Interface (BCI) systems based on EEG signals Torres et al. (2020). In fact, the number of annual scientific publications indexed on Scopus database on the topic of EEG-based emotion recognition shows an exponential growth trend (see Figure 1). A critical issue underlying the processing and classification of EEG signals is their inherent variability among different subjects or different acquisition times (i.e.


Learning and Extrapolation of Robotic Skills using Task-Parameterized Equation Learner Networks

arXiv.org Artificial Intelligence

Imitation learning approaches achieve good generalization within the range of the training data, but tend to generate unpredictable motions when querying outside this range. We present a novel approach to imitation learning with enhanced extrapolation capabilities that exploits the so-called Equation Learner Network (EQLN). Unlike conventional approaches, EQLNs use supervised learning to fit a set of analytical expressions that allows them to extrapolate beyond the range of the training data. We augment the task demonstrations with a set of task-dependent parameters representing spatial properties of each motion and use them to train the EQLN. At run time, the features are used to query the Task-Parameterized Equation Learner Network (TP-EQLN) and generate the corresponding robot trajectory. The set of features encodes kinematic constraints of the task such as desired height or a final point to reach. We validate the results of our approach on manipulation tasks where it is important to preserve the shape of the motion in the extrapolation domain. Our approach is also compared with existing state-of-the-art approaches, in simulation and in real setups. The experimental results show that TP-EQLN can respect the constraints of the trajectory encoded in the feature parameters, even in the extrapolation domain, while preserving the overall shape of the trajectory provided in the demonstrations.


Deep Learning Methods for Calibrated Photometric Stereo and Beyond: A Survey

arXiv.org Artificial Intelligence

Photometric stereo recovers the surface normals of an object from multiple images with varying shading cues, i.e., modeling the relationship between surface orientation and intensity at each pixel. Photometric stereo prevails in superior per-pixel resolution and fine reconstruction details. However, it is a complicated problem because of the non-linear relationship caused by non-Lambertian surface reflectance. Recently, various deep learning methods have shown a powerful ability in the context of photometric stereo against non-Lambertian surfaces. This paper provides a comprehensive review of existing deep learning-based calibrated photometric stereo methods. We first analyze these methods from different perspectives, including input processing, supervision, and network architecture. We summarize the performance of deep learning photometric stereo models on the most widely-used benchmark data set. This demonstrates the advanced performance of deep learning-based photometric stereo methods. Finally, we give suggestions and propose future research trends based on the limitations of existing models.


Neural Rendering: A Brief Overview - weishaupt.ai

#artificialintelligence

Neural rendering uses deep neural networks to create new images and video from existing scenes. The camera angles, lighting, and other details can be rendered into a realistic model of a 3D scene. In addition, neural rendering of existing images and videos can be used to generate synthetic data. Why it matters: Traditional 3D graphic rendering needs a model with a polygon mesh describing shape, color, and textures, as well as the lighting and camera position. Neural rendering simulates camera physics to separate the 3D scene from the camera capture process, making it easier to create new images from existing images and videos with consistency.