Goto

Collaborating Authors

 South America


Automating Gamification Personalization: To the User and Beyond

arXiv.org Artificial Intelligence

Personalized gamification explores knowledge about the users to tailor gamification designs to improve one-size-fits-all gamification. The tailoring process should simultaneously consider user and contextual characteristics (e.g., activity to be done and geographic location), which leads to several occasions to tailor. Consequently, tools for automating gamification personalization are needed. The problems that emerge are that which of those characteristics are relevant and how to do such tailoring are open questions, and that the required automating tools are lacking. We tackled these problems in two steps. First, we conducted an exploratory study, collecting participants' opinions on the game elements they consider the most useful for different learning activity types (LAT) via survey. Then, we modeled opinions through conditional decision trees to address the aforementioned tailoring process. Second, as a product from the first step, we implemented a recommender system that suggests personalized gamification designs (which game elements to use), addressing the problem of automating gamification personalization. Our findings i) present empirical evidence that LAT, geographic locations, and other user characteristics affect users' preferences, ii) enable defining gamification designs tailored to user and contextual features simultaneously, and iii) provide technological aid for those interested in designing personalized gamification. The main implications are that demographics, game-related characteristics, geographic location, and LAT to be done, as well as the interaction between different kinds of information (user and contextual characteristics), should be considered in defining gamification designs and that personalizing gamification designs can be improved with aid from our recommender system.


Of Non-Linearity and Commutativity in BERT

arXiv.org Artificial Intelligence

In this work we provide new insights into the transformer architecture, and in particular, its best-known variant, BERT. First, we propose a method to measure the degree of non-linearity of different elements of transformers. Next, we focus our investigation on the feed-forward networks (FFN) inside transformers, which contain 2/3 of the model parameters and have so far not received much attention. We find that FFNs are an inefficient yet important architectural element and that they cannot simply be replaced by attention blocks without a degradation in performance. Moreover, we study the interactions between layers in BERT and show that, while the layers exhibit some hierarchical structure, they extract features in a fuzzy manner. Our results suggest that BERT has an inductive bias towards layer commutativity, which we find is mainly due to the skip connections. This provides a justification for the strong performance of recurrent and weight-shared transformer models.


Signal Processing on Higher-Order Networks: Livin' on the Edge ... and Beyond

arXiv.org Machine Learning

This tutorial paper presents a didactic treatment of the emerging topic of signal processing on higher-order networks. Drawing analogies from discrete and graph signal processing, we introduce the building blocks for processing data on simplicial complexes and hypergraphs, two common abstractions of higher-order networks that can incorporate polyadic relationships.We provide basic introductions to simplicial complexes and hypergraphs, making special emphasis on the concepts needed for processing signals on them. Leveraging these concepts, we discuss Fourier analysis, signal denoising, signal interpolation, node embeddings, and non-linear processing through neural networks in these two representations of polyadic relational structures. In the context of simplicial complexes, we specifically focus on signal processing using the Hodge Laplacian matrix, a multi-relational operator that leverages the special structure of simplicial complexes and generalizes desirable properties of the Laplacian matrix in graph signal processing. For hypergraphs, we present both matrix and tensor representations, and discuss the trade-offs in adopting one or the other. We also highlight limitations and potential research avenues, both to inform practitioners and to motivate the contribution of new researchers to the area.


The Edge: What Does It Mean For Artificial Intelligence?

#artificialintelligence

The edge is an end point where data is generated through some type of interface, device or sensor. Keep in mind that the technology is nothing new. But in light of the rapid innovations in a myriad of categories, the edge has become a major growth business. "The edge brings the intelligence as close as possible to the data source and the point of action," said Teresa Tung, who is the Managing Director at Accenture Labs. "This is important because while centralized cloud computing makes it easier and cheaper to process data at scale, there are times when it doesn't make sense to send data off to the cloud for processing."


Japan considers vehicle-mounted laser to ward off drone attacks

The Japan Times

The Defense Ministry will begin research on ways to ward off drone attacks by using vehicle-mounted laser, according to informed sources. By mounting laser equipment on vehicles, the ministry aims to raise the mobility of the system. The ministry included ¥2.8 billion in research spending in its budget for fiscal 2021. It aims to establish related technology as early as fiscal 2024 and put it into practical use at an early date. In fiscal 2018, the ministry started research on using high-energy laser to destroy drones.


"Brilliant AI Doctor" in Rural China: Tensions and Challenges in AI-Powered CDSS Deployment

arXiv.org Artificial Intelligence

Artificial intelligence (AI) technology has been increasingly used in the implementation of advanced Clinical Decision Support Systems (CDSS). Research demonstrated the potential usefulness of AI-powered CDSS (AI-CDSS) in clinical decision making scenarios. However, post-adoption user perception and experience remain understudied, especially in developing countries. Through observations and interviews with 22 clinicians from 6 rural clinics in China, this paper reports the various tensions between the design of an AI-CDSS system ("Brilliant Doctor") and the rural clinical context, such as the misalignment with local context and workflow, the technical limitations and usability barriers, as well as issues related to transparency and trustworthiness of AI-CDSS. Despite these tensions, all participants expressed positive attitudes toward the future of AI-CDSS, especially acting as "a doctor's AI assistant" to realize a Human-AI Collaboration future in clinical settings. Finally we draw on our findings to discuss implications for designing AI-CDSS interventions for rural clinical contexts in developing countries.


The Edge: What Does It Mean For AI (Artificial Ingelligence)?

#artificialintelligence

Edge computing technology with distributed network performing computation and data storage near the ... [ ] user instead of in the cloud, internet service for IoT, gamelets and AI recognition, concept The edge is an end point where data is generated through some type of interface, device or sensor. Keep in mind that the technology is nothing new. But in light of the rapid innovations in a myriad of categories, the edge has become a major growth business. "The edge brings the intelligence as close as possible to the data source and the point of action," said Teresa Tung, who is the Managing Director at Accenture Labs. "This is important because while centralized cloud computing makes it easier and cheaper to process data at scale, there are times when it doesn't make sense to send data off to the cloud for processing."


A Commonsense Reasoning Framework for Explanatory Emotion Attribution, Generation and Re-classification

arXiv.org Artificial Intelligence

In this work we present an explainable system for emotion attribution and recommendation (called DEGARI) relying on a recently introduced commonsense reasoning framework (the TCL logic) which is based on a human-like procedure for the automatic generation of novel concepts in a Description Logics knowledge base. Starting from an ontological formalization of emotions (known as ArsEmotica), the system exploits the logic TCL to automatically generate novel commonsense semantic representations of compound emotions (e.g. Love as derived from the combination of Joy and Trust according to the ArsEmotica model). The generated emotions correspond to prototypes, i.e. commonsense representations of given concepts, and have been used to reclassify emotion-related contents in a variety of artistic domains, ranging from art datasets to the editorial content available in RaiPlay, the online multimedia platform of RAI Radiotelevisione Italiana (the Italian public broadcasting company). We have tested our system (1) by reclassifying the available contents in the tested dataset with respect to the new generated compound emotions (2) with an evaluation, in the form of a controlled user study experiment, of the feasibility of using the obtained reclassifications as recommended emotional content. The obtained results are encouraging and pave the way to many possible further improvements and research directions.


Transforming Multi-Conditioned Generation from Meaning Representation

arXiv.org Artificial Intelligence

In task-oriented conversation systems, natural language generation systems that generate sentences with specific information related to conversation flow are useful. Our study focuses on language generation by considering various information representing the meaning of utterances as multiple conditions of generation. NLG from meaning representations, the conditions for sentence meaning, generally goes through two steps: sentence planning and surface realization. However, we propose a simple one-stage framework to generate utterances directly from MR (Meaning Representation). Our model is based on GPT2 and generates utterances with flat conditions on slot and value pairs, which does not need to determine the structure of the sentence. We evaluate several systems in the E2E dataset with 6 automatic metrics. Our system is a simple method, but it demonstrates comparable performance to previous systems in automated metrics. In addition, using only 10\% of the data set without any other techniques, our model achieves comparable performance, and shows the possibility of performing zero-shot generation and expanding to other datasets.


Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

arXiv.org Artificial Intelligence

In deep learning, models typically reuse the same parameters for all inputs. Mixture of Experts (MoE) defies this and instead selects different parameters for each incoming example. The result is a sparsely-activated model -- with outrageous numbers of parameters -- but a constant computational cost. However, despite several notable successes of MoE, widespread adoption has been hindered by complexity, communication costs and training instability -- we address these with the Switch Transformer. We simplify the MoE routing algorithm and design intuitive improved models with reduced communication and computational costs. Our proposed training techniques help wrangle the instabilities and we show large sparse models may be trained, for the first time, with lower precision (bfloat16) formats. We design models based off T5-Base and T5-Large to obtain up to 7x increases in pre-training speed with the same computational resources. These improvements extend into multilingual settings where we measure gains over the mT5-Base version across all 101 languages. Finally, we advance the current scale of language models by pre-training up to trillion parameter models on the "Colossal Clean Crawled Corpus" and achieve a 4x speedup over the T5-XXL model.