South America
TopoDetect: Framework for Topological Features Detection in Graph Embeddings
Haddad, Maroun, Bouguessa, Mohamed
TopoDetect is a Python package that allows the user to investigate if important topological features, such as the Degree of the nodes, their Triangle Count, or their Local Clustering Score, are preserved in the embeddings of graph representation models. Additionally, the framework enables the visualization of the embeddings according to the distribution of the topological features among the nodes. Moreover, TopoDetect enables us to study the effect of the preservation of these features by evaluating the performance of the embeddings on downstream learning tasks such as clustering and classification.
A Mining Software Repository Extended Cookbook: Lessons learned from a literature review
Barros, Daniel, Horita, Flavio, Wiese, Igor, Silva, Kanan
The main purpose of Mining Software Repositories (MSR) is to discover the latest enhancements and provide an insight into how to make improvements in a software project. In light of it, this paper updates the MSR findings of the original MSR Cookbook, by first conducting a systematic mapping study to elicit and analyze the state-of-the-art, and then proposing an extended version of the Cookbook. This extended Cookbook was built on four high-level themes, which were derived from the analysis of a list of 112 selected studies. Hence, it was used to consolidate the extended Cookbook as a contribution to practice and research in the following areas by: 1) including studies published in all available and relevant publication venues; 2) including and updating recommendations in all four high-level themes, with an increase of 84% in comments in this study when compared with the original MSR Cookbook; 3) summarizing the tools employed for each high-level theme; and 4) providing lessons learned for future studies. Thus, the extended Cookbook examined in this work can support new research projects, as upgraded recommendations and the lessons learned are available with the aid of samples and tools.
Symbolic Register Automata for Complex Event Recognition and Forecasting
Alevizos, Elias, Artikis, Alexander, Paliouras, Georgios
We propose an automaton model which is a combination of symbolic and register automata, i.e., we enrich symbolic automata with memory. We call such automata Symbolic Register Automata (SRA). SRA extend the expressive power of symbolic automata, by allowing Boolean formulas to be applied not only to the last element read from the input string, but to multiple elements, stored in their registers. SRA also extend register automata, by allowing arbitrary Boolean formulas, besides equality predicates. We study the closure properties of SRA under union, intersection, concatenation, Kleene closure, complement and determinization and show that SRA, contrary to symbolic automata, are not in general closed under complement and they are not determinizable. However, they are closed under these operations when a window operator, quintessential in Complex Event Recognition, is used. We show how SRA can be used in Complex Event Recognition in order to detect patterns upon streams of events, using our framework that provides declarative and compositional semantics, and that allows for a systematic treatment of such automata. We also show how the behavior of SRA, as they consume streams of events, can be given a probabilistic description with the help of prediction suffix trees. This allows us to go one step beyond Complex Event Recognition to Complex Event Forecasting, where, besides detecting complex patterns, we can also efficiently forecast their occurrence.
A Study of Low-Resource Speech Commands Recognition based on Adversarial Reprogramming
Yen, Hao, Ku, Pin-Jui, Yang, Chao-Han Huck, Hu, Hu, Siniscalchi, Sabato Marco, Chen, Pin-Yu, Tsao, Yu
In this study, we propose a novel adversarial reprogramming (AR) approach for low-resource spoken command recognition (SCR), and build an AR-SCR system. The AR procedure aims to modify the acoustic signals (from the target domain) to repurpose a pretrained SCR model (from the source domain). To solve the label mismatches between source and target domains, and further improve the stability of AR, we propose a novel similarity-based label mapping technique to align classes. In addition, the transfer learning (TL) technique is combined with the original AR process to improve the model adaptation capability. We evaluate the proposed AR-SCR system on three low-resource SCR datasets, including Arabic, Lithuanian, and dysarthric Mandarin speech. Experimental results show that with a pretrained AM trained on a large-scale English dataset, the proposed AR-SCR system outperforms the current state-of-the-art results on Arabic and Lithuanian speech commands datasets, with only a limited amount of training data.
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Song, Ziang, Mei, Song, Bai, Yu
Multi-agent reinforcement learning has made substantial empirical progresses in solving games with a large number of players. However, theoretically, the best known sample complexity for finding a Nash equilibrium in general-sum games scales exponentially in the number of players due to the size of the joint action space, and there is a matching exponential lower bound. This paper investigates what learning goals admit better sample complexities in the setting of $m$-player general-sum Markov games with $H$ steps, $S$ states, and $A_i$ actions per player. First, we design algorithms for learning an $\epsilon$-Coarse Correlated Equilibrium (CCE) in $\widetilde{\mathcal{O}}(H^5S\max_{i\le m} A_i / \epsilon^2)$ episodes, and an $\epsilon$-Correlated Equilibrium (CE) in $\widetilde{\mathcal{O}}(H^6S\max_{i\le m} A_i^2 / \epsilon^2)$ episodes. This is the first line of results for learning CCE and CE with sample complexities polynomial in $\max_{i\le m} A_i$. Our algorithm for learning CE integrates an adversarial bandit subroutine which minimizes a weighted swap regret, along with several novel designs in the outer loop. Second, we consider the important special case of Markov Potential Games, and design an algorithm that learns an $\epsilon$-approximate Nash equilibrium within $\widetilde{\mathcal{O}}(S\sum_{i\le m} A_i / \epsilon^3)$ episodes (when only highlighting the dependence on $S$, $A_i$, and $\epsilon$), which only depends linearly in $\sum_{i\le m} A_i$ and significantly improves over the best known algorithm in the $\epsilon$ dependence. Overall, our results shed light on what equilibria or structural assumptions on the game may enable sample-efficient learning with many players.
Google Is Using Artificial Intelligence To Control Traffic Lights - AI Summary
Pichai said the company was expanding the pilot program to intersections in Rio de Janeiro, Brazil, and beyond. The effort is part of a series of initiatives Google is launching to give consumers "more sustainable choices." The company will also show "authoritative information" panels from the United Nations and other sources when users search on Google for information related to climate change, Pichai said. Today we're sharing new ways people can use our products to make sustainable choices, including tools to book flights or purchase appliances with lower carbon footprints, a Nest program to support clean energy from home, eco-friendly routes on Maps & more. The company is also introducing a new feature in Google Maps beginning in 2022 that will allow users to pick the most fuel-efficient driving route when navigating.
Can a Robot Invent? The Fight Around AI and Patents Explained
Patent offices and courts around the world are being asked to tackle a similar question: can an artificial intelligence system qualify as an inventor for a patent? A test case making its way through several countries--from Saudi Arabia to Australia to Brazil--has spurred debate about advancements in artificial intelligence technology and questions about whether patent laws need to be revised to recognize machines as inventors. A judge in the U.S. District Court for the Eastern District of Virginia recently ruled that, under current U.S. law, AI can't be listed as an inventor on a patent. The ruling was in line with what U.S., British, and EU patent officials have concluded. The push to recognize AI as an inventor comes from Ryan Abbott, a University of Surrey law professor, and Stephen Thaler, a computer scientist from Missouri.
Google wants to use AI to time traffic lights more efficiently
Oct 6 (Reuters) - Alphabet Inc's (GOOGL.O) Google cut fuel use and traffic delays by 10% to 20% at four locations in Israel by using artificial intelligence to optimize signal lights and it next plans to test the software in Rio de Janeiro, the company said on Wednesday. The early-phase research project is among new software initiatives inside Google to combat climate change. Some employees as well as advocacy groups have called on the company, the world's third-most valuable, to more urgently use its influence to combat the crisis. While Google has not addressed critics' calls to stop selling technology to oil companies or funding lawmakers who deny global warming, it has prioritized sustainability features. Google plans in the coming weeks to allow its Nest thermostat users to buy renewable energy credits for $10 a month to offset emissions from heating and cooling.
Contextual Sentence Classification: Detecting Sustainability Initiatives in Company Reports
Hirlea, Dan, Bryant, Christopher, Rei, Marek
We introduce the novel task of detecting sustainability initiatives in company reports. Given a full report, the aim is to automatically identify mentions of practical activities that a company has performed in order to tackle specific societal issues. As a single initiative can often be described over multiples sentences, new methods for identifying continuous sentence spans needs to be developed. We release a new dataset of company reports in which the text has been manually annotated with sustainability initiatives. We also evaluate different models for initiative detection, introducing a novel aggregation and evaluation methodology. Our proposed architecture uses sequences of five consecutive sentences to account for contextual information when making classification decisions at the individual sentence level.
mRAT-SQL+GAP:A Portuguese Text-to-SQL Transformer
José, Marcelo Archanjo, Cozman, Fabio Gagliardi
The translation of natural language questions to SQL queries has attracted growing attention, in particular in connection with transformers and similar language models. A large number of techniques are geared towards the English language; in this work, we thus investigated translation to SQL when input questions are given in the Portuguese language. To do so, we properly adapted state-of-the-art tools and resources. We changed the RAT-SQL GAP system by relying on a multilingual BART model (we report tests with other language models), and we produced a translated version of the Spider dataset. Our experiments expose interesting phenomena that arise when non-English languages are targeted; in particular, it is better to train with original and translated training datasets together, even if a single target language is desired. This multilingual BART model fine-tuned with a double-size training dataset (English and Portuguese) achieved 83% of the baseline, making inferences for the Portuguese test dataset. This investigation can help other researchers to produce results in Machine Learning in a language different from English.