AITopics

2110.13805

Country:

South America > Brazil > São Paulo (0.04)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Transportation > Ground > Road (1.00)
Automobiles & Trucks (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)

Gonzalez, Mauricio E., Silva, Jorge F., Videla, Miguel, Orchard, Marcos E.

Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation

arXiv.org Machine LearningOct-26-2021

The empirical log-likelihood statistic is adopted to approximate the sufficient statistics of an oracle test against independence (that knows the two hypotheses). It is shown that approximating the sufficient statistics of the oracle test offers a learning criterion for designing a data-driven partition that connects with the problem of mutual information estimation. Applying these ideas in the context of a data-dependent tree-structured partition (TSP), we derive conditions on the TSP's parameters to achieve a strongly consistent distribution-free test of independence over the family of probabilities equipped with a density. Complementing this result, we present finite-length results that show our TSP scheme's capacity to detect the scenario of independence structurally with the data-driven partition as well as new sampling complexity bounds for this detection. Finally, some experimental analyses provide evidence regarding our scheme's advantage for testing independence compared with some strategies that do not use data-driven representations.

independence, partition, trade-off, (15 more...)

2110.14122

Country:

North America > United States > New York (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre:

Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Zadeh, Amir, Benoit, Santiago, Morency, Louis-Philippe

Relay Variational Inference: A Method for Accelerated Encoderless VI

arXiv.org Machine LearningOct-26-2021

Variational Inference (VI) offers a method for approximating intractable likelihoods. In neural VI, inference of approximate posteriors is commonly done using an encoder. Alternatively, encoderless VI offers a framework for learning generative models from data without encountering suboptimalities caused by amortization via an encoder (e.g. in presence of missing or uncertain data). However, in absence of an encoder, such methods often suffer in convergence due to the slow nature of gradient steps required to learn the approximate posterior parameters. In this paper, we introduce Relay VI (RVI), a framework that dramatically improves both the convergence and performance of encoderless VI. In our experiments over multiple datasets, we study the effectiveness of RVI in terms of convergence speed, loss, representation power and missing data imputation. We find RVI to be a unique tool, often superior in both performance and convergence speed to previously proposed encoderless as well as amortized VI models (e.g.

approximate posterior, datapoint, experiment, (15 more...)

2110.13422

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)

Genre: Research Report > New Finding (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

arXiv.org Machine LearningOct-26-2021

Gradient-based Quadratic Multiform Separation

Chang, Wen-Teng

Classification as a supervised learning concept is an important content in machine learning. It aims at categorizing a set of data into classes. There are several commonly-used classification methods nowadays such as k-nearest neighbors, random forest, and support vector machine. Each of them has its own pros and cons, and none of them is invincible for all kinds of problems. In this thesis, we focus on Quadratic Multiform Separation (QMS), a classification method recently proposed by Michael Fan et al. (2019). Its fresh concept, rich mathematical structure, and innovative definition of loss function set it apart from the existing classification methods. Inspired by QMS, we propose utilizing a gradient-based optimization method, Adam, to obtain a classifier that minimizes the QMS-specific loss function. In addition, we provide suggestions regarding model tuning through explorations of the relationships between hyperparameters and accuracies. Our empirical result shows that QMS performs as good as most classification methods in terms of accuracy. Its superior performance is almost comparable to those of gradient boosting algorithms that win massive machine learning competitions.

accuracy, dataset, loss function, (15 more...)

2110.13006

Country:

North America > United States (0.14)
South America > Peru (0.04)
South America > Ecuador (0.04)
(33 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.54)

#artificialintelligenceOct-25-2021, 08:55:21 GMT

Apple selects Chinese giant for critical iPhone role - California News Times

This article is an on-site version of the #techAsia newsletter.sign up here Send newsletter directly to your inbox every Wednesday Hello, Kenji from Tokyo this week is currently undergoing home quarantine for Covid-19. For our big story, there is another scoop about Apple from Nikkei Asia. China's state-owned enterprise has become a supplier of the latest flagship iPhone displays. This shows how advanced China's technology, including artificial intelligence, has advanced, as warned by a former Pentagon chief software officer (Mercedes Top 10). Meanwhile, China is building and diversifying its sources of strategic mineral resources, including lithium, a key component of the world's leading electric vehicle industry (our views, smart data and spotlights).

beijing, china, nikkei asia, (9 more...)

#artificialintelligence

Country:

North America > United States > California (0.40)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.25)
Asia > China > Beijing > Beijing (0.09)
(9 more...)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Materials > Metals & Mining > Lithium (1.00)
(2 more...)

Technology:

Information Technology > Communications > Mobile (0.76)
Information Technology > Artificial Intelligence (0.70)

Applications and Techniques for Fast Machine Learning in Science

Deiana, Allison McCarn, Tran, Nhan, Agar, Joshua, Blott, Michaela, Di Guglielmo, Giuseppe, Duarte, Javier, Harris, Philip, Hauck, Scott, Liu, Mia, Neubauer, Mark S., Ngadiuba, Jennifer, Ogrenci-Memik, Seda, Pierini, Maurizio, Aarrestad, Thea, Bahr, Steffen, Becker, Jurgen, Berthold, Anne-Sophie, Bonventre, Richard J., Bravo, Tomas E. Muller, Diefenthaler, Markus, Dong, Zhen, Fritzsche, Nick, Gholami, Amir, Govorkova, Ekaterina, Hazelwood, Kyle J, Herwig, Christian, Khan, Babar, Kim, Sehoon, Klijnsma, Thomas, Liu, Yaling, Lo, Kin Ho, Nguyen, Tri, Pezzullo, Gianantonio, Rasoulinezhad, Seyedramin, Rivera, Ryan A., Scholberg, Kate, Selig, Justin, Sen, Sougata, Strukov, Dmitri, Tang, William, Thais, Savannah, Unger, Kai Lukas, Vilalta, Ricardo, Krosigk, Belinavon, Warburton, Thomas K., Flechas, Maria Acosta, Aportela, Anthony, Calvet, Thomas, Cristella, Leonardo, Diaz, Daniel, Doglioni, Caterina, Galati, Maria Domenica, Khoda, Elham E, Fahim, Farah, Giri, Davide, Hawks, Benjamin, Hoang, Duc, Holzman, Burt, Hsu, Shih-Chieh, Jindariani, Sergo, Johnson, Iris, Kansal, Raghav, Kastner, Ryan, Katsavounidis, Erik, Krupa, Jeffrey, Li, Pan, Madireddy, Sandeep, Marx, Ethan, McCormack, Patrick, Meza, Andres, Mitrevski, Jovan, Mohammed, Mohammed Attia, Mokhtar, Farouk, Moreno, Eric, Nagu, Srishti, Narayan, Rohin, Palladino, Noah, Que, Zhiqiang, Park, Sang Eon, Ramamoorthy, Subramanian, Rankin, Dylan, Rothman, Simon, Sharma, Ashish, Summers, Sioni, Vischia, Pietro, Vlimant, Jean-Roch, Weng, Olivia

In this community review report, we discuss applications and techniques for fast machine learning (ML) in science -- the concept of integrating power ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML across a number of scientific domains; techniques for training and implementing performant and resource-efficient ML algorithms; and computing architectures, platforms, and technologies for deploying these algorithms. We also present overlapping challenges across the multiple scientific domains where common solutions can be found. This community report is intended to give plenty of examples and inspiration for scientific discovery through integrated and accelerated ML solutions. This is followed by a high-level overview and organization of technical advances, including an abundance of pointers to source material, which can enable these breakthroughs.

neural information processing system, pattern recognition, real time system, (20 more...)

doi: 10.3389/fdata.2022.787421

2110.13041

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Washington > King County > Seattle (0.13)
North America > United States > Florida > Alachua County > Gainesville (0.13)
(43 more...)

Genre:

Research Report > Promising Solution (1.00)
Overview (1.00)

Industry:

Semiconductors & Electronics (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Architecture > Real Time Systems (1.00)
(5 more...)

Schratz, Patrick, Becker, Marc, Lang, Michel, Brenning, Alexander

Mlr3spatiotempcv: Spatiotemporal resampling methods for machine learning in R

arXiv.org Machine LearningOct-25-2021

Spatial and spatiotemporal prediction tasks are common in applications ranging from environmental sciences to archaeology and epidemiology. While sophisticated mathematical frameworks have long been developed in spatial statistics to characterize predictive uncertainties under well-defined mathematical assumptions such as intrinsic stationarity (e.g., Cressie 1993), computational estimation procedures have only been proposed more recently to assess predictive performances of spatial and spatiotemporal prediction models (Brenning 2005, 2012; Pohjankukka, Pahikkala, Nevalainen, and Heikkonen 2017; Roberts, Bahn, Ciuti, Boyce, Elith, Guillera-Arroita, Hauenstein, Lahoz-Monfort, Schröder, Thuiller, Warton, Wintle, Hartig, and Dormann 2017). Although alternatives such as the bootstrap exist since some decades (Efron and Gong 1983; Hand 1997), cross-validation (CV) is a particularly well-established, easy-to-implement algorithm for model assessment of supervised machine-learning models (Efron and Gong 1983, and next section) and model selection (Arlot and Celisse 2010). In its basic form, CV is based on resampling the data without paying attention to any possible dependence structure, which may arise from, e.g., grouped or structured data, or underlying environmental processes inducing some sort of spatial coherence at the landscape scale. In treating dependent observations as independent, or ignoring autocorrelation, CV test samples may in fact be heavily correlated with, or even pseudo-replicates of, the data used for training the model, which introduces a potentially severe bias in assessing the transferability of flexible machine-learning (ML) models.

model assessment, partition, spatiotemporal, (14 more...)

2110.12674

Country:

North America > United States > New York (0.04)
Europe > Spain > Aragón (0.04)
Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry:

Energy (0.46)
Food & Agriculture > Agriculture (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Bertoli, Gustavo de Carvalho, Junior, Lourenço Alves Pereira, Verri, Filipe Alves Neto, Santos, Aldri Luiz dos, Saotome, Osamu

Bridging the gap to real-world for network intrusion detection systems with data-centric approach

Most research using machine learning (ML) for network intrusion detection systems (NIDS) uses well-established datasets such as KDD-CUP99, NSL-KDD, UNSW-NB15, and CICIDS-2017. In this context, the possibilities of machine learning techniques are explored, aiming for metrics improvements compared to the published baselines (model-centric approach). However, those datasets present some limitations as aging that make it unfeasible to transpose those ML-based solutions to real-world applications. This paper presents a systematic data-centric approach to address the current limitations of NIDS research, specifically the datasets. This approach generates NIDS datasets composed of the most recent network traffic and attacks, with the labeling process integrated by design.

dataset, intrusion detection system, traffic, (16 more...)

2110.13655

Country:

South America > Brazil > Minas Gerais (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Portugal (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Peregrino, Ana Alice, Pradhan, Soham, Liu, Zhicheng, Ferreira, Nivan, Miranda, Fabio

Transportation Scenario Planning with Graph Neural Networks

To enable data-driven scenario planning, we take the flows is, therefore, a requisite to better plan urban areas. In this first steps in leveraging the Geo-contextual Multitask Embedding context, an important task is to study hypothetical scenarios in Learner (GMEL) model, previously proposed in Liu et al. [16], as our which possible future changes are evaluated. For instance, how the base model for predicting commuting flows based on geographic increase in residential units or transportation modes in a neighborhood information (e.g., infrastructure, land use, transportation). Commuting will change the commuting flows to or from that region? In flows are defined as flows between a workers' residence this paper, we propose to leverage GMEL, a recently introduced location and a workplace location. While major cities have the resources graph neural network model, to evaluate changes in commuting to collect and process high-resolution land use data, other flows taking into account different land use and infrastructure scenarios.

census tract, curitiba, scenario, (13 more...)

2110.13202

Country:

South America > Brazil > Pernambuco > Recife (0.09)
South America > Brazil > Paraná > Curitiba (0.07)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > New York (0.04)

Genre: Research Report (0.64)

Industry: Law (0.78)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Probabilistic Hierarchical Forecasting with Deep Poisson Mixtures

Olivares, Kin G., Meetei, Nganba, Ma, Ruijun, Reddy, Rohan, Cao, Mengfei

Hierarchical forecasting problems arise when time series compose a group structure that naturally defines aggregation and disaggregation coherence constraints for the predictions. In this work, we explore a new forecast representation, the Poisson Mixture Mesh (PMM), that can produce probabilistic, coherent predictions; it is compatible with the neural forecasting innovations, and defines simple aggregation and disaggregation rules capable of accommodating hierarchical structures, unknown during its optimization. We performed an empirical evaluation to compare the PMM \ to other hierarchical forecasting methods on Australian domestic tourism data, where we obtain a 20 percent relative improvement.

dataset, prediction, time sery, (13 more...)

2110.13179

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Oceania > Australia > New South Wales (0.04)
(9 more...)

Genre: Research Report (1.00)

Industry:

Energy (1.00)
Consumer Products & Services > Travel (0.56)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)