Performance Analysis
Extrapolating continuous color emotions through deep learning
Ram, Vishaal, Schaposnik, Laura P., Konstantinou, Nikos, Volkan, Eliz, Papadatou-Pastou, Marietta, Manav, Banu, Jonauskaite, Domicele, Mohr, Christine
To carry out our mathematical study, we have used the standard Decimal Code (R,G,B) to represent the 12 colours of [12], a depiction of which is in Figure 1. The relation between colours and human emotion has been studied for more than a century (e.g., see for instance [1-8]). Even longer ago, colours were commonly associated to emotions in a universal manner that allowed populations to understand quickly the given emotions. Figure 1: A depiction of the 12 colors used in [12]. For example, for centuries in many cultures it has been said that someone "had the blues" [29] or "is feeling In the last decades colours have also been studied in blue" when being down or sad. As explained in [9], the terms of emotional reactions to color hue, saturation, and phrase "feeling blue" comes from deepwater sailing ships: brightness (e.g., [14, 15]). Here, we shall put the two If a ship lost the captain or any of the officers during its approaches together to consider a novel path, where we voyage, then blue flags would be shown, and a blue band let the colour association within our neural network take would be painted along the entire hull when returning to a continuum of colours, hence considering a continuous home port. RGB analysis [30], depicted in Figure 2. Inspired by [10, 11] we consider their data base [12] to analize the correlation between colours and emotions via a deep learning approach. Whilst machine learning techniques have been used before in this direction (e.g.
Sketching Datasets for Large-Scale Learning (long version)
Gribonval, Rรฉmi, Chatalic, Antoine, Keriven, Nicolas, Schellekens, Vincent, Jacques, Laurent, Schniter, Philip
This article considers "sketched learning," or "compressive learning," an approach to large-scale machine learning where datasets are massively compressed before learning (e.g., clustering, classification, or regression) is performed. In particular, a "sketch" is first constructed by computing carefully chosen nonlinear random features (e.g., random Fourier features) and averaging them over the whole dataset. Parameters are then learned from the sketch, without access to the original dataset. This article surveys the current state-of-the-art in sketched learning, including the main concepts and algorithms, their connections with established signal-processing methods, existing theoretical guarantees---on both information preservation and privacy preservation, and important open problems.
How Traditional Machine Learning Is Holding Cybersecurity Back
While global cybersecurity spending now surpasses $100 billion annually, 64 percent of enterprises were compromised in 2018, according to a study by the Ponemon Institute. The standard answer is that wily cyber-criminals are employing ever-evolving, increasingly sophisticated attack methods, part of a never-ending game of cat-and-mouse in which they all too often outsmart the good guys. This is undoubtedly true โ but the root of the problem is that traditional machine learning-based cybersecurity solutions fail to keep up with the growing sophistication of today's cyber threats, both those that are created by hackers and AI alike. Why does machine learning so often come up short โ and how should cybersecurity evolve to meet the scale and complexity of the challenge? There's no question that machine learning has driven significant improvements in cybersecurity.
Analyzing the Performance of the Classification Models in Machine Learning
Confusion matrix (also called Error matrix) is used to analyze how well the Classification Models (like Logistic Regression, Decision Tree Classifier, etc.) performs. Why do we analyze the performance of the models? Analyzing the performance of the models helps us to find and eliminate the bias and variance problem if exist and it also helps us to fine-tune the model so that the model produces more accurate results. Confusion Matrix is usually applied to Binary classification problems but can be extended to Multi-class classification problems as well. Concepts are comprehended better when illustrated with examples so let us consider an example.
A Survey on the Use of AI and ML for Fighting the COVID-19 Pandemic
Islam, Muhammad Nazrul, Inan, Toki Tahmid, Rafi, Suzzana, Akter, Syeda Sabrina, Sarker, Iqbal H., Islam, A. K. M. Najmul
Artificial intelligence (AI) and machine learning (ML) have made a paradigm shift in health care which, eventually can be used for decision support and forecasting by exploring the medical data. Recent studies showed that AI and ML can be used to fight against the COVID-19 pandemic. Therefore, the objective of this review study is to summarize the recent AI and ML based studies that have focused to fight against COVID-19 pandemic. From an initial set of 634 articles, a total of 35 articles were finally selected through an extensive inclusion-exclusion process. In our review, we have explored the objectives/aims of the existing studies (i.e., the role of AI/ML in fighting COVID-19 pandemic); context of the study (i.e., study focused to a specific country-context or with a global perspective); type and volume of dataset; methodology, algorithms or techniques adopted in the prediction or diagnosis processes; and mapping the algorithms/techniques with the data type highlighting their prediction/classification accuracy. We particularly focused on the uses of AI/ML in analyzing the pandemic data in order to depict the most recent progress of AI for fighting against COVID-19 and pointed out the potential scope of further research.
Modeling and Prediction of Human Driver Behavior: A Survey
Brown, Kyle, Driggs-Campbell, Katherine, Kochenderfer, Mykel J.
We present a review and taxonomy of 200 models from the literature on driver behavior modeling. We begin by introducing a mathematical formulation based on the partially observable stochastic game, which serves as a common framework for comparing and contrasting different driver models. Our taxonomy is constructed around the core modeling tasks of state estimation, intention estimation, trait estimation, and motion prediction, and also discusses the auxiliary tasks of risk estimation, anomaly detection, behavior imitation and microscopic traffic simulation. Existing driver models are categorized based on the specific tasks they address and key attributes of their approach.
Accuracy and Fairness Trade-offs in Machine Learning: A Stochastic Multi-Objective Approach
Liu, Suyun, Vicente, Luis Nunes
In the application of machine learning to real-life decision-making systems, e.g., credit scoring and criminal justice, the prediction outcomes might discriminate against people with sensitive attributes, leading to unfairness. The commonly used strategy in fair machine learning is to include fairness as a constraint or a penalization term in the minimization of the prediction loss, which ultimately limits the information given to decision-makers. In this paper, we introduce a new approach to handle fairness by formulating a stochastic multi-objective optimization problem for which the corresponding Pareto fronts uniquely and comprehensively define the accuracy-fairness trade-offs. We have then applied a stochastic approximation-type method to efficiently obtain well-spread and accurate Pareto fronts, and by doing so we can handle training data arriving in a streaming way.
Instance Explainable Temporal Network For Multivariate Timeseries
Madiraju, Naveen, Karimabadi, Homa
Although deep networks have been widely adopted, one of their shortcomings has been their blackbox nature. One particularly difficult problem in machine learning is multivariate time series (MVTS) classification. MVTS data arise in many applications and are becoming ever more pervasive due to explosive growth of sensors and IoT devices. Here, we propose a novel network (IETNet) that identifies the important channels in the classification decision for each instance of inference. This feature also enables identification and removal of non-predictive variables which would otherwise lead to overfit and/or inaccurate model. IETNet is an end-to-end network that combines temporal feature extraction, variable selection, and joint variable interaction into a single learning framework. IETNet utilizes an 1D convolutions for temporal features, a novel channel gate layer for variable-class assignment using an attention layer to perform cross channel reasoning and perform classification objective. To gain insight into the learned temporal features and channels, we extract region of interest attention map along both time and channels. The viability of this network is demonstrated through a multivariate time series data from N body simulations and spacecraft sensor data.
F*: An Interpretable Transformation of the F-measure
Hand, David J., Christen, Peter, Kirielle, Nishadi
The F-measure is widely used to assess the performance of classification algorithms. However, some researchers find it lacking in intuitive interpretation, questioning the appropriateness of combining two aspects of performance as conceptually distinct as precision and recall, and also questioning whether the harmonic mean is the best way to combine them. To ease this concern, we describe a simple transformation of the F-measure, which we call F* (F-star), which has an immediate practical interpretation.
New approach to MPI program execution time prediction
Chupakhin, A., Kolosov, A., Smeliansky, R., Antonenko, V., Ishelev, G.
The problem of MPI programs execution time prediction on a certain set of computer installations is considered. This problem emerges with orchestration and provisioning a virtual infrastructure in a cloud computing environment over a heterogeneous network of computer installations: supercomputers or clusters of servers (e.g. mini data centers). One of the key criteria for the effectiveness of the cloud computing environment is the time staying by the program inside the environment. This time consists of the waiting time in the queue and the execution time on the selected physical computer installation, to which the computational resource of the virtual infrastructure is dynamically mapped. One of the components of this problem is the estimation of the MPI programs execution time on a certain set of computer installations. This is necessary to determine a proper choice of order and place for program execution. The article proposes two new approaches to the program execution time prediction problem. The first one is based on computer installations grouping based on the Pearson correlation coefficient. The second one is based on vector representations of computer installations and MPI programs, so-called embeddings. The embedding technique is actively used in recommendation systems, such as for goods (Amazon), for articles (Arxiv.org), for videos (YouTube, Netflix). The article shows how the embeddings technique helps to predict the execution time of a MPI program on a certain set of computer installations.