Performance Analysis
Coupled Graphs and Tensor Factorization for Recommender Systems and Community Detection
Ioannidis, Vassilis N., Zamzam, Ahmed S., Giannakis, Georgios B., Sidiropoulos, Nicholas D.
Single and coupled matrix-tensor factorization (CMTF) has been widely used in this context for imputation-based recommendation from ratings, social network, and other user-item data. When this side information is in the form of item-item correlation matrices or graphs, existing CMTF algorithms may fall short. Alleviating current limitations, we introduce a novel model coined coupled graph-tensor factorization (CGTF) that judiciously accounts for graph-related side information. The CGTF model has the potential to overcome practical challenges, such as missing slabs from the tensor and/or missing rows/columns from the correlation matrices. A novel alternating direction method of multipliers (ADMM) is also developed that recovers the nonnegative factors of CGTF. Our algorithm enjoys closed-form updates that result in reduced computational complexity and allow for convergence claims. A novel direction is further explored by employing the interpretable factors to detect graph communities having the tensor as side information. The resulting community detection approach is successful even when some links in the graphs are missing. Results with real data sets corroborate the merits of the proposed methods relative to state-of-the-art competing factorization techniques in providing recommendations and detecting communities.
Attention-based Encoder-Decoder Networks for Spelling and Grammatical Error Correction
Automatic spelling and grammatical correction systems are one of the most widely used tools within natural language applications. In this thesis, we assume the task of error correction as a type of monolingual machine translation where the source sentence is potentially erroneous and the target sentence should be the corrected form of the input. Our main focus in this project is building neural network models for the task of error correction. In particular, we investigate sequence-to-sequence and attention-based models which have recently shown a higher performance than the state-of-the-art of many language processing problems. We demonstrate that neural machine translation models can be successfully applied to the task of error correction. While the experiments of this research are performed on an Arabic corpus, our methods in this thesis can be easily applied to any language.
The unreasonable effectiveness of small neural ensembles in high-dimensional brain
Gorban, A. N., Makarov, V. A., Tyukin, I. Y.
Despite the widely-spread consensus on the brain complexity, sprouts of the single neuron revolution emerged in neuroscience in the 1970s. They brought many unexpected discoveries, including grandmother or concept cells and sparse coding of information in the brain. In machine learning for a long time, the famous curse of dimensionality seemed to be an unsolvable problem. Nevertheless, the idea of the blessing of dimensionality becomes gradually more and more popular. Ensembles of non-interacting or weakly interacting simple units prove to be an effective tool for solving essentially multidimensional problems. This approach is especially useful for one-shot (non-iterative) correction of errors in large legacy artificial intelligence systems. These simplicity revolutions in the era of complexity have deep fundamental reasons grounded in geometry of multidimensional data spaces. To explore and understand these reasons we revisit the background ideas of statistical physics. In the course of the 20th century they were developed into the concentration of measure theory. New stochastic separation theorems reveal the fine structure of the data clouds. We review and analyse biological, physical, and mathematical problems at the core of the fundamental question: how can high-dimensional brain organise reliable and fast learning in high-dimensional world of data by simple tools? Two critical applications are reviewed to exemplify the approach: one-shot correction of errors in intellectual systems and emergence of static and associative memories in ensembles of single neurons.
Colombia false positive scandal: Families demand 'greater truth'
Bogota - Carmenza Gomez was planning a surprise Christmas dinner in the winter of 2008 to celebrate having her eight children back together under one roof in their home in an impoverished suburb in Bogota, the capital of Colombia. That summer, the family had finally been reunited after years apart due to the sons' military service. It was months away, but Carmenza wanted to throw an elaborate dinner to share their first Christmas together in years. But just days after the last of her sons arrived home, 23-year-old Victor Fernando, her third youngest, disappeared. "I didn't tell any of them what I was planning [for Christmas]," Carmenza recalled nearly a decade later.
Distances for WiFi Based Topological Indoor Mapping
Schäfermeier, Bastian, Hanika, Tom, Stumme, Gerd
For localization and mapping of indoor environments through WiFi signals, locations are often represented as likelihoods of the received signal strength indicator. In this work we compare various measures of distance between such likelihoods in combination with different methods for estimation and representation. In particular, we show that among the considered distance measures the Earth Mover's Distance seems the most beneficial for the localization task. Combined with kernel density estimation we were able to retain the topological structure of rooms in a real-world office scenario.
Inferring short-term volatility indicators from Bitcoin blockchain
Antulov-Fantulin, Nino, Tolic, Dijana, Piskorec, Matija, Ce, Zhang, Vodenska, Irena
Blockchain as a new technology has a potential to change the traditional way of communication, contracting, and financial management. The first and still most popular use of blockchain technology is its use as a digital currency, or cryptocurrency, as a part of the the Bitcoin protocol [1]. There the payments are processed by a peer-to-peer Bitcoin network where users announce new transactions and which are verified by network nodes and recorded in a blockchain - a public distributed ledger. Beyond its usage in cryptocurrencies, blockchain technology's essential importance is to offer a new way to record and store confidential information.
Runtime Monitoring Neuron Activation Patterns
Cheng, Chih-Hong, Nührenberg, Georg, Yasuoka, Hirotoshi
For using neural networks in safety critical domains, it is important to know if a decision made by a neural network is supported by prior similarities in training. We propose runtime neuron activation pattern monitoring - after the standard training process, one creates a monitor by feeding the training data to the network again in order to store the neuron activation patterns in abstract form. In operation, a classification decision over an input is further supplemented by examining if a pattern similar (measured by Hamming distance) to the generated pattern is contained in the monitor. If the monitor does not contain any pattern similar to the generated pattern, it raises a warning that the decision is not based on the training data. Our experiments show that, by adjusting the similarity-threshold for activation patterns, the monitors can report a significant portion of misclassfications to be not supported by training with a small false-positive rate, when evaluated on a test set.
Efficient Formal Safety Analysis of Neural Networks
Wang, Shiqi, Pei, Kexin, Whitehouse, Justin, Yang, Junfeng, Jana, Suman
Neural networks are increasingly deployed in real-world safety-critical domains such as autonomous driving, aircraft collision avoidance, and malware detection. However, these networks have been shown to often mispredict on inputs with minor adversarial or even accidental perturbations. Consequences of such errors can be disastrous and even potentially fatal as shown by the recent Tesla autopilot crash. Thus, there is an urgent need for formal analysis systems that can rigorously check neural networks for violations of different safety properties such as robustness against adversarial perturbations within a certain $L$-norm of a given image. An effective safety analysis system for a neural network must be able to either ensure that a safety property is satisfied by the network or find a counterexample, i.e., an input for which the network will violate the property. Unfortunately, most existing techniques for performing such analysis struggle to scale beyond very small networks and the ones that can scale to larger networks suffer from high false positives and cannot produce concrete counterexamples in case of a property violation. In this paper, we present a new efficient approach for rigorously checking different safety properties of neural networks that significantly outperforms existing approaches by multiple orders of magnitude. Our approach can check different safety properties and find concrete counterexamples for networks that are 10$\times$ larger than the ones supported by existing analysis techniques. We believe that our approach to estimating tight output bounds of a network for a given input range can also help improve the explainability of neural networks and guide the training process of more robust neural networks.
Towards Accountable AI: Hybrid Human-Machine Analyses for Characterizing System Failure
Nushi, Besmira, Kamar, Ece, Horvitz, Eric
As machine learning systems move from computer-science laboratories into the open world, their accountability becomes a high priority problem. Accountability requires deep understanding of system behavior and its failures. Current evaluation methods such as single-score error metrics and confusion matrices provide aggregate views of system performance that hide important shortcomings. Understanding details about failures is important for identifying pathways for refinement, communicating the reliability of systems in different settings, and for specifying appropriate human oversight and engagement. Characterization of failures and shortcomings is particularly complex for systems composed of multiple machine learned components. For such systems, existing evaluation methods have limited expressiveness in describing and explaining the relationship among input content, the internal states of system components, and final output quality. We present Pandora, a set of hybrid human-machine methods and tools for describing and explaining system failures. Pandora leverages both human and system-generated observations to summarize conditions of system malfunction with respect to the input content and system architecture. We share results of a case study with a machine learning pipeline for image captioning that show how detailed performance views can be beneficial for analysis and debugging.
Wine Ratings Prediction using Machine Learning – Towards Data Science
There hasn't been a day I hadn't heard "Machine Learning", "Deep Learning" or "AI" from a colleague, hacker news etc…the hype is super strong nowadays! After reading through a few books, articles, tutorials about ML, I wanted to graduate from this theory beginner level. I needed to experiment on a real life example. It always worked better when the subject is something that passionates me. So for this practice, I picked wine ().