Goto

Collaborating Authors

 South America


Sampling random graph homomorphisms and applications to network data analysis

arXiv.org Machine Learning

A graph homomorphism is a map between two graphs that preserves adjacency relations. We consider the problem of sampling a random graph homomorphism from a graph $F$ into a large network $\mathcal{G}$. When $\mathcal{G}$ is the complete graph with $q$ nodes, this becomes the well-known problem of sampling uniform $q$-colorings of $F$. We propose two complementary MCMC algorithms for sampling a random graph homomorphisms and establish bounds on their mixing times and concentration of their time averages. Based on our sampling algorithms, we propose a novel framework for network data analysis that circumvents some of the drawbacks in methods based on independent and neigborhood sampling. Various time averages of the MCMC trajectory give us real-, function-, and network-valued computable observables, including well-known ones such as homomorphism density and average clustering coefficient. One of the main observable we propose is called the conditional homomorphism density profile, which reveals hierarchical structure of the network. Furthermore, we show that these network observables are stable with respect to a suitably renormalized cut distance between networks. We also provide various examples and simulations demonstrating our framework through synthetic and real-world networks. For instance, we apply our framework to analyze Word Adjacency Networks of a 45 novels data set and propose an authorship attribution scheme using motif sampling and conditional homomorphism density profiles.


The SWAX Benchmark: Attacking Biometric Systems with Wax Figures

arXiv.org Artificial Intelligence

A face spoofing attack occurs when an intruder attempts to impersonate someone who carries a gainful authentication clearance. It is a trending topic due to the increasing demand for biometric authentication on mobile devices, high-security areas, among others. This work introduces a new database named Sense Wax Attack dataset (SWAX), comprised of real human and wax figure images and videos that endorse the problem of face spoofing detection. The dataset consists of more than 1800 face images and 110 videos of 55 people/waxworks, arranged in training, validation and test sets with a large range in expression, illumination and pose variations. Experiments performed with baseline methods show that despite the progress in recent years, advanced spoofing methods are still vulnerable to high-quality violation attempts.


Global Artificial Intelligence (AI) in Automotive Market – Global Industry Analysis and Forecast (2017-2026) - Markets Gazette

#artificialintelligence

Global Artificial Intelligence (AI) in Automotive Market has valued 566.80 Mn in 2016 and is estimated to reach US$ 10,600.3 Global Artificial Intelligence (AI) in Automotive Market is segmented by technology, offering, process, application, and geography. By technology, Global Artificial Intelligence (AI) in the automotive market is divided into Computer Vision, Machine Learning, Context Awareness, natural language processing. Based on the offering, Artificial Intelligence (AI) in Automotive Market is categorized hardware and software. By process, the market is fragmented into Data Mining, Signal Recognition, and Image Recognition.


Retool AI to forecast and limit wars

#artificialintelligence

Armed violence is on the rise and we don't know how to stop it1. Since 2011, conflicts worldwide have killed up to 100,000 people a year, three-quarters of whom were in Afghanistan, Iraq and Syria. The rate of major wars has decreased over the past few decades. But the number of civil conflicts has doubled since the 1960s, and terrorist attacks have become more frequent in the past ten years. The nature of conflict is changing.


A Deepfake Deep Dive into the Murky World of Digital Imitation

#artificialintelligence

About a year ago, top deepfake artist Hao Li came to a disturbing realization: Deepfakes, i.e. the technique of human-image synthesis based on artificial intelligence (AI) to create fake content, is rapidly evolving. In fact, Li believes that in as soon as six months, deepfake videos will be completely undetectable. And that's spurring security and privacy concerns as the AI behind the technology becomes commercialized – and gets in the hands of malicious actors. Li, for his part, has seen the positives of the technology as a pioneering computer graphics and vision researcher, particularly for entertainment. He has worked his magic on various high-profile deepfake applications – from leading the charge in putting Paul Walker into Furious 7 after the actor died before the film finished production, to creating the facial-animation technology that Apple now uses in its Animoji feature in the iPhone X. But now, "I believe it will soon be a point where it isn't possible to detect if videos are fake or not," Li told Threatpost.


Artificial Intelligence in Education Market Projected to Garner Significant Revenues by 2017 - 2025 - StatsFlash

#artificialintelligence

The global artificial intelligence and education Market is significantly driven by the integration of intelligent algorithms as well as Advanced Technologies in to e-learning platforms. Education software, machine learning, and artificial intelligence are some of the Innovative learning models and Technologies change the rules and creating tremendous shift from the teaching methods. These technologies have completely transformed with a classroom. The sophistication level has increased tremendously with the increasing adoption of artificial intelligence and machine learning algorithms. These Technologies are becoming extremely useful for developing user-friendly decision support systems and used in knowledge acquisition applications, language translation, and information retrieval.


Beyond Word Embedding: Key Ideas in Document Embedding - KDnuggets

#artificialintelligence

Word embedding -- the mapping of words into numerical vector spaces -- has proved to be an incredibly important method for natural language processing (NLP) tasks in recent years, enabling various machine learning models that rely on vector representation as input to enjoy richer representations of text input. These representations preserve more semantic and syntactic information on words, leading to improved performance in almost every imaginable NLP task. Both the novel idea itself and its tremendous impact have led researchers to consider the problem of how to provide this boon of richer vector representations to larger units of texts -- from sentences to books. This effort has resulted in a slew of new methods to produce these mappings, with various innovative solutions to the problem and some notable breakthroughs. This post is meant to present the different ways practitioners have come up with to produce document embeddings. Note: I use the word document here to refer to any sequence of words, ranging from sentences and paragraphs through social media posts all way up to articles, books and more complexly structured text documents (e.g. In this post, I will touch upon not only approaches that are direct extensions of word embedding techniques (e.g., in the way doc2vec extends word2vec), but also other notable techniques that produce -- sometimes among other outputs -- a mapping of documents to vectors in ℝⁿ. I will also try to provide links and references to both the original papers and code implementations of the reviewed methods whenever possible. Note: This topic is somewhat related, but not equivalent, to the problem of learning structured text representations (e.g., Liu & Lapata, 2018). The ability to map documents to informative vector representations has a wide range of applications.


Attention Enriched Deep Learning Model for Breast Tumor Segmentation in Ultrasound Images

arXiv.org Machine Learning

Incorporating human expertise and domain knowledge is particularly important for medical image processing applications, marked with small datasets, and objects of interests in the form of organs or lesions not typically seen in traditional datasets. However, the incorporation of prior knowledge for breast tumor detection is challenging, since shape, boundary, curvature, intensity, or other common medical priors vary significantly across patients and cannot be employed. This work proposes an approach for integrating visual saliency into a deep learning model for breast tumor segmentation in ultrasound images. Visual saliency emphasizes regions that are more likely to attract radiologists' visual attention and stand out from its surrounding. Our approach is based on a U-Net model and employs attention blocks to introduce visual saliency. Such model forces learning feature representations that prioritize spatial regions with high levels of saliency. The approach is validated using a dataset of 510 breast ultrasound images.


Privacy- and Utility-Preserving Textual Analysis via Calibrated Multivariate Perturbations

arXiv.org Machine Learning

Accurately learning from user data while providing quantifiable privacy guarantees provides an opportunity to build better ML models while maintaining user trust. This paper presents a formal approach to carrying out privacy preserving text perturbation using the notion of dx-privacy designed to achieve geo-indistinguishability in location data. Our approach applies carefully calibrated noise to vector representation of words in a high dimension space as defined by word embedding models. We present a privacy proof that satisfies dx-privacy where the privacy parameter epsilon provides guarantees with respect to a distance metric defined by the word embedding space. We demonstrate how epsilon can be selected by analyzing plausible deniability statistics backed up by large scale analysis on GloVe and fastText embeddings. We conduct privacy audit experiments against 2 baseline models and utility experiments on 3 datasets to demonstrate the tradeoff between privacy and utility for varying values of epsilon on different task types. Our results demonstrate practical utility (< 2% utility loss for training binary classifiers) while providing better privacy guarantees than baseline models.


Global Artificial Intelligence Robots Market Business Planning Research and Resources, Supply and Revenue By 2025 - WeeklySpy

#artificialintelligence

The Artificial Intelligence Robots Market report is a complete overview of the market, covering various aspects product definition, segmentation based on various parameters, and the prevailing vendor landscape. Analysis and discussion of important industry trends, market size, market share estimates are mentioned in the report. Artificial Intelligence Robots Market report includes historic data, present market trends, environment, technological innovation, upcoming technologies and the technical progress in the related industry. The Global Artificial Intelligence Robots Market accounted for USD 3.0 billion in 2017 and is projected to grow at a CAGR of 30.1% forecast to 2025. Some of the major countries covered in this report are U.S., Canada, Germany, France, U.K., Netherlands, Switzerland, Turkey, Russia, China, India, South Korea, Japan, Australia, Singapore, Saudi Arabia, South Africa and Brazil among others.