South America
A discriminative condition-aware backend for speaker verification
Ferrer, Luciana, McLaren, Mitchell
We present a scoring approach for speaker verification that mimics the standard PLDA-based backend process used in most current speaker verification systems. However, unlike the standard backends, all parameters of the model are jointly trained to optimize the binary cross-entropy for the speaker verification task. We further integrate the calibration stage inside the model, making the parameters of this stage depend on metadata vectors that represent the conditions of the signals. We show that the proposed backend has excellent out-of-the-box calibration performance on most of our test sets, making it an ideal approach for cases in which the test conditions are not known and development data is not available for training a domain-specific calibration model.
Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions
Nasir, Osaid Rehman, Jha, Shailesh Kumar, Grover, Manraj Singh, Yu, Yi, Kumar, Ajit, Shah, Rajiv Ratn
--Powerful generative adversarial networks (GAN) have been developed to automatically synthesize realistic images from text. However, most existing tasks are limited to generating simple images such as flowers from captions. In this work, we extend this problem to the less addressed domain of face generation from fine-grained textual descriptions of face, e.g., "A person has curly hair, oval face, and mustache" . We are motivated by the potential of automated face generation to impact and assist critical tasks such as criminal face reconstruction. Since current datasets for the task are either very small or do not contain captions, we generate captions for images in the CelebA dataset by creating an algorithm to automatically convert a list of attributes to a set of captions. We then model the highly multi-modal problem of text to face generation as learning the conditional distribution of faces (conditioned on text) in same latent space. We utilize the current state-of-the-art GAN (DC-GAN with GAN-CLS loss) for learning conditional multi-modality. The presence of more fine-grained details and variable length of the captions makes the problem easier for a user but more difficult to handle compared to the other text-to-image tasks. We flipped the labels for real and fake images and added noise in discriminator . Generated images for diverse textual descriptions show promising results. In the end, we show how the widely used inceptions score is not a good metric to evaluate the performance of generative models used for synthesizing faces from text. I NTRODUCTION Photographic text-to-face synthesis is a mainstream problem with potential applications in image editing, video games, or for accessibility.
Let's train humans first...before we train machines P2P Foundation
In reality, there is nothing artificial about these algorithms or their intelligence, and the term "AI" is a mystification! The term that describes the reality is "Human-Trained Machine Learning", in today's mad scramble to train these algorithms to mimic human intelligence and brain functioning. In the techie magazine WIRED, October 2018, we meet a pioneering computer scientist, Fei-Fei LI, testifying at a Congressional hearing, who underlines this truth. She said, "Humans train these algorithms" and she talked about the horrendous mistakes these machines make in mis-identifying people, using the term "bias in--bias out" updating the old computer saying, "garbage in--garbage out". Professor LI described how we are ceding our authority to these algorithms to judge who gets hired, who goes to jail, who gets a loan, a mortgage or good insurance rates -- and how these machines code our behavior, change our rules and our lives.
Facebook built a facial recognition app that could 'identify any member of the social network'
Facebook is under fire for privacy concerns once again, as the social media giant tested a facial recognition app on its employees. Using real-time facial recognition, the firm was able to identify a person by pointing a smartphone camera at them. It was reported that the app has been discontinued, but the technology was capable of bringing up someone's Facebook profile who had enabled facial recognition on their profiles. Facebook did confirm that it developed the app, but denied it was capable of identifying members of its social media network and pulling up their profile. Facebook is under fire for privacy concerns once again, as the social media giant revealed it tested a facial recognition app on its employees.
Best Artificial Intelligence Logistics Startups -- Transmetrics Blog
This article about the best artificial intelligence logistics startups is part of the "Logistics of the Future" series looking at the top logistics startups today. We are officially living in the age of Artificial Intelligence. It's everywhere we look, from AI-powered personal assistants to predictive analytics to making medical diagnoses, Artificial Intelligence is making incredible advances across all industries. In fact, a recent report on the state of Artificial Intelligence for enterprises found that supply chain and operations are some of the top areas where businesses are driving revenue from AI investment. Why is AI making such a big difference in the logistics and supply chain, particularly?
How Artificial Intelligence (AI) is Transforming Mobile Technology? - Media Releases - CSO
Marketresearch.biz points out that the competitive landscape in the global Mobile Artificial Intelligence market is fairly consolidated. "If you are involved in the Mobile Artificial Intelligence industry or intend to be, then this study will provide you a comprehensive outlook. It's vital information to keep your market knowledge up to date." Mobile Artificial Intelligence Market 2019 report gives key quantification available status of the Mobile Artificial Intelligence Manufacturers and is a consequential wellspring of direction and bearing for organizations and people inspired by the Mobile Artificial Intelligence Industry. In the Mobile Artificial Intelligence Market report, there is an area for rivalry scenes of the ecumenical Mobile Artificial Intelligence Industry.
Global AI Survey: AI proves its worth, but few scale impact
Adoption of artificial intelligence (AI) continues to increase, and the technology is generating returns. 1 1. We define artificial intelligence (AI) as the ability of a machine to perform cognitive functions that we associate with human minds (such as perceiving, reasoning, learning, and problem solving) and to perform physical tasks using cognitive functions (for example, physical robotics, autonomous driving, and manufacturing work). The findings of the latest McKinsey Global Survey on the subject show a nearly 25 percent year-over-year increase in the use of AI 2 2. We define AI use in standard business processes as embedded AI in at least one product or business process for at least one function or business unit. The online survey was in the field from March 26 to April 5, 2019, and garnered responses from 2,360 participants representing the full range of regions, industries, company sizes, functional specialties, and tenures. Of these respondents, 1,872 work at companies they say have piloted AI in at least one function or business unit, embedded at least one AI capability in at least one product or business process for at least one function or business unit, or embedded at least one AI capability in products or business processes across multiple functions or business units.
Efficient Global String Kernel with Random Features: Beyond Counting Substructures
Wu, Lingfei, Yen, Ian En-Hsu, Huo, Siyu, Zhao, Liang, Xu, Kun, Ma, Liang, Ji, Shouling, Aggarwal, Charu
Analysis of large-scale sequential data has been one of the most crucial tasks in areas such as bioinformatics, text, and audio mining. Existing string kernels, however, either (i) rely on local features of short substructures in the string, which hardly capture long discriminative patterns, (ii) sum over too many substructures, such as all possible subsequences, which leads to diagonal dominance of the kernel matrix, or (iii) rely on non-positive-definite similarity measures derived from the edit distance. Furthermore, while there have been works addressing the computational challenge with respect to the length of string, most of them still experience quadratic complexity in terms of the number of training samples when used in a kernel-based classifier. In this paper, we present a new class of global string kernels that aims to (i) discover global properties hidden in the strings through global alignments, (ii) maintain positive-definiteness of the kernel, without introducing a diagonal dominant kernel matrix, and (iii) have a training cost linear with respect to not only the length of the string but also the number of training string samples. To this end, the proposed kernels are explicitly defined through a series of different random feature maps, each corresponding to a distribution of random strings. We show that kernels defined this way are always positive-definite, and exhibit computational benefits as they always produce \emph{Random String Embeddings (RSE)} that can be directly used in any linear classification models. Our extensive experiments on nine benchmark datasets corroborate that RSE achieves better or comparable accuracy in comparison to state-of-the-art baselines, especially with the strings of longer lengths. In addition, we empirically show that RSE scales linearly with the increase of the number and the length of string.
Interpretable Charge Prediction for Criminal Cases with Dynamic Rationale Attention
Chao, Wenhan (State Key Laboratory of Software Development Environment, Beijing, China, School of Computer Science and Engineering, Beihang University, Beijing, China) | Jiang, Xin (School of Computer Science and Engeering, Beihang University, Beijing, China) | Luo, Zhunchen (Information Research Center of Military Science, PLA Academy of Military Science, Beijing, China) | Hu, Yakun (School of Computer Science and Engineering, Beihang University, Beijing, China) | Ma, Wenjia (School of Computer Science and Engineering, Beihang University, Beijing, China)
Charge prediction which aims to determine appropriate charges for criminal cases based on textual fact descriptions, is an important technology in the field of AI&Law. Previous works focus on improving prediction accuracy, ignoring the interpretability, which limits the methods' applicability. In this work, we propose a deep neural framework to extract short but charge-decisive text snippets - rationales - from input fact description, as the interpretation of charge prediction. To solve the scarcity problem of rationale annotated corpus, rationales are extracted in a reinforcement style with the only supervision in the form of charge labels. We further propose a dynamic rationale attention mechanism to better utilize the information in extracted rationales and predict the charges. Experimental results show that besides providing charge prediction interpretation, our approach can also capture subtle details to help charge prediction.