Goto

Collaborating Authors

Tennis


AI player creates strikingly realistic virtual tennis matches based on real players

#artificialintelligence

A team of researchers at Stanford University has created an artificial intelligence-based player called the Vid2Player that is capable of generating startlingly realistic tennis matches--featuring real professional players. They have written a paper describing their work and have uploaded it to the arXiv preprint server. They have also uploaded a YouTube video demonstrating their player. Video game companies have put a lot of time and effort into making their games look realistic, but thus far, have found it tough going when depicting human beings. In this new effort, the researchers have taken a different approach to the task--instead of trying to create human-looking characters from scratch, they use sprites, which are characters based on video of real people.


Inside the 'brain' of IBM Watson: how 'cognitive computing' is poised to change your life

#artificialintelligence

During the British summer, conversations about sport become almost ubiquitous. This year, however, one participant in those conversations was very different: IBM Watson, IBM's cognitive intelligence. The All England Lawn Tennis Club knew that 2016 would feature unusually fierce competition for attention, with the Tour de France and Euro 2016 taking place alongside Wimbledon. More than ever before, social media was going to be a vital tool in directing that conversation, and directing attention to SW19. Wimbledon's "Cognitive Command Centre" – powered by Watson's intelligence running on a hybrid, IBM-managed cloud - scanned social media for emerging news and trends.


Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

arXiv.org Machine Learning

Describing visual content in a natural-language utterance is an emerging interdisciplinary problem, which lies at the intersection of computer vision (CV) and natural language processing (NLP) ((1)). As a sentence-level short image caption ((2, 3, 4)) has a limited descriptive capacity, (5) introduce a paragraphlevel caption method that aims to generate a detailed and coherent paragraph for describing an image in a finer manner. Recent advances in image paragraph generation focus on building different types of hierarchical recurrent neural network (HRNN), e.g., LSTM ((6)), to generate the visual paragraphs. For HRNN, the high-level RNN recursively produces a sequence of sentence-level topic vectors given the image features as the input, while the low-level RNN is subsequently adopted to decode each topic vector into an output sentence. By modeling each sentence and coupling the sentences into one paragraph, these hierarchical architectures often outperform the flat models ((5)). To improve the performance and generate more diverse paragraphs, advanced methods, extending the HRNN based on generative adversarial network (GAN) ((7)) or variational auto-encoders (VAE) ((8)), are proposed by (9) and (10).


Game Plan: What AI can do for Football, and What Football can do for AI

Journal of Artificial Intelligence Research

The rapid progress in artificial intelligence (AI) and machine learning has opened unprecedented analytics possibilities in various team and individual sports, including baseball, basketball, and tennis. More recently, AI techniques have been applied to football, due to a huge increase in data collection by professional teams, increased computational power, and advances in machine learning, with the goal of better addressing new scientific challenges involved in the analysis of both individual players’ and coordinated teams’ behaviors. The research challenges associated with predictive and prescriptive football analytics require new developments and progress at the intersection of statistical learning, game theory, and computer vision. In this paper, we provide an overarching perspective highlighting how the combination of these fields, in particular, forms a unique microcosm for AI research, while offering mutual benefits for professional teams, spectators, and broadcasters in the years to come. We illustrate that this duality makes football analytics a game changer of tremendous value, in terms of not only changing the game of football itself, but also in terms of what this domain can mean for the field of AI. We review the state-of-the-art and exemplify the types of analysis enabled by combining the aforementioned fields, including illustrative examples of counterfactual analysis using predictive models, and the combination of game-theoretic analysis of penalty kicks with statistical learning of player attributes. We conclude by highlighting envisioned downstream impacts, including possibilities for extensions to other sports (real and virtual).


Chat about anything with Human-Like Open-Domain Chatbot

#artificialintelligence

Most of today's chatbots are highly specific in their conversations (according to their domain of usage) and users can't afford to drift away from their expected use. They are not good with retaining context from past conversations, sometimes give meaningless, illogical responses and quite easily give the response, "I don't know". Open-domain chatbots are conversational agents that can chat about anything and have basic knowledge about the real world. In the research paper "Towards a Human-like Open-Domain Chatbot", Google introduced Meena. Meena is claimed to be the smartest chatbot, highly sensible and specific in its responses, unlike other chatbots.


Bias in facial recognition isn't hard to discover, but it's hard to get rid of

#artificialintelligence

Joy Buolamwini is a researcher at the MIT Media Lab who pioneered research into bias that's built into artificial intelligence and facial recognition. And the way she came to this work is almost a little too on the nose. As a graduate student at MIT, she created a mirror that would project aspirational images onto her face, like a lion or tennis star Serena Williams. But the facial-recognition software she installed wouldn't work on her Black face, until she literally put on a white mask. Buolamwini is featured in a documentary called "Coded Bias," airing tonight on PBS.


Council Post: We Need To Talk About An Energy Label For AI

#artificialintelligence

Artificial intelligence (AI) can distinguish a dog from a cat, but the billions of calculations needed to do so demand quite a lot of energy. The human brain can do the same thing while using only a small fraction of this energy. Could this phenomenon inspire us to develop more energy-efficient AI systems? Our computational power has risen exponentially, enabling the widespread use of artificial intelligence, a technology that relies on processing huge amounts of data to recognize patterns. When we use the recommendation algorithm of our favorite streaming service, we usually don't realize the gigantic energy consumption behind it.


Deep Dynamic Neural Network to trade-off between Accuracy and Diversity in a News Recommender System

arXiv.org Artificial Intelligence

The news recommender systems are marked by a few unique challenges specific to the news domain. These challenges emerge from rapidly evolving readers' interests over dynamically generated news items that continuously change over time. News reading is also driven by a blend of a reader's long-term and short-term interests. In addition, diversity is required in a news recommender system, not only to keep the reader engaged in the reading process but to get them exposed to different views and opinions. In this paper, we propose a deep neural network that jointly learns informative news and readers' interests into a unified framework. We learn the news representation (features) from the headlines, snippets (body) and taxonomy (category, subcategory) of news. We learn a reader's long-term interests from the reader's click history, short-term interests from the recent clicks via LSTMSs and the diversified reader's interests through the attention mechanism. We also apply different levels of attention to our model. We conduct extensive experiments on two news datasets to demonstrate the effectiveness of our approach.


VisualGPT: Data-efficient Image Captioning by Balancing Visual Input and Linguistic Knowledge from Pretraining

arXiv.org Artificial Intelligence

In this paper, we aim to improve the data efficiency of image captioning. We propose VisualGPT, a data-efficient image captioning model that leverages the linguistic knowledge from a large pretrained language model (LM). A crucial challenge is to balance between the use of visual information in the image and prior linguistic knowledge acquired from pretraining.We designed a novel self-resurrecting encoder-decoder attention mechanism to quickly adapt the pretrained LM as the language decoder on a small amount of in-domain training data. The pro-posed self-resurrecting activation unit produces sparse activations but is not susceptible to zero gradients. When trained on 0.1%, 0.5% and 1% of MSCOCO and Conceptual Captions, the proposed model, VisualGPT, surpasses strong image captioning baselines. VisualGPT outperforms the best baseline model by up to 10.8% CIDEr on MS COCO and up to 5.4% CIDEr on Conceptual Captions.We also perform a series of ablation studies to quantify the utility of each system component. To the best of our knowledge, this is the first work that improves data efficiency of image captioning by utilizing LM pretrained on unimodal data. Our code is available at: https://github.com/Vision-CAIR/VisualGPT.


Council Post: We Need To Talk About An Energy Label For AI

#artificialintelligence

Artificial intelligence (AI) can distinguish a dog from a cat, but the billions of calculations needed to do so demand quite a lot of energy. The human brain can do the same thing while using only a small fraction of this energy. Could this phenomenon inspire us to develop more energy-efficient AI systems? Our computational power has risen exponentially, enabling the widespread use of artificial intelligence, a technology that relies on processing huge amounts of data to recognize patterns. When we use the recommendation algorithm of our favorite streaming service, we usually don't realize the gigantic energy consumption behind it.