Information Retrieval
A Game Theoretic Analysis of the Adversarial Retrieval Setting
Ben Basat, Ran, Tennenholtz, Moshe, Kurland, Oren
The main goal of search engines is ad hoc retrieval: ranking documents in a corpus by their relevance to the information need expressed by a query. The Probability Ranking Principle (PRP) --- ranking the documents by their relevance probabilities --- is the theoretical foundation of most existing ad hoc document retrieval methods. A key observation that motivates our work is that the PRP does not account for potential post-ranking effects; specifically, changes to documents that result from a given ranking. Yet, in adversarial retrieval settings such as the Web, authors may consistently try to promote their documents in rankings by changing them. We prove that, indeed, the PRP can be sub-optimal in adversarial retrieval settings. We do so by presenting a novel game theoretic analysis of the adversarial setting. The analysis is performed for different types of documents (single-topic and multi-topic) and is based on different assumptions about the writing qualities of documents' authors. We show that in some cases, introducing randomization into the document ranking function yields an overall user utility that transcends that of applying the PRP.
Solving Google Cache Issue Multilingual Search Engine Optimization
After Google Penguin 6 ( aka Penguin 3.0) update, some webmasters asked me why Google does not cache their updated pages. Before talking about Google Cache issue let's have an overview on the differences between Google index and Google Cache. An "indexed" webpage is a site that has been crawled by a search engine spider and filed away in their index for later use, and a "cached" page is one that may show up in search results. Submitting pages listed in the SERPs is a multi-step task and process. First, the crawlers need to access your website and scan the page.
Search engine optimization tips and tricks for 2018
Search engine optimization tips and tricks for 2018: SEO stands for SEARCH ENGINE OPTIMIZATION. It is the technique to appear your website on top of Search Engine's results. It is the part of the DIGITAL MARKETING. In another word you can say it is the process to drive the huge traffic to your website. Traffic should be organic, editorial, natural.
A Gentle Introduction to Applied Machine Learning as a Search Problem - Machine Learning Mastery
Applied machine learning is challenging because the designing of a perfect learning system for a given problem is intractable. There is no best training data or best algorithm for your problem, only the best that you can discover. The application of machine learning is best thought of as search problem for the best mapping of inputs to outputs given the knowledge and resources available to you for a given project. In this post, you will discover the conceptualization of applied machine learning as a search problem. A Gentle Introduction to Applied Machine Learning as a Search Problem Photo by tonko43, some rights reserved.
Ella brings smart searching to home security cameras
Her name is Ella and she promises to bring smart searching to your home security cameras. Ella is an AI-powered search engine, developed by IC Realtime, that augments residential and commercial surveillance systems with natural language search capabilities. The average continuously recording security camera captures less than two minutes of noteworthy footage in a 24-hour period, IC Realtime CEO Matt Sailor said during an embargoed briefing last week. Scrubbing through a day's worth of video just to retrieve that data is a tedious time-suck. Even with the time- and date-sorting parameters typically offered by most consumer cameras, reviewing video can be arduous.
Multilingual Topic Models
Krstovski, Kriste, Kurtz, Michael J., Smith, David A., Accomazzi, Alberto
Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document representation schemes possess different cost-benefit tradeoffs. In this paper, we propose to model different representations of the same article as translations of each other, all generated from a common latent representation in a multilingual topic model. We start with a methodological overview on latent variable models for parallel document representations that could be used across many information science tasks. We then show how solving the inference problem of mapping diverse representations into a shared topic space allows us to evaluate representations based on how topically similar they are to the original article. In addition, our proposed approach provides means to discover where different concept vocabularies require improvement.
microsoft-bing-reddit-search-engine-partnership-ama-subreddit-content
Microsoft today announced a partnership with Reddit to surface the social news site's content high up in Bing search results, so that searching for specific Reddit communities, pages, and info will spit back information gleaned from subreddits, AMAs, and other threads in a dedicated section. The news, part of a suite of new artificial intelligence-powered features underlying Bing's search function, works by way of what Microsoft calls intelligent search. The company says intelligent search uses natural language processing to pair Reddit content with the appropriate search terms. The Reddit integration works in three ways. First, by typing in the name of a subreddit on Bing, you'll get a live snapchat of that subreddit's top threads displayed in the search results with links to the individual pages.
Microsoft is making subreddits more searchable with Bing
Microsoft today announced a partnership with Reddit to surface the social news site's content high up in Bing search results, so that searching for specific Reddit communities, pages, and info will spit back information gleaned from subreddits, AMAs, and other threads in a dedicated section. The news, part of a suite of new artificial intelligence-powered features underlying Bing's search function, works by way of what Microsoft calls intelligent search. The company says intelligent search uses natural language processing to pair Reddit content with the appropriate search terms. The Reddit integration works in three ways. First, by typing in the name of a subreddit on Bing, you'll get a live snapchat of that subreddit's top threads displayed in the search results with links to the individual pages. Bing will also include Reddit AMA questions and answers for a celebrity who's done an AMA, with the content showing up on the person's search card, which will be accessible just by searching that person's name.
Patterns in Fruit Fly Brains Could Soon Power Your Netflix Recommendations
Researchers have identified an incredibly smart method used by fruit flies to categorise odours โ and it's so clever it could be applied to powering recommendation algorithms for the likes of Netflix or Spotify. In the same way that YouTube might want to flag up videos similar to the one you've just watched, fruit flies โ like many other animals โ need to know which smells are similar, for finding food and avoiding poisonous substances. The team from the University of California San Diego (UCSD) and the Salk Institute for Biological Studies in California has found that fruit flies have an especially clever way of categorising odours which lets them recognise differences with a very fine level of accuracy. "In the natural world, you're not going to encounter exactly the same odour every time; there's going to be some noise and fluctuation," says one of the researchers, Saket Navlakha from Salk. "But if you smell something that you've previously associated with a behaviour, you need to be able to identify that similarity and recall that behaviour."
Microsoft announces new AI-powered search features for Bing
Today, Microsoft announced a series of artificial intelligence-driven features for its Bing search engine to make it more conversational and nuanced. The news, unveiled at an event in San Francisco, means that Bing will make better use of object recognition, so-called machine reading (for parsing text and extracting meaning), and other techniques tuned and improved using AI training methods. Search results will now show both multiple perspectives and multiple sources, culled from a list of pre-approved news sources, to show Bing users different sides of issues ranging from the benefits and downsides of kale to the pros and cons of contentious political issues. This builds on an earlier feature, announced back in September, in which Bing added fact checks to search results in an effort to cut down on misinformation, fake news, and other distorted stories from manipulative information sources. In a new partnership with social news site Reddit, Bing will also surface information from subreddits right in search results by using algorithms to read and analyze the user-generated text across Reddit's many communities. The integration includes AMA questions and answers populated within the search card for popular celebrities, AskReddit-sourced answers to broad service questions, and top threads for specific subreddits that will show up in search results just by searching the name of the community.