Goto

Collaborating Authors

Information Retrieval: AI-Alerts


Tackling the US Government's PDF Mountain With Computer Vision

#artificialintelligence

Adobe's PDF format has entrenched itself so deeply in US government document pipelines that the number of state-issued documents currently in existence is conservatively estimated to be in the hundreds of millions. Often opaque and lacking metadata, these PDFs – many created by automated systems – collectively tell no stories or sagas; if you don't know exactly what you're looking for, you'll probably never find a pertinent document. And if you did know, you probably didn't need the search. However a new project is using computer vision and other machine learning approaches to change this almost unapproachable mountain of data into a valuable and explorable resource for researchers, historians, journalists and scholars. When the US government discovered Adobe's Portable Document Format (PDF) in the 1990s, it decided that it liked it.


Alibaba Develops Search Engine Simulation AI That Uses Live Data

#artificialintelligence

In collaboration with academic researchers in China, Alibaba has developed a search engine simulation AI that uses real world data from the ecommerce giant's live infrastructure in order to develop new ranking models that are not hamstrung by'historic' or out-of-date information. The engine, called AESim, represents the second major announcement in a week to acknowledge the need for AI systems to be able to evaluate and incorporate live and current data, instead of just abstracting the data that was available at the time the model was trained. The earlier announcement was from Facebook, which last week unveiled the BlenderBot 2.0 language model, an NLP interface that features live polling of internet search results in response to queries. The objective of the AESim project is to provide an experimental environment for the development of new Learning-To-Rank (LTR) solutions, algorithms and models in commercial information retrieval systems. In testing the framework, the researchers found that it accurately reflected online performance within useful and actionable parameters.


This High Schooler Created a Drug Discovery Search Engine

#artificialintelligence

Between his mom's place in Manhattan, his dad in Queens, and his high school in the Bronx, Noah Getz is on the subway a lot. It gives him time to read and to think. Our first coronavirus summer was waning, and he'd been wrestling with a weighty science problem: using machine learning to hunt down tiny molecules that may help treat Alzheimer's. Thus far, his AI had been spitting out results that were "almost comically bad." The problem was that the algorithms Getz was using did their best when they had massive amounts of data to sift through and discover patterns in. Getz' data set was far smaller; he was working with one lab at Mount Sinai, not a multinational pharmaceutical company with a galaxy-sized drug library.


Extreme Classification

Communications of the ACM

What would you do if you had the super-power to accurately answer, in a few milliseconds, a multiple-choice question with a billion choices? Would you design the next generation of Web search engines, which could predict which of the billions of documents might be relevant to a given query? Would you build the next generation of retail recommender systems that have things delivered to your doorstep just as you need them? Or would you try and predict the next word about to be uttered by U.S. President Donald Trump? The objective in extreme classification, a new research area in machine learning, is to develop algorithms with such capabilities.


China Appears To Block Microsoft's Bing Search Engine

NPR Technology

This is a visualization of global internet attacks, seen during the 4th China Internet Security Conference in Beijing. Microsoft's Bing search engine is no longer accessible in China, the company reports. This is a visualization of global internet attacks, seen during the 4th China Internet Security Conference in Beijing. Microsoft's Bing search engine is no longer accessible in China, the company reports. The Microsoft search engine, Bing, appears to have been blocked in China since Wednesday.


Google Testing A Censored Search Engine Just For China

NPR Technology

The Google.cn for China website is seen on a computer screen in this photo illustration. Google is reportedly working on a censored version of its search engine to comply with China's government's demands. The Google.cn for China website is seen on a computer screen in this photo illustration. Google is reportedly working on a censored version of its search engine to comply with China's government's demands. Google is testing a mobile version of its search engine that will adhere to the Chinese government's censorship demands, including the blocking of certain websites and search terms, according to multiple reports.


Russian Search Engine Alerts Google to Possible Data Problem

U.S. News

Yandex spokesman Ilya Grabovsky said Thursday that some Internet users contacted the company Wednesday to say that its public search engine was yielding what looked like personal Google files. Grabovsky said the company has alerted Google.


Amazon Is More Than A Shopping Site. It's A Search Engine Too

NPR Technology

NPR-Marist poll finds that almost half of online shoppers go to Amazon first when they look for an item. Other search engines know what customers look for but Amazon knows what they ultimately buy.


Microsoft Looks at Whether Russians Bought U.S. Ads on Search Engine

U.S. News

SAN FRANCISCO (Reuters) - Microsoft Corp said on Monday it was looking into whether Russians bought U.S. election ads on its Bing search engine or on other Microsoft-owned products and platforms, after rival Google said it had discovered such ads on its products.


Microsoft Search Engine Bing to Focus on PC Search Market: CEO

U.S. News

NEW YORK (Reuters) - Microsoft Corp Chief Executive Officer Satya Nadella said on Wednesday the company's search engine, Bing, will focus on expanding in the PC search market after losing its deal with Apple Inc's Siri.