AITopics | Information Retrieval

Collaborating Authors

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer fill the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but still usually requires time-consuming serial searching of one database after the other, and then moving on to other methods of searching for internet sources. And what if the information being sought is a sound byte? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where widely different formats contain the information being sought, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

News Overviews Instructional Materials AI-Alerts Classics

A visual search engine for Bangladeshi laws

Mandal, Manash Kumar, Nath, Pinku Deb, Mizan, Arpeeta Shams, Saquib, Nazmus

arXiv.org Machine LearningNov-14-2017

Browsing and finding relevant information for Bangladeshi laws is a challenge faced by all law students and researchers in Bangladesh, and by citizens who want to learn about any legal procedure. Some law archives in Bangladesh are digitized, but lack proper tools to organize the data meaningfully. We present a text visualization tool that utilizes machine learning techniques to make the searching of laws quicker and easier. Using Doc2Vec to layout law article nodes, link mining techniques to visualize relevant citation networks, and named entity recognition to quickly find relevant sections in long law articles, our tool provides a faster and better search experience to the users. Qualitative feedback from law researchers, students, and government officials show promise for visually intuitive search tools in the context of governmental, legal, and constitutional data in developing countries, where digitized data does not necessarily pave the way towards an easy access to information.

information retrieval, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1711.05233

Country:

Asia > Bangladesh (1.00)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.15)

Genre: Research Report (0.40)

Industry:

Law (1.00)
Government > Regional Government > Asia Government > Bangladesh Government (0.87)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.91)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.87)

Add feedback

Information Retrieval Document Search Engine in R

@machinelearnbotNov-13-2017, 18:30:11 GMT

In this post, we learn about building a basic search engine or document retrieval system using Vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query we need to retrieve relevant documents that are similar to the search query.

artificial intelligence, information retrieval document search engine, natural language, (3 more...)

@machinelearnbot

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

Fruit Fly Brain Patterns Can Improve Algorithms that Power Netflix, Youtube Recommendations

International Business TimesNov-11-2017, 08:55:09 GMT

Researchers have ventured into uncharted territory to find ways to improve computer algorithms -- the brains of fruit flies. While search algorithms work by analyzing users' previous searches, a fruit fly searches for fruits by remembering the odor of the fruit they have fed on. "This is a problem that pretty much every technology company with any kind of information retrieval system has to solve, so it's been something that computer scientists have studied for years. Now, we have this new approach to similarity searches thanks to the fly," said Saket Navlakha, assistant professor at Salk's Integrative Biology Laboratory and lead author of the research paper titled "A neural algorithm for a fundamental computing problem." The paper was published in the Science Journal on Thursday.

artificial intelligence, information retrieval, natural language, (11 more...)

International Business Times

Genre: Research Report (0.38)

Industry:

Media > Television (0.42)
Media > Film (0.42)
Information Technology > Services (0.42)

Technology:

Information Technology > Communications > Social Media (0.80)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.58)

Add feedback

A neural algorithm for a fundamental computing problem

ScienceNov-9-2017, 19:20:49 GMT

Similarity search--for example, identifying similar images in a database or similar documents on the web--is a fundamental computing problem faced by large-scale information retrieval systems. We discovered that the fruit fly olfactory circuit solves this problem with a variant of a computer science algorithm (called locality-sensitive hashing). The fly circuit assigns similar neural activity patterns to similar odors, so that behaviors learned from one odor can be applied when a similar odor is experienced. The fly algorithm, however, uses three computational strategies that depart from traditional approaches. These strategies can be translated to improve the performance of computational similarity searches.

artificial intelligence, information retrieval, natural language, (4 more...)

Science

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.69)

Add feedback

5 Ways Machine Learning Can Improve Access to Enterprise Data - insideBIGDATA

@machinelearnbotNov-9-2017, 17:15:15 GMT

In this special guest feature, Grant Ingersoll, Founder and CTO of Lucidworks, discusses how machine learning is helping companies manage big data and make sense of it for their customers and employees. With smarter search tools, business leaders can more quickly retrieve information and deliver a better user experience for customers. Here are 5 ways that machine learning is powering more intuitive enterprise search. Grant is an active member of the Lucene community. He is a Lucene and Solr committer, co-founder of the Apache Mahout machine learning project, and a longstanding member of the Apache Software Foundation. Grant's prior experience includes work at the Center for Natural Language Processing at Syracuse University in natural language processing and information retrieval.

information retrieval, machine learning, natural language, (13 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)

Add feedback

Indonesia to Summon Messenger, Search Engine Providers Over Content

U.S. NewsNov-7-2017, 06:40:39 GMT

JAKARTA (Reuters) - Indonesia's communications ministry said on Tuesday it will summon representatives from messenger services and search engine providers including Alphabet Inc's Google to push them to clean up obscene content.

artificial intelligence, information retrieval, natural language, (5 more...)

U.S. News

Country: Asia > Indonesia > Java > Jakarta > Jakarta (0.53)

Technology:

Information Technology > Information Management > Search (0.89)
Information Technology > Communications > Social Media (0.89)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.89)

Add feedback

Schema Independent Relational Learning

Picado, Jose, Termehchy, Arash, Fern, Alan, Ataei, Parisa

arXiv.org Artificial IntelligenceNov-6-2017

Learning novel concepts and relations from relational databases is an important problem with many applications in database systems and machine learning. Relational learning algorithms learn the definition of a new relation in terms of existing relations in the database. Nevertheless, the same data set may be represented under different schemas for various reasons, such as efficiency, data quality, and usability. Unfortunately, the output of current relational learning algorithms tends to vary quite substantially over the choice of schema, both in terms of learning accuracy and efficiency. This variation complicates their off-the-shelf application. In this paper, we introduce and formalize the property of schema independence of relational learning algorithms, and study both the theoretical and empirical dependence of existing algorithms on the common class of (de) composition schema transformations. We study both sample-based learning algorithms, which learn from sets of labeled examples, and query-based algorithms, which learn by asking queries to an oracle. We prove that current relational learning algorithms are generally not schema independent. For query-based learning algorithms we show that the (de) composition transformations influence their query complexity. We propose Castor, a sample-based relational learning algorithm that achieves schema independence by leveraging data dependencies. We support the theoretical results with an empirical study that demonstrates the schema dependence/independence of several algorithms on existing benchmark and real-world datasets under (de) compositions.

logic & formal reasoning, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1508.03846

Country:

North America > United States > Oregon > Benton County > Corvallis (0.05)
North America > United States > Illinois (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.49)
Health & Medicine > Therapeutic Area > Immunology (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.48)

Add feedback

What do you need to know about Chinese search engine Sogou?

#artificialintelligenceNov-3-2017, 15:10:07 GMT

A few days ago, the news emerged that Chinese search engine Sogou (搜狗) is aiming to raise up to $585 million in a U.S. Initial Public Offering. Sogou, which is owned by internet company Sohu, Inc., announced the terms for its proposed IPO on Friday. The news has caused a stir among those keeping an eye on the Chinese tech space, as Sogou is backed by Chinese tech giant Tencent, the company behind the hugely popular messaging apps WeChat and QQ. But for those of us who might not be up on the state of search in China, what do you need to know about Sogou, and how does its IPO play into the wider search landscape? And could there be any potential knock-on effects for the rest of the industry?

artificial intelligence, information retrieval, natural language, (19 more...)

#artificialintelligence

Country: Asia > China (0.36)

Industry: Information Technology > Services (0.51)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.81)

Add feedback

Opportunities for Women, Minorities in Information Retrieval

Communications of the ACMOct-24-2017, 21:45:06 GMT

Diversity was a central theme in the ACM SIGIR 2017 held in Shinjuku Ward in Tokyo, Japan. Fuji, a view of Shinjuku sky-scrapers, including the Tokyo Metropolitan Government (Office), as seen from Keio Plaza the conference hotel, and fireworks celebrating the 40th anniversary. The colorfulness of the fireworks and the circles within and enclosing the logo represent diversity and inclusion." SIGIR 2017 featured a session on Women in IR (Information Retrieval) organized by Laura Dietz of the University of New Hampshire on the first day, just before the welcome party. A week before the conference, I received an email from the secretary of the session, Maram Hasanain, a graduate student in computer science (CS) at Qatar University, asking if I would like to prepare a one-minute introduction of myself for the session. I was so overwhelmed by her beautifully written e-mail, and the excitement of a first-time contact with someone from Qatar, that I immediately accepted her invitation.

artificial intelligence, information retrieval, natural language, (15 more...)

Communications of the ACM

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (1.00)
Asia > Middle East > Qatar (0.46)
North America > United States > New Hampshire (0.25)
(5 more...)

Industry:

Government (0.55)
Education > Educational Setting > Higher Education (0.36)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.61)

Add feedback

Who's the most influential biomedical scientist? Computer program guided by artificial intelligence says it knows

#artificialintelligenceOct-18-2017, 13:20:04 GMT

Eric Lander, president and founding director of the Broad Institute and a biologist at the Massachusetts Institute of Technology in Cambridge, is the most influential biomedical researcher of the modern era, according to a computer program. Lander, a geneticist and mathematician, ranks first on a new list of top biomedical researchers produced by the scientific literature search tool Semantic Scholar. Semantic Scholar, launched in 2015, is an academic search engine aiming to tackle the problem of information overload. It uses artificial intelligence (AI) to help users sift through huge numbers of scientific papers and understand (to a limited extent) their content. The free tool was developed by the Allen Institute for Artificial Intelligence (AI2), a nonprofit based in Seattle, Washington, that was co-founded in 2014 by Microsoft Co-Founder Paul Allen.

artificial intelligence, information retrieval, natural language, (15 more...)

#artificialintelligence

Country:

North America > United States > Washington > King County > Seattle (0.26)
North America > United States > Massachusetts (0.26)
North America > United States > Pennsylvania (0.06)

Genre: Research Report (0.32)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.41)

Add feedback