AITopics

1807.09754

Country:

North America > United States > California > Santa Clara County > San Jose (0.24)
North America > Canada > Alberta (0.14)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Materials > Chemicals (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.76)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

arXiv.org Artificial IntelligenceJul-6-2018

Natural Language Processing for Information Extraction

Singh, Sonit

With rise of digital age, there is an explosion of information in the form of news, articles, social media, and so on. Much of this data lies in unstructured form and manually managing and effectively making use of it is tedious, boring and labor intensive. This explosion of information and need for more sophisticated and efficient information handling tools gives rise to Information Extraction(IE) and Information Retrieval(IR) technology. Information Extraction systems takes natural language text as input and produces structured information specified by certain criteria, that is relevant to a particular application. Various sub-tasks of IE such as Named Entity Recognition, Coreference Resolution, Named Entity Linking, Relation Extraction, Knowledge Base reasoning forms the building blocks of various high end Natural Language Processing (NLP) tasks such as Machine Translation, Question-Answering System, Natural Language Understanding, Text Summarization and Digital Assistants like Siri, Cortana and Google Now. This paper introduces Information Extraction technology, its various sub-tasks, highlights state-of-the-art research in various IE subtasks, current challenges and future research directions.

information retrieval, machine learning, natural language, (15 more...)

1807.02383

Country:

Europe > Czechia > Prague (0.04)
Asia > China > Beijing > Beijing (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.93)
Information Technology (0.93)
Media > News (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
(3 more...)

U.S. NewsJul-5-2018, 13:46:58 GMT

Russian Search Engine Alerts Google to Possible Data Problem

Yandex spokesman Ilya Grabovsky said Thursday that some Internet users contacted the company Wednesday to say that its public search engine was yielding what looked like personal Google files. Grabovsky said the company has alerted Google.

information retrieval, natural language, russian search engine alert google, (4 more...)

U.S. News

AI-Alerts: 2018 > 2018-07 > AAAI AI-Alert for Jul 10, 2018 (1.00)

Industry: Media > News (0.40)

Technology:

Information Technology > Information Management > Search (0.87)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.87)

#artificialintelligenceJul-4-2018, 07:26:38 GMT

Machine Learning Sifts & Searches Complex Scientific Data

As scientific datasets increase in both size and complexity, the ability to label, filter and search this deluge of information has become a laborious, time-consuming and sometimes impossible task, without the help of automated tools enabled by machine learning. With this in mind, a team of researchers from the Department of Energy's Lawrence Berkeley National Laboratory (Berkeley Lab) and UC Berkeley are developing innovative machine learning tools to pull contextual information from scientific datasets and automatically generate metadata tags for each file. Scientists can then search these files via a web-based search engine for scientific data, called Science Search, that the Berkeley team is building. As a proof-of-concept, the team is working with staff at Berkeley Lab's Molecular Foundry, to demonstrate the concepts of Science Search on the images captured by the facility's instruments. A beta version of the platform has been made available to Foundry researchers.

information, information retrieval, machine learning, (15 more...)

Industry:

Energy (0.89)
Education > Educational Setting > Higher Education (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)

NPR TechnologyJul-2-2018, 10:55:04 GMT

Amazon Is More Than A Shopping Site. It's A Search Engine Too

NPR-Marist poll finds that almost half of online shoppers go to Amazon first when they look for an item. Other search engines know what customers look for but Amazon knows what they ultimately buy.

amazon, artificial intelligence, natural language, (2 more...)

NPR Technology

AI-Alerts: 2018 > 2018-07 > AAAI AI-Alert for Jul 3, 2018 (1.00)

Technology:

Information Technology > Information Management > Search (0.88)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.88)

#artificialintelligenceJun-29-2018, 16:46:19 GMT

Search in Pics: Google ice cream pool, AI powered piano & watching the World Cup - Search Engine Land

Note: By submitting this form, you agree to Third Door Media's terms. In this week's Search In Pictures, here are the latest images culled from the web, showing what people eat at the search engine companies, how they play, who they meet, where they speak, what toys they have and more. Note: By submitting this form, you agree to Third Door Media's terms. Have something to say about this article?

artificial intelligence, information retrieval, natural language, (10 more...)

Country: Asia > Singapore (0.10)

Industry: Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.46)

Technology:

Information Technology > Information Management > Search (0.88)
Information Technology > Communications > Social Media (0.77)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.66)

#artificialintelligenceJun-29-2018, 11:32:19 GMT

Doctrine raises $11.6 million for its legal search engine

French startup Doctrine is raising a $11.6 million funding round (€10 million) from existing investors Otium Venture and Xavier Niel. Doctrine is building a search engine for court decisions and other legal texts. This is a key tool if you're a lawyer or you're working in the legal industry in general. There are now a thousand companies using the service. It currently costs around €129 per user per month.

artificial intelligence, information retrieval, natural language, (6 more...)

Country: Europe > France (0.08)

Industry: Law (1.00)

Technology:

Information Technology > Information Management > Search (0.64)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.64)

Fatemi, Bahare, Kazemi, Seyed Mehran, Poole, David

Record Linkage to Match Customer Names: A Probabilistic Approach

arXiv.org Artificial IntelligenceJun-26-2018

Consider the following problem: given a database of records indexed by names (e.g., name of companies, restaurants, businesses, or universities) and a new name, determine whether the new name is in the database, and if so, which record it refers to. This problem is an instance of record linkage problem and is a challenging problem because people do not consistently use the official name, but use abbreviations, synonyms, different order of terms, different spelling of terms, short form of terms, and the name can contain typos or spacing issues. We provide a probabilistic model using relational logistic regression to find the probability of each record in the database being the desired record for a given query and find the best record(s) with respect to the probabilities. Building on term-matching and translational approaches for search, our model addresses many of the aforementioned challenges and provides good results when existing baselines fail. Using the probabilities outputted by the model, we can automate the search process for a portion of queries whose desired documents get a probability higher than a trust threshold. We evaluate our model on a large real-world dataset from a telecommunications company and compare it to several state-of-the-art baselines. The obtained results show that our model is a promising probabilistic model for record linkage for names. We also test if the knowledge learned by our model on one domain can be effectively transferred to a new domain. For this purpose, we test our model on an unseen test set from the business names of the secondString dataset. Promising results show that our model can be effectively applied to unseen datasets. Finally, we study the sensitivity of our model to the statistics of datasets.

information retrieval, machine learning, natural language, (20 more...)

1806.10928

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.89)

Industry:

Telecommunications (1.00)
Consumer Products & Services > Restaurants (0.54)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
(2 more...)

#artificialintelligenceJun-25-2018, 00:51:17 GMT

Senzing's Software for Real-Time AI for Entity Resolution to Fight Financial Crime - insideBIGDATA

Senzing, a new artificial intelligence-based (AI) software company, announced its Senzing software product to address the $14.37 billion financial fraud market. Senzing is an IBM spinout that has reinvented entity resolution, which senses who is who in real time across multiple big data sources. Senzing is disrupting the fraud solutions market by offering the first real-time, plug-and-play, AI entity resolution software product for fraud detection, insider threats and more. Now, any company can deploy Senzing to quickly and effectively detect bad actors in their big data. Senzing uses entity-centric learning and other unique techniques to pierce through falsified identities and networks to find criminals.

data mining, real time system, senzing, (16 more...)

Industry:

Information Technology (1.00)
Law Enforcement & Public Safety > Fraud (0.72)

Technology:

Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.95)
Information Technology > Data Science > Data Mining > Big Data (0.62)

Al-Natsheh, Hussein T., Martinet, Lucie, Muhlenbach, Fabrice, Rico, Fabien, Zighed, Djamel A.

Metadata Enrichment of Multi-Disciplinary Digital Library: A Semantic-based Approach

arXiv.org Artificial IntelligenceJun-21-2018

In the scientific digital libraries, some papers from different research communities can be described by community-dependent keywords even if they share a semantically similar topic. Articles that are not tagged with enough keyword variations are poorly indexed in any information retrieval system which limits potentially fruitful exchanges between scientific disciplines. In this paper, we introduce a novel experimentally designed pipeline for multi-label semantic-based tagging developed for open-access metadata digital libraries. The approach starts by learning from a standard scientific categorization and a sample of topic tagged articles to find semantically relevant articles and enrich its metadata accordingly. Our proposed pipeline aims to enable researchers reaching articles from various disciplines that tend to use different terminologies. It allows retrieving semantically relevant articles given a limited known variation of search terms. In addition to achieving an accuracy that is higher than an expanded query based method using a topic synonym set extracted from a semantic network, our experiments also show a higher computational scalability versus other comparable techniques. We created a new benchmark extracted from the open-access metadata of a scientific digital library and published it along with the experiment code to allow further research in the topic.

digital library, information retrieval, machine learning, (19 more...)

1806.08202

Country:

North America > United States > Tennessee > Knox County > Knoxville (0.04)
North America > United States > Nevada (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(2 more...)