AITopics

2010.12091

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)
(5 more...)

#artificialintelligenceOct-20-2020, 15:05:08 GMT

3Diligent Expands ProdEX and Shopsight Applications

Its Shopsight application provides users access to project opportunities from ProdEX and enables remote assessment, quoting, and project management. Both systems incorporate 3Diligent's Connect interface which enables customers and manufacturers to communicate directly using a secure online portal and Zoom video conferencing tools. Operating similarly to traditional search engine marketing, manufacturers can create text ads that will display based on a customer's material and technology requirements. However, unlike traditional search engines, Connect is driven by RFQ inputs rather than generic keyword searches. As a result, manufacturers can customize their bids and visibility on dimensions such as material, technology, and program size to drive higher ROI.

artificial intelligence, information retrieval, natural language, (12 more...)

Industry: Information Technology (0.69)

Technology:

Information Technology > Information Management > Search (0.79)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.79)
Information Technology > Communications > Social Media (0.57)

Radhakrishnan, Karthik, Srikantan, Arvind, Lin, Xi Victoria

ColloQL: Robust Cross-Domain Text-to-SQL Over Search Queries

arXiv.org Artificial IntelligenceOct-19-2020

Translating natural language utterances to executable queries is a helpful technique in making the vast amount of data stored in relational databases accessible to a wider range of non-tech-savvy end users. Prior work in this area has largely focused on textual input that is linguistically correct and semantically unambiguous. However, real-world user queries are often succinct, colloquial, and noisy, resembling the input of a search engine. In this work, we introduce data augmentation techniques and a sampling-based content-aware BERT model (ColloQL) to achieve robust text-to-SQL modeling over natural language search (NLS) questions. Due to the lack of evaluation data, we curate a new dataset of NLS questions and demonstrate the efficacy of our approach. ColloQL's superior performance extends to well-formed text, achieving 84.9% (logical) and 90.7% (execution) accuracy on the WikiSQL dataset, making it, to the best of our knowledge, the highest performing model that does not use execution guided decoding.

information retrieval, machine learning, natural language, (19 more...)

2010.09927

Country:

Asia > China > Hong Kong (0.04)
Oceania > Australia (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(9 more...)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Sports > Tennis (1.00)

Technology:

Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.70)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.66)

Zhao, Tony, Choi, Jaeyoung, Friedland, Gerald

DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models

arXiv.org Artificial IntelligenceOct-19-2020

Cross-modal retrieval relies on accurate models to retrieve relevant results for queries across modalities such as image, text, and video. In this paper, we build upon previous work by tackling the difficulty of evaluating models both quantitatively and qualitatively quickly. We present DIME (Dataset, Index, Model, Embedding), a modality-agnostic tool that handles multimodal datasets, trained models, and data preprocessors to support straightforward model comparison with a web browser graphical user interface. DIME inherently supports building modality-agnostic queryable indexes and extraction of relevant feature embeddings, and thus effectively doubles as an efficient cross-modal tool to explore and search through datasets.

information retrieval, machine learning, natural language, (16 more...)

doi: 10.1007/978-3-030-37734-2_61

2010.09641

Country: North America > United States > California (0.05)

Genre: Research Report (0.40)

Industry:

Government > Regional Government (0.30)
Energy (0.30)

Technology:

Information Technology > Human Computer Interaction (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.31)

arXiv.org Artificial IntelligenceOct-19-2020

Knowledge Graph-based Question Answering with Electronic Health Records

Park, Junwoo, Cho, Youngwoo, Lee, Haneol, Choo, Jaegul, Choi, Edward

Question Answering (QA) on Electronic Health Records (EHR), namely EHR QA, can work as a crucial milestone towards developing an intelligent agent in healthcare. EHR data are typically stored in a relational database, which can also be converted to a Directed Acyclic Graph (DAG), allowing two approaches for EHR QA: Table-based QA and Knowledge Graph-based QA. We hypothesize that the graph-based approach is more suitable for EHR QA as graphs can represent relations between entities and values more naturally compared to tables, which essentially require JOIN operations. To validate our hypothesis, we first construct EHR QA datasets based on MIMIC-III, where the same question-answer pairs are represented in SQL (table-based) and SPARQL (graph-based), respectively. We then test a state-of-the-art EHR QA model on both datasets where the model demonstrated superior QA performance on the SPARQL version. Finally, we open-source both MIMICSQL* and MIMIC-SPARQL* to encourage further EHR QA research in both direction

machine learning, natural language, question answering, (15 more...)

2010.09394

Country:

Asia > South Korea > Daejeon > Daejeon (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.40)

Industry: Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.63)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.62)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

#artificialintelligenceOct-16-2020, 03:50:17 GMT

What is Cognitive Search?

Powerful Indexing Cognitive search, unlike keyword search, crawls and ingests both structured and unstructured data. Keep in mind: experts estimate that as much as 80-90% of your data is unstructured, including email, customer surveys and social media. Cognitive search solutions also enable developers to embed search in other applications using SDKs, APIs, and other tools. This is important because your data isn't confined to databases: it's scattered across the enterprise. So, search has to work where your teams work -- in Slack, Salesforce, Jira, Amazon Web Services (AWS), etc. Natural Language Processing (NLP) Keyword search is basically a matching game played with digital data.

artificial intelligence, cognitive search, information retrieval, (5 more...)

Industry:

Information Technology (0.60)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.37)
Health & Medicine > Therapeutic Area > Immunology (0.37)
Health & Medicine > Epidemiology (0.37)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.67)

Berger, Mark, Zavrel, Jakub, Groth, Paul

Effective Distributed Representations for Academic Expert Search

arXiv.org Artificial IntelligenceOct-16-2020

Expert search aims to find and rank experts based on a user's query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the most effective paper representations for document-centric expert retrieval. However, retrofitting the paper embeddings and using elaborate author contribution weighting strategies did not improve retrieval performance.

data mining, information retrieval, machine learning, (19 more...)

2010.08269

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > China > Hong Kong (0.04)
(7 more...)

Genre:

Research Report (0.82)
Overview (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(4 more...)

#artificialintelligenceOct-14-2020, 23:18:59 GMT

Mastering Presto: Hands-On Learning

Mastering Presto: Hands-On Learning Learn Presto - distributed SQL Query Engine for Big Data! Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses while scaling to the size of organisations like Facebook. In the first part of the course I will talk about Presto's theory including Presto's architecture and components - coordinator, worker, connector, query execution model, etc. Additionally, I will explain to you how Kafka, Cassandra, Hive, PostgreSQL and Redshift work before I mention the specifics to their connectors.

artificial intelligence, natural language, presto, (8 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.41)

Technology:

Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.84)
Information Technology > Communications > Social Media (0.51)

arXiv.org Artificial IntelligenceOct-9-2020

A Survey of Knowledge-Enhanced Text Generation

Yu, Wenhao, Zhu, Chenguang, Li, Zaitang, Hu, Zhiting, Wang, Qingyun, Ji, Heng, Jiang, Meng

The goal of text generation is to make machines express in human language. It is one of the most important yet challenging tasks in natural language processing (NLP). Since 2014, various neural encoder-decoder models pioneered by Seq2Seq have been proposed to achieve the goal by learning to map input text to output text. However, the input text alone often provides limited knowledge to generate the desired output, so the performance of text generation is still far from satisfaction in many real-world scenarios. To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models. This research direction is known as knowledge-enhanced text generation. In this survey, we present a comprehensive review of the research on knowledge enhanced text generation over the past five years. The main content includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data. This survey can have broad audiences, researchers and practitioners, in academia and industry.

information retrieval, machine learning, question answering, (20 more...)

2010.04389

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Asia > China > Hong Kong (0.04)
(11 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
(6 more...)

#artificialintelligenceOct-7-2020, 14:06:25 GMT

Efficient open-domain question-answering on Vespa.ai

We use Recall@position as the main evaluation metric for the retriever. The obvious goal of the retriever is to have the highest recall possible at the lowest possible position. Since the final top position passages are re-ranked using the BERT-based reader, the fewer passages we need to evaluate the better the run time complexity and performance.

information retrieval, machine learning, question answering, (22 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.57)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)