AITopics | Information Retrieval

Collaborating Authors

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer fill the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but still usually requires time-consuming serial searching of one database after the other, and then moving on to other methods of searching for internet sources. And what if the information being sought is a sound byte? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where widely different formats contain the information being sought, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

News Overviews Instructional Materials AI-Alerts Classics

The Proof is in the PPC: The Definitive Guide to Search Advertising MarTechExec

#artificialintelligenceApr-16-2018, 21:10:59 GMT

Search advertising is the placement of ads in search engine results. Businesses pay to place these ads at the top of search results. The price paid depends on several factors, such as search term popularity, competition and website quality. Taking a scroll through Google AdWords' lessons and picking up a certificate is a great way to start yourself off with paid search. But it doesn't give you everything you need to know.

google, search advertising, search advertising martechexec, (11 more...)

#artificialintelligence

Industry:

Marketing (1.00)
Information Technology > Services (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.53)

Add feedback

Information Retrieval and Mining Massive Data Sets

@machinelearnbotApr-12-2018, 05:47:50 GMT

The goal is to introduce various techniques required to build an IR System. In this course we will explore various methods to solve big data problem. We will evaluate alternative solutions and trade offs. In the later part of the course we will discuss various data mining algorithms to make sense of massive data sets.

information retrieval and mining, mining massive data set

@machinelearnbot

Genre:

Instructional Material > Online (0.40)
Instructional Material > Course Syllabus & Notes (0.40)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.87)

Add feedback

A plug-in approach to maximising precision at the top and recall at the top

Tasche, Dirk

arXiv.org Machine LearningApr-9-2018

Information retrieval and binary classification can be considered equivalent problems in principle. Information retrieval means to mark documents in a set of candidate documents as relevant or non-relevant for some question, on the basis of the properties of the documents. For binary classification, the problem is to distinguish between the'positive' and'negative' instances from a dataset, based on the features of the instances. Hence, from an abstract point of view, information retrieval is a special case of binary classification, with the documents being instances, the document properties being features and'relevant' being translated as'positive'. In practice, however, the general concepts from binary classification are not always helpful for information retrieval applications. The fact that often the proportion of relevant documents in a set of documents subject to a search is small or even very small is only one of the reasons for information retrieval to be considered a field of research for its own. As a consequence, some performance measures for information retrieval methods differ from those in use for binary classifiers or are called by different names. Precision and recall are possibly the most popular performance measures(see Chapter 8 of Manning et al., 2008, for a list of performance measures) for information retrieval methods: - Precision is the proportion of documents (instances) that are truly relevant (positive) among those documents which have been predicted relevant (positive). The term precision is also commonly used (with the same meaning) in binary classification.

classifier, information retrieval, machine learning, (19 more...)

arXiv.org Machine Learning

1804.03077

Country: Asia > Japan (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

SEO Reporting Has Already Changed: Farewell HubSpot Keywords Tool

#artificialintelligenceApr-5-2018, 09:45:15 GMT

Being one of HubSpot's largest agency partners, we stay close to the pulse of the ever-changing platform. A few weeks back, they announced the keywords tool is being sunsetted at the end of May 2018. There was and still is a lot of backlash against this move because customers use the HubSpot Keywords tool for tracking their overall level of SEO success. Keyword rank tracking has long been a metric that marketers follow, and they will continue to do so. However, although HubSpot has continued to invest, grow, and update sales, marketing, and analytics tools within the platform, their keywords tool hasn't been updated since Google started encrypting keyword search data prior to 2013.

farewell hubspot keyword tool, functionality, hubspot, (9 more...)

#artificialintelligence

Technology:

Information Technology > Information Management > Search (0.77)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.38)

Add feedback

Tight Query Complexity Lower Bounds for PCA via Finite Sample Deformed Wigner Law

Simchowitz, Max, Alaoui, Ahmed El, Recht, Benjamin

arXiv.org Machine LearningApr-3-2018

We prove a \emph{query complexity} lower bound for approximating the top $r$ dimensional eigenspace of a matrix. We consider an oracle model where, given a symmetric matrix $\mathbf{M} \in \mathbb{R}^{d \times d}$, an algorithm $\mathsf{Alg}$ is allowed to make $\mathsf{T}$ exact queries of the form $\mathsf{w}^{(i)} = \mathbf{M} \mathsf{v}^{(i)}$ for $i$ in $\{1,...,\mathsf{T}\}$, where $\mathsf{v}^{(i)}$ is drawn from a distribution which depends arbitrarily on the past queries and measurements $\{\mathsf{v}^{(j)},\mathsf{w}^{(i)}\}_{1 \le j \le i-1}$. We show that for every $\mathtt{gap} \in (0,1/2]$, there exists a distribution over matrices $\mathbf{M}$ for which 1) $\mathrm{gap}_r(\mathbf{M}) = \Omega(\mathtt{gap})$ (where $\mathrm{gap}_r(\mathbf{M})$ is the normalized gap between the $r$ and $r+1$-st largest-magnitude eigenvector of $\mathbf{M}$), and 2) any algorithm $\mathsf{Alg}$ which takes fewer than $\mathrm{const} \times \frac{r \log d}{\sqrt{\mathtt{gap}}}$ queries fails (with overwhelming probability) to identity a matrix $\widehat{\mathsf{V}} \in \mathbb{R}^{d \times r}$ with orthonormal columns for which $\langle \widehat{\mathsf{V}}, \mathbf{M} \widehat{\mathsf{V}}\rangle \ge (1 - \mathrm{const} \times \mathtt{gap})\sum_{i=1}^r \lambda_i(\mathbf{M})$. Our bound requires only that $d$ is a small polynomial in $1/\mathtt{gap}$ and $r$, and matches the upper bounds of Musco and Musco '15. Moreover, it establishes a strict separation between convex optimization and \emph{randomized}, "strict-saddle" non-convex optimization of which PCA is a canonical example: in the former, first-order methods can have dimension-free iteration complexity, whereas in PCA, the iteration complexity of gradient-based methods must necessarily grow with the dimension.

abs, probability, proposition 3, (17 more...)

arXiv.org Machine Learning

1804.01221

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.60)

Add feedback

How artificial intelligence drives PPC automation - Search Engine Land

#artificialintelligenceApr-1-2018, 22:51:51 GMT

From the 1970s through today, there have been three waves of technology innovation in the Silicon Valley that have spawned many industries, including ours (online marketing). First, there were semiconductors, then personal computers, and most recently, the internet. Now we're on the cusp of the next wave of innovation, driven by advances in artificial intelligence (AI). Given that our industry exists thanks to these waves of innovation and that automation -- a top trend in PPC for 2017 -- can be driven by AI, how might things change for us during this next wave? And what risks do we face when we start to use far more automation without knowing the extent of its abilities?

artificial intelligence, information retrieval, natural language, (18 more...)

#artificialintelligence

Country: North America > United States > California (0.25)

Industry:

Information Technology (0.70)
Transportation (0.49)
Automobiles & Trucks (0.47)

Technology:

Information Technology > Information Management > Search (0.86)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.41)

Add feedback

SEO - Best SEO and Digital Marketing Services 21CenturyWeb

#artificialintelligenceMar-31-2018, 21:52:01 GMT

Based on analytics, on-site and off-site considerations, do a health check on your website and create a baseline for ongoing website SEO performance. Find potential partners and influencers that can be utilized for link-building and PR efforts which supports the project's approach and timeline – Associate Find potential partners to publish content on their website with links to the client's website – Associate Significantly increase exposure of the client's business by encouraging targeted users to share and discuss related content We work continuously in order to stay ahead of the rapidly-changing SEO curve, providing regular, up-to-date training and professional development for all members of your team. We utilize proven, best practices that maximize short and long term results, creating quality content and structuring web pages in an efficient and effective manner. Our team of SEO specialists is defined by persistence and tenacity so there will never be wasted potential opportunities.

artificial intelligence, information retrieval, natural language, (12 more...)

#artificialintelligence

Industry: Marketing (0.52)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.42)

Add feedback

How to build a search engine - Part 2: Configuring elasticsearch

@machinelearnbotMar-18-2018, 16:54:55 GMT

In this post we will focus on configuring the elasticsearch bit. I have chosen the Wikipedia people dump for the dataset. This is the wiki pages of a subset of people on Wikipedia. This dataset consists of three columns – URI, name, text. As the column names suggest, URI is the actual wiki link to that person's page, name is the person's name.

artificial intelligence, information retrieval, natural language, (8 more...)

@machinelearnbot

Technology:

Information Technology > Communications (1.00)
Information Technology > Information Management > Search (0.62)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.51)

Add feedback

Gradient Augmented Information Retrieval with Autoencoders and Semantic Hashing

Billings, Sean

arXiv.org Machine LearningMar-12-2018

This paper will explore the use of autoencoders for semantic hashing in the context of Information Retrieval. This paper will summarize how to efficiently train an autoencoder in order to create meaningful and low-dimensional encodings of data. This paper will demonstrate how computing and storing the closest encodings to an input query can help speed up search time and improve the quality of our search results. The novel contributions of this paper involve using the representation of the data learned by an auto-encoder in order to augment our search query in various ways. I present and evaluate the new gradient search augmentation (GSA) approach, as well as the more well-known pseudo-relevance-feedback (PRF) adjustment. I find that GSA helps to improve the performance of the TF-IDF based information retrieval system, and PRF combined with GSA works best overall for the systems compared in this paper.

autoencoder, gradient augmented information retrieval, information retrieval, (11 more...)

arXiv.org Machine Learning

1803.04494

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

What real-world problems can AI really solve? An interview with YITU Technology · TechNode

#artificialintelligenceMar-8-2018, 15:51:07 GMT

You might have heard about those ATMs that use facial recognition instead of cards and PIN numbers for authentication. You might also have seen on the news a smart security algorithm that helps police identify suspects and cracks criminal cases. Artificial intelligence (AI), the wiz behind these advanced technologies, is permeating our daily lives--everything from financial services to public safety to healthcare and transportation. YITU Technology, one of China's front-running AI startups, has developed solutions that help solve real-world problems. YITU now has the ability to enable accurate facial recognition with a large database of over 1 billion faces in just one second, and their technology has in fact assisted Chinese law enforcement in criminal investigations.

artificial intelligence, information retrieval, natural language, (17 more...)

#artificialintelligence

Country: Asia > China (1.00)

Genre: Personal > Interview (0.40)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (0.50)
Information Technology > Information Management > Search (0.31)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.31)

Add feedback