Collaborating Authors

Information Management

Brave is developing its own privacy-focused search engine


Privacy-focused browser Brave is working on its own search engine. It has bought Tailcat, an open-source engine created by a team who worked on the defunct anti-tracking browser and search engine Cliqz, to power Brave Search. The company will allow others to use Brave Search tech to build their own search engines. Brave says the search engine will provide an alternative to Google Search and Chrome. It's developing Brave Search using the same principles as its browser, which now has more than 25 million monthly active users.

AI Content Creator: 5 Ways it Helps Your Blog Grow


Every blogger wants their blog to rank in the top position in Google search results since users commonly select results contained on the first page, especially those in one of the top 3 positions, as you can see in the graphic below. And, for years, the Google search algorithm made content king. This explains why companies invest more into content creation, with 24% of marketers planning to increase their budget for content marketing from 2020 levels. But, content creation is expensive; costing between $2000 and $10,000 a month for the average SME (small and mid-sized enterprise). If you want to get those costs down, consider using an AI-fueled content creator to make your job efficient at a lower cost. Moreover, if you write the content yourself, or you hire a writer to create content, it doesn't take long before you run out of topic ideas.

What Is Big Data?


To really understand huge information, it is helpful to get some historic background. Here is Gartner's definition, circa 2001 (that is still the go-to expression): Big information is information which contains better variety arriving in increasing quantities and using ever-higher velocity. This is known as the three Vs. To put it differently, large info is bigger, more complicated data sets, especially from new information sources. These data sets are so voluminous that traditional data processing software simply can't manage them.

7 Emerging Technologies in SEO and Their Applications


Here's an example of some listicles I created for the topic "quality blog content" using this tool: As you can see, if I wanted to write an article on this topic, I can use some of these suggestions as an outline for my post. With these, I can focus instead on researching the individual sub-topics. Here's another example of some website taglines that I created for Moz by entering the brand name and a brief description of "The Ultimate SEO tool you can trust" into the tool: If you were starting a new brand as an SEO, you can use NLG tools such as this, to discover awesome taglines to use for your brand. TF*IDF stands for "Term Frequency times Inverse Document Frequency". This measures how you use a term on a particular page and how it compares to a collection of pages for that specific keyword.

Visual Search Saga How the Technology is Dominating the Web and the Future of Search


Humans are inherently visual beings. From time immemorial, we rely on visual cues for the basic adaptive behaviors, as well as complex behaviors. Most of us process information based on what we see rather than what we hear or read. And this age-old trend of visual learning has evolved into visual search, as the world became more and more digital and Internet-oriented. In comparison to the speed with which we understand and process pictures, we are terrible listeners and even slower readers. And this happens mostly because of science, as the neurons involved in processing visuals constitute almost 30% of the human brain.

Building a Complete AI Based Search Engine with Elasticsearch, Kubeflow and Katib


Building search systems is hard. Preparing them to work with machine learning is really hard. Developing a complete search engine framework integrated with AI is really really hard. In this post, we'll build a search engine from scratch and discuss on how to further optimize results by adding a machine learning layer using Kubeflow and Katib. This new layer will be capable of retrieving results considering the context of users and is the main focus of this article. As we'll see, thanks to Kubeflow and Katib, final result is rather quite simple, efficient and easy to maintain. To understand the concepts in practice, we'll implement the system with hands-on experience. As it's been built on top of Kubernetes, you can use any infrastructure you like (given appropriate adaptations).

Nvidia reports record sales in gaming and data center segments


Nvidia on Wednesday published fourth quarter financial results above market expectations, with record revenue in both its Gaming and Data Center segments. Fourth quarter non-GAAP earnings per diluted share were $3.10 on revenue of $5 billion, up 61 percent year-over-year. Analysts were expecting earnings of $2.81 on revenue of $4.82 billion. For the full fiscal year, non-GAAP earnings per diluted share were $10. Revenue was a record $16.68 billion, up 53 percent.

Gaming, datacenters boost Nvidia's Q4 revenues to $5 billion


Nvidia reported revenues of $5.0 billion for its fourth fiscal quarter ended January 31, up 61% from a year earlier. The revenues and non-GAAP earnings per share of $3.10 beat expectations as new gaming hardware and AI products generated strong demand. A year ago, Nvidia reported non-GAAP earnings per share of $1.89 on revenues of $3.1 billion. The Santa Clara, California-based company makes graphics processing units (GPUs) that can be used for games, AI, and datacenter computing. While many businesses have been hit hard by the pandemic, Nvidia has seen a boost in those areas.

Improving the Stack Overflow search algorithm using Semantic Search and NLP


If you a person who has ever tried to write a piece of code, you are sure to come across Stack Overflow (it's that famous). For those who live under a rock, Stack Overflow provides one of the largest QA platforms for programmers. Users post questions/doubts and their fellow peers try to provide solutions in the most helpful manner possible. The better an answer, the higher the votes it gets, which also increase a user's reputation. Given its popularity, it's safe to say that there is a buttload of data on there.

Knowledge Graphs

Communications of the ACM

The 1980s saw the evolution of computing as it transitioned from industry to homes through the boom of personal computers. In the field of data management, the Relational Database industry was developing rapidly (Oracle, Sybase, IBM, among others). Object-oriented abstractions were developed as a new form of representational independence. The Internet changed the way people communicated and exchanged information.