AITopics

2511.18931

Country:

Asia (0.69)
North America > United States (0.46)
North America > Mexico (0.28)
Europe > Austria > Vienna (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

arXiv.org Artificial IntelligenceJun-23-2025

ScholarSearch: Benchmarking Scholar Searching Ability of LLMs

Zhou, Junting, Li, Wang, Liao, Yiyan, Zhang, Nengyuan, Miao, Tingjia, Qi, Zhihui, Wu, Yuhan, Yang, Tong

Large Language Models (LLMs)' search capabilities have garnered significant attention. Existing benchmarks, such as OpenAI's BrowseComp, primarily focus on general search scenarios and fail to adequately address the specific demands of academic search. These demands include deeper literature tracing and organization, professional support for academic databases, the ability to navigate long-tail academic knowledge, and ensuring academic rigor. Here, we proposed ScholarSearch, the first dataset specifically designed to evaluate the complex information retrieval capabilities of Large Language Models (LLMs) in academic research. ScholarSearch possesses the following key characteristics: Academic Practicality, where question content closely mirrors real academic learning and research environments, avoiding deliberately misleading models; High Difficulty, with answers that are challenging for single models (e.g., Grok DeepSearch or Gemini Deep Research) to provide directly, often requiring at least three deep searches to derive; Concise Evaluation, where limiting conditions ensure answers are as unique as possible, accompanied by clear sources and brief solution explanations, greatly facilitating subsequent audit and verification, surpassing the current lack of analyzed search datasets both domestically and internationally; and Broad Coverage, as the dataset spans at least 15 different academic disciplines. Through ScholarSearch, we expect to more precisely measure and promote the performance improvement of LLMs in complex academic information retrieval tasks.

large language model, machine learning, natural language, (18 more...)

2506.13784

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceMay-29-2025

EvolveSearch: An Iterative Self-Evolving Search Agent

Zhang, Dingchu, Zhao, Yida, Wu, Jialong, Li, Baixuan, Yin, Wenbiao, Zhang, Liwen, Jiang, Yong, Li, Yufeng, Tu, Kewei, Xie, Pengjun, Huang, Fei

The rapid advancement of large language models (LLMs) has transformed the landscape of agentic information seeking capabilities through the integration of tools such as search engines and web browsers. However, current mainstream approaches for enabling LLM web search proficiency face significant challenges: supervised fine-tuning struggles with data production in open-search domains, while RL converges quickly, limiting their data utilization efficiency. To address these issues, we propose EvolveSearch, a novel iterative self-evolution framework that combines SFT and RL to enhance agentic web search capabilities without any external human-annotated reasoning data. Extensive experiments on seven multi-hop question-answering (MHQA) benchmarks demonstrate that EvolveSearch consistently improves performance across iterations, ultimately achieving an average improvement of 4.7\% over the current state-of-the-art across seven benchmarks, opening the door to self-evolution agentic capabilities in open web search domains.

arxiv preprint arxiv, large language model, machine learning, (20 more...)

2505.22501

Country: North America > United States (0.14)

Genre: Research Report (1.00)

Industry:

Media > Music (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

EngadgetSep-28-2023, 19:40:04 GMT

Meta's metaverse is getting an AI makeover

Meta's Connect keynote felt different this year, and not just because it marked the return of an in-person event. It's been nearly two years since Mark Zuckerberg used Connect to announce that Facebook was changing its name to Meta and reorienting the entire company around the metaverse. But at this year's event, it felt almost as if Zuckerberg was trying to avoid saying the word "metaverse." While he did utter the word a couple of times, he spent much more time talking up Meta's new AI features, many of which will be available on Instagram and Facebook and other non-metaverse apps. Horizon Worlds, the company's signature metaverse experience that was highlighted at last year's Connect, was barely mentioned. That may not be particularly surprising if you've been following the company's metaverse journey lately.

ai assistant, meta, metaverse, (8 more...)

Engadget

Industry: Information Technology > Services (0.36)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.30)

Zhao, Lingjun, Nguyen, Khanh, Daumé, Hal III

Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models

arXiv.org Artificial IntelligenceMay-28-2023

Recent work studies the cognitive capabilities of language models through psychological tests designed for humans. While these studies are helpful for understanding the general capabilities of these models, there is no guarantee that a model possessing sufficient capabilities to pass those tests would actually use those capabilities in performing real-life tasks. In this work, we formulate task-oriented cognitive capabilities, which are human-like cognitive capabilities that language models leverage to perform tasks. These capabilities are (i) the ability to quickly generate good candidate utterances (the search capability) (ii) the ability to predict how a listener interprets those utterances and choose the most appropriate one (the pragmatic capability). We design an evaluation scheme for comparing these capabilities of a language model with those of a human. Applying this scheme to examine various models in a navigation instruction generation problem, we find that their pragmatic capability is severely lacking. This insight leads us to augment them with better models of the listener and obtain a significant boost of 11% in success rate in guiding real humans. Our work advocates for having a principled procedure for aligning language models with humans that involves (i) formulating task-oriented capabilities, (ii) devising a method to quantify their deficiency, and (iii) iteratively improving them.

large language model, machine learning, natural language, (20 more...)

2301.05149

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Pennsylvania (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.67)

#artificialintelligenceDec-8-2022, 16:05:07 GMT

How AI search is overcoming the unstructured data challenge

With 80 per cent of company data being unstructured, including text, images and video, getting the most possible value from rising amounts of these assets is proving a challenge across all business sectors. Businesses often meet pitfalls in keyword search capabilities that fail to properly take context, formats or languages into account, leaving users with insufficient results. To solve this challenge, Barcelona-headquartered data startup Nuclia is delivering an API that leverages what company CEO and co-founder Eudald Camprubi has named'AI search as a service', capable of finding and indexing data across any source. An end-to-end solution, it can extract data from file repositories, audio, video, URLs and databases, split it into paragraphs, and present an index that shows exactly where any chosen piece of information is in the file. This is based on continuously trained language models, the creation of which owes much to data annotation.

ai search, nuclia, unstructured data, (9 more...)

Country:

North America > United States (0.06)
Europe > United Kingdom (0.06)
Europe > Portugal > Lisbon > Lisbon (0.06)
Europe > France (0.06)

Technology:

Information Technology > Information Management > Search (0.55)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.39)

#artificialintelligenceSep-12-2022, 00:59:42 GMT

how-ai-is-shaping-the-future-of-live-shopping-e-commerce

In fact, odds are you've already worked for a company that uses AI and/or machine learning tools to some extent! But AI will also have wide-reaching consequences for the economy be on the workplace. In fact, AI is sure to shape the future of live shopping and e-commerce marketplaces for years to come. Let's take a look at six major ways AI will change e-commerce and live shopping in the near future. For starters, AI technology developments will allow the further development of visual search capabilities and programs.

ai technology, business owner, consumer, (8 more...)

Industry:

Information Technology > Services > e-Commerce Services (1.00)
Retail (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.57)

#artificialintelligenceJun-22-2021, 22:55:15 GMT

The Future of Search Is Now! - Expert.ai

Every day, billions of internet users type questions into search engines via smartphones, desktop computers or IoT devices, 90 percent of whom are using Google. As a result, each time the company releases a new algorithm into cyberspace, top-ranked SEO marketers and webpage owners become fearful of losing their page-one rankings. However, the company's latest iteration is notably different from those previously released. Now, the tech giant has decided to take the next step and marry its latest algorithm with natural language processing (NLP). Many believe that this dynamic pairing could prove to be a game changer for search. As the primary tool for people to access information, the importance of search engines can't be overestimated.

algorithm, information, search engine, (12 more...)

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
North America > United States > California (0.05)
Europe > Spain (0.05)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

#artificialintelligenceOct-29-2020, 23:02:46 GMT

Arcanum makes Hungarian heritage accessible with Amazon Rekognition

Arcanum specializes in digitizing Hungarian language content, including newspapers, books, maps, and art. With over 30 years of experience, Arcanum serves more than 30,000 global subscribers with access to Hungarian culture, history, and heritage. Amazon Rekognition Solutions Architects worked with Arcanum to add highly scalable image analysis to Arcanum Digitheca, a free service provided by Arcanum, which enables you to search and explore Hungarian cultural heritage, including 600,000 faces over 500,000 images. For example, you can find historical works by author Mór Jókai or photos on topics like weddings. The Arcanum team chose Amazon Rekognition to free valuable staff from time and cost-intensive manual labeling, and improved label accuracy to make 200,000 previously unsearchable images (approximately 40% of image inventory), available to users.

amazon rekognition, artificial intelligence, machine learning, (13 more...)

Country:

Europe > Serbia (0.05)
Europe > Eastern Europe (0.05)
Europe > Croatia (0.05)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceOct-28-2020, 21:19:41 GMT

Configuring your Amazon Kendra Confluence Server connector

These types of workspaces are rich with data and contain sets of knowledge and information that can be a great source of truth to answer organizational questions. Unfortunately, it isn't always easy to tap into these data sources to extract the information you need. For example, the data source might not be connected to an enterprise search service within the organization, or the service is outdated and lacks natural language search capabilities, leading to poorer search experiences. Amazon Kendra is an intelligent search service powered by machine learning (ML). Amazon Ken dra reimagines enterprise search for your websites and applications so your employees and customers can easily find the content they're looking for, even when it's scattered across multiple locations and content repositories within your organization.

artificial intelligence, information management, machine learning, (13 more...)

Industry: Retail > Online (0.40)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.76)