
How to DP-fy Your Data: A Practical Guide to Generating Synthetic Data With Differential Privacy

Ponomareva, Natalia, Xu, Zheng, McMahan, H. Brendan, Kairouz, Peter, Rosenblatt, Lucas, Cohen-Addad, Vincent, Guzmán, Cristóbal, McKenna, Ryan, Andrew, Galen, Bie, Alex, Yu, Da, Kurakin, Alex, Zadimoghaddam, Morteza, Vassilvitskii, Sergei, Terzis, Andreas

arXiv.org Machine Learning

High-quality data is needed to unlock the full potential of AI for end users. However, finding new sources of such data is getting harder: most publicly available human-generated data will soon have been used. Additionally, publicly available data is often not representative of the users of a particular system -- for example, a research speech dataset of contractors interacting with an AI assistant will likely be more homogeneous, well articulated, and self-censored than the real-world commands that end users will issue. Unlocking high-quality data grounded in real user interactions is therefore of vital interest. However, the direct use of user data comes with significant privacy risks. Differential Privacy (DP) is a well-established framework for reasoning about and limiting information leakage, and is a gold standard for protecting user privacy. The focus of this work, \emph{Differentially Private Synthetic Data}, refers to synthetic data that preserves the overall trends of the source data while providing strong privacy guarantees to the individuals who contributed to the source dataset. DP synthetic data can unlock the value of datasets that have previously been inaccessible due to privacy concerns, and can replace the use of sensitive datasets that previously had only rudimentary protections like ad-hoc rule-based anonymization. In this paper we explore the full suite of techniques surrounding DP synthetic data, the types of privacy protections they offer, and the state of the art for various modalities (image, tabular, text, and decentralized). We outline all the components needed in a system that generates DP synthetic data, from sensitive data handling and preparation to usage tracking and empirical privacy testing. We hope that this work will result in increased adoption of DP synthetic data, spur additional research, and increase trust in DP synthetic data approaches.
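To make the DP guarantee concrete, here is a minimal sketch of the classic Laplace mechanism, one of the basic building blocks DP systems use to release statistics with bounded information leakage. The dataset, seed, and function names are illustrative assumptions, not the paper's method:

```python
import math
import random

def laplace_noise(scale, rng):
    # Sample Laplace(0, scale) via the inverse-CDF transform.
    u = rng.random() - 0.5
    return -scale * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))

def laplace_mechanism(true_value, sensitivity, epsilon, seed=None):
    # Release a noisy statistic satisfying epsilon-DP: the noise scale
    # grows with the query's sensitivity and shrinks as epsilon grows.
    rng = random.Random(seed)
    return true_value + laplace_noise(sensitivity / epsilon, rng)

# Example: privately release a count query over a (toy) sensitive dataset.
ages = [34, 29, 41, 52, 38]
true_count = sum(1 for a in ages if a > 30)  # count queries have sensitivity 1
noisy_count = laplace_mechanism(true_count, sensitivity=1, epsilon=1.0, seed=42)
```

DP synthetic data generators compose many such noisy measurements (or DP model training steps) so that the released dataset as a whole satisfies the privacy budget.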


Prompt Engineering Guidance for Conceptual Agent-based Model Extraction using Large Language Models

Khatami, Siamak, Frantz, Christopher

arXiv.org Artificial Intelligence

This document contains detailed information about the prompts used in the experimental process discussed in the paper "Toward Automating Agent-based Model Generation: A Benchmark for Model Extraction using Question-Answering Techniques". The paper aims to utilize Question-Answering (QA) models to extract the information necessary to implement Agent-based Modeling (ABM) from conceptual models. It presents the extracted information in formats that can be read by both humans and computers (i.e., JavaScript Object Notation (JSON)), enabling manual use by humans and automatic code generation by Large Language Models (LLMs).
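To illustrate the kind of machine-readable output this pipeline targets, here is a toy sketch of an ABM description extracted into JSON. The schema (agents/environment/schedule keys) is an illustrative assumption, not the paper's exact format:

```python
import json

# A conceptual-model sentence that a QA model might process.
conceptual_text = (
    "Households decide each month whether to move based on the share of "
    "similar neighbors; the city grid has 50x50 cells."
)

# Hypothetical extraction result in a human- and machine-readable form.
extracted = {
    "agents": [
        {
            "name": "Household",
            "attributes": ["location", "similarity_threshold"],
            "behaviors": ["evaluate_neighborhood", "move_if_unhappy"],
        }
    ],
    "environment": {"type": "grid", "width": 50, "height": 50},
    "schedule": {"step": "monthly"},
}

# JSON output that an LLM (or a human) can turn into simulation code.
print(json.dumps(extracted, indent=2))
```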


HTML-LSTM: Information Extraction from HTML Tables in Web Pages using Tree-Structured LSTM

Kawamura, Kazuki, Yamamoto, Akihiro

arXiv.org Artificial Intelligence

In this paper, we propose a novel method for extracting information from HTML tables that have similar contents but different structures. We aim to integrate multiple HTML tables into a single table for retrieving information contained in various Web pages. The method is designed by extending the tree-structured LSTM, a neural network for tree-structured data, in order to extract both the linguistic and the structural information of HTML data. We evaluate the proposed method through experiments using real data published on the WWW.
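A prerequisite for any tree-structured model over HTML is turning the markup into an explicit tree. The following stdlib-only sketch shows that preprocessing step for a small table; the HTML-LSTM architecture itself is not reproduced here, and the node schema is an assumption for illustration:

```python
from html.parser import HTMLParser

class TableTreeBuilder(HTMLParser):
    """Build a nested dict tree from HTML, suitable for tree-structured models."""

    def __init__(self):
        super().__init__()
        self.root = {"tag": "root", "text": "", "children": []}
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = {"tag": tag, "text": "", "children": []}
        self.stack[-1]["children"].append(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        if len(self.stack) > 1:
            self.stack.pop()

    def handle_data(self, data):
        # Attach text content to the current (innermost) node.
        self.stack[-1]["text"] += data.strip()

html_doc = ("<table><tr><th>Name</th><th>Age</th></tr>"
            "<tr><td>Ada</td><td>36</td></tr></table>")
builder = TableTreeBuilder()
builder.feed(html_doc)
table = builder.root["children"][0]
# Flatten cell texts row by row to check the recovered structure.
cells = [cell["text"] for row in table["children"] for cell in row["children"]]
print(cells)  # -> ['Name', 'Age', 'Ada', '36']
```

Each node keeps both its text (linguistic information) and its position in the tree (structural information), which is exactly the pairing the HTML-LSTM approach exploits.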


Question Answering as Programming for Solving Time-Sensitive Questions

Zhu, Xinyu, Yang, Cheng, Chen, Bei, Li, Siheng, Lou, Jian-Guang, Yang, Yujiu

arXiv.org Artificial Intelligence

Question answering plays a pivotal role in human daily life because it involves our acquisition of knowledge about the world. However, due to the dynamic and ever-changing nature of real-world facts, the answer can be completely different when the time constraint in the question changes. Recently, Large Language Models (LLMs) have shown remarkable intelligence in question answering, yet our experiments reveal that the aforementioned problems still pose a significant challenge to existing LLMs. This can be attributed to LLMs' inability to perform rigorous reasoning based on surface-level text semantics. To overcome this limitation, rather than requiring LLMs to answer the question directly, we propose a novel approach in which we reframe the $\textbf{Q}$uestion $\textbf{A}$nswering task $\textbf{a}$s $\textbf{P}$rogramming ($\textbf{QAaP}$). Concretely, by leveraging modern LLMs' superior capability in understanding both natural language and programming language, we harness LLMs to represent diversely expressed text as well-structured code and to select the best-matching answer from multiple candidates through programming. We evaluate our QAaP framework on several time-sensitive question answering datasets and achieve improvements of up to $14.5$% over strong baselines. Our code and data are available at https://github.com/TianHongZXY/qaap
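The core idea, representing textual facts as structured records and resolving the time constraint programmatically rather than by surface-level matching, can be sketched as follows. The candidate facts, field names, and selection function are illustrative assumptions, not the paper's implementation:

```python
from datetime import date

# Hypothetical candidates an LLM might extract from text for a question like
# "Who held the position in 2020?", each with a validity interval.
candidates = [
    {"answer": "Alice", "start": date(2015, 1, 1), "end": date(2018, 12, 31)},
    {"answer": "Bob",   "start": date(2019, 1, 1), "end": date(2023, 6, 30)},
]

def answer_at(question_date, facts):
    # Select the candidate whose validity interval covers the queried date;
    # this check is exact, unlike fuzzy text matching.
    for fact in facts:
        if fact["start"] <= question_date <= fact["end"]:
            return fact["answer"]
    return None

print(answer_at(date(2020, 5, 1), candidates))  # -> Bob
```

The point is that once the facts are code, the time comparison becomes a rigorous program check instead of a semantic guess by the model.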


GETTING OVER A.I 🤖. The cult of Artificial Intelligence…

#artificialintelligence

The cult of Artificial Intelligence seems to have taken over the world! From facial recognition sensors to chatbots, everything is bound to a certain algorithm. But when it comes to innovation in AI, a particular company seems to be the forerunner. OpenAI is a research organization that conducts research in the field of artificial intelligence (AI). It was founded in 2015 by a group of entrepreneurs, including Elon Musk and Sam Altman, with the goal of advancing the field of AI in a way that is safe and beneficial to humanity.


Which models are interpretable?

#artificialintelligence

Model explanation is an essential task in supervised machine learning. Explaining how a model represents information is crucial to understanding the dynamics that rule our data. Let's see some models that are easy to interpret. Data scientists have the role of extracting information from raw data. They aren't engineers, nor are they software developers.


La veille de la cybersécurité

#artificialintelligence

Technological breakthroughs have revolutionized the way individuals work and conduct business. For instance, people must develop skills that will enable them to find new jobs, because it is predicted that automation could replace up to a third of all jobs by 2030. Consider the following to demonstrate how crucial document AI will be in the future: did you know that 70% of enterprise documents are free-form text, such as written documents and emails? This highlights the need for software that can automatically extract information and decode text from all of your documents without human input. Machine learning is what has made such document AI possible.


How AI is transforming chat channels?

#artificialintelligence

AI is used in chat channels to assist with tasks such as customer service, order fulfillment, and product research. For example, customer service can use AI to answer customer questions, identify customer needs, and make recommendations. AI can also be used to monitor chat channels for problem keywords and phrases and automatically respond with appropriate solutions. Conversational AI is the process of using machine learning and deep neural networks to enable users to communicate with computer systems in natural language. The system extracts user intent from text or voice input and transforms the text into structured data.
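The "extract intent from text and transform it into structured data" step can be sketched with a toy rule-based classifier. Real conversational AI uses learned intent classifiers and slot fillers, so the patterns and intent names below are purely illustrative assumptions:

```python
import re

# Hypothetical intent patterns for a customer-service chat channel.
INTENT_PATTERNS = {
    "order_status": re.compile(r"\b(where|status|track)\b.*\border\b"),
    "product_info": re.compile(r"\b(tell me|info|details)\b.*\bproduct\b"),
}

def extract_intent(utterance):
    # Map a user utterance to a structured record: an intent plus any
    # extracted slots (here, just an order number like "#1234").
    text = utterance.lower()
    for intent, pattern in INTENT_PATTERNS.items():
        if pattern.search(text):
            order_id = re.search(r"#(\d+)", utterance)
            slots = {"order_id": order_id.group(1)} if order_id else {}
            return {"intent": intent, "slots": slots}
    return {"intent": "fallback", "slots": {}}

print(extract_intent("Where is my order #1234?"))
```

The structured output is what lets downstream systems (order lookup, recommendations, escalation) act on the conversation automatically.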


Is there any difference between data science and machine learning?

#artificialintelligence

Data science and machine learning are two wonderful and exciting disciplines and are a great part of our lives. People sometimes confuse them, but they are quite different things. Data science is, as the name suggests, the science of data: a set of techniques and tools that let the data scientist extract the information hidden in data. Such a mining process can be done using statistical tools or mathematical models.


Breaking Privacy in Federated Learning

#artificialintelligence

Federated learning is a new way of training a machine learning model using distributed data that is not centralized in a server. It works by training a generic (shared) model with a given user’s private…