AITopics | data retention

Collaborating Authors

data retention

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Zero Data Retention in LLM-based Enterprise AI Assistants: A Comparative Study of Market Leading Agentic AI Products

Gupta, Komal, Shrivastava, Aditya

arXiv.org Artificial IntelligenceOct-14-2025

Governance of data, compliance, and business privacy matters, particularly for healthcare and finance businesses. Since the recent emergence of AI enterprise AI assistants enhancing business productivity, safeguarding private data and compliance is now a priority. With the implementation of AI assistants across the enterprise, the zero data retention can be achieved by implementing zero data retention policies by Large Language Model businesses like Open AI and Anthropic and Meta. In this work, we explore zero data retention policies for the Enterprise apps of large language models (LLMs). Our key contribution is defining the architectural, compliance, and usability trade-offs of such systems in parallel. In this research work, we examine the development of commercial AI assistants with two industry leaders and market titans in this arena - Salesforce and Microsoft. Both of these companies used distinct technical architecture to support zero data retention policies. Salesforce AgentForce and Microsoft Copilot are among the leading AI assistants providing much-needed push to business productivity in customer care. The purpose of this paper is to analyze the technical architecture and deployment of zero data retention policy by consuming applications as well as big language models service providers like Open Ai, Anthropic, and Meta.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2510.11558

Country: Asia (0.28)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (1.00)
Information Technology > Software (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DRIP: DRop unImportant data Points -- Enhancing Machine Learning Efficiency with Grad-CAM-Based Real-Time Data Prioritization for On-Device Training

Rüb, Marcus, Konegen, Daniel, Selle, Patrick, Sikora, Axel, Mueller-Gritschneder, Daniel

arXiv.org Artificial IntelligenceMay-23-2025

Embedded Computing Systems Faculty of Informatics, TU Wien Vienna, Austria daniel.mueller-gritschneder@tuwien.ac.at Abstract --Selecting data points for model training is critical in machine learning. Effective selection methods can reduce the labeling effort, optimize on-device training for embedded systems with limited data storage, and enhance the model performance. This paper introduces a novel algorithm that uses Grad-CAM to make online decisions about retaining or discarding data points. Optimized for embedded devices, the algorithm computes a unique DRIP Score to quantify the importance of each data point. This enables dynamic decision-making on whether a data point should be stored for potential retraining or discarded without compromising model performance. Experimental evaluations on four benchmark datasets demonstrate that our approach can match or even surpass the accuracy of models trained on the entire dataset, while achieving storage savings of up to 39%. T o our knowledge, this is the first algorithm to make online decisions about data point retention without requiring access to the entire dataset. In the rapidly evolving domain of machine learning, the quantity of available data have reached unprecedented levels. While large datasets have traditionally been the bedrock of robust machine learning models, the sheer magnitude of data now available poses both opportunities and challenges. One of the primary challenges is efficient data management, especially but not only in scenarios with constrained computational and storage resources [1].

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2504.08364

Country: Europe > Austria > Vienna (0.54)

Genre: Research Report > New Finding (0.93)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Architecture (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Clear Storage: The Ethics of Deletion Policies for Stored Facial Images

#artificialintelligenceDec-18-2021, 04:35:28 GMT

Although the topic of facial recognition is uncomfortable, hearing of its application in a place associated with child-like innocence was particularly jarring to those concerned about the Disney Company violating their privacy. It should be noted that, in Disney's case, guests uncomfortable with facial scanners could opt for a less invasive ticket scan and there are no reports of the technology being implemented in a non-voluntary manner. However, not all companies allow customers to opt out of facial data collection. In July The Verge reported that Lowe's, Macy's and Ace Hardware all currently employ facial recognition algorithms, while McDonalds, Walgreens and even 7–11 are considering using facial recognition in the future. Although that may sound scary, there is nothing illegal about the practice, since facial recognition techniques are unregulated in the U.S. and throughout most of the world.

facial recognition, retention policy, stored facial image, (11 more...)

#artificialintelligence

Country: North America > United States > California (0.06)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)

Add feedback

AI Fear Factor - How Much Do They Know?

#artificialintelligenceJan-6-2019, 11:46:58 GMT

Modern societies are formed on concepts of free will and self-governance. Societal authority is put on opinions and feelings of people. While people have common ideas around universal concepts like voter knows best, customer is always right etc., subconsciously, free will itself is influenced by a lot of data points - cultural, spiritual, personal etc. In this age, AI can automatically manage and manipulate these data points at a massive scale- in effect, hacking human feelings, attitudes, beliefs, behaviours and in essence, changing the very fibre of society itself. AI with deep learning neural networks improve themselves by learning on more and more data and establishing complex associations and patterns.

artificial intelligence, data retention, machine learning, (14 more...)

#artificialintelligence

Industry:

Information Technology > Security & Privacy (0.52)
Government (0.51)
Law (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.55)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.40)

Add feedback

Data Retention: Tough Choices Ahead

#artificialintelligenceAug-27-2018, 15:11:07 GMT

As the cost per byte of storage has declined, it has become a habit to simply store data "just in case." At a time when the overwhelming majority of data was generated by human beings, nobody thought much of it. Data was summarized, information extracted from it and the raw data points were still kept should they be needed later. Cisco tells us that as of 2008, there were more things connected to the internet than people, so we can use that as the point in time when the amount of data being generated and stored had its hockey stick moment. Now we have more sensors in more places monitoring more and more activity and generating more and more data points.

artificial intelligence, information management, reduction, (9 more...)

#artificialintelligence

Industry: Information Technology (0.31)

Technology:

Information Technology > Data Science (0.32)
Information Technology > Artificial Intelligence (0.32)
Information Technology > Information Management (0.31)

Add feedback

Machine Learning and Big Data: Predictive Analytics Make Big Data Meaningful

#artificialintelligenceJul-17-2018, 18:04:12 GMT

Predictive analytics use data, statistical algorithms, and Machine Learning techniques to predict the likelihood of business trends and financial performance, based on the past. They bring together several technologies and disciplines such as statistical analysis, data mining, predictive modelling and Machine Learning to predict the future of businesses. For example, it is possible to anticipate the consequences of a decision or the reactions of consumers. Predictive analytics provide insightful insights from large datasets, allowing companies to decide where to go next and provide a better customer experience. With increased data, computing power and the development of easier-to-use AI software and analytical tools, many companies can now use predictive analytics.

artificial intelligence, data mining, machine learning, (9 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.77)

Add feedback

Managing the tricky balance between data pooling and data retention with predictive platforms

@machinelearnbotJul-14-2017, 03:50:08 GMT

Likewise, spam detection is most effective when the learning algorithm has been populated with relevant examples. The steering system of a self-driving car is the same story. It won't function properly until it has learned to recognise other vehicles and road signage via traffic imaging. Similarly, a healthcare diagnostic support tool depends upon medical image matching. And an automatic translation tool will need to draw on a body of existing translated texts.

artificial intelligence, data retention, predictive platform, (1 more...)

@machinelearnbot

Industry: Health & Medicine (0.80)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.80)

Add feedback

Avoiding industrial IoT digital exhaust with machine learning - IoT Agenda

#artificialintelligenceJun-1-2017, 20:55:45 GMT

The internet of things is speeding from concept to reality, with sensors and smart connected devices feeding us a deluge of data 24/7. A study conducted by Cisco estimates that IoT devices will generate 600 zettabytes of data per year by 2020. Most of that data is likely to be generated by automotive, manufacturing, heavy industrial and energy sectors. Such massive growth in industrial IoT data suggests we're about to enter a new industrial revolution where industries undergo as radical a transformation as that of the first industrial revolution. With the Industry 4.0 factory automation trend catching on, data-driven artificial intelligence promises to create cyber-physical systems that learn as they grow, predict failures before they impact performance, and connect factories and supply chains more efficiently than we could ever have imagined.

artificial intelligence, data mining, internet of things, (16 more...)

#artificialintelligence

Country: North America > United States (0.04)

Industry: Information Technology > Security & Privacy (0.95)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Data Science > Data Mining (0.96)

Add feedback