Goto

Collaborating Authors

 data retention


Zero Data Retention in LLM-based Enterprise AI Assistants: A Comparative Study of Market Leading Agentic AI Products

arXiv.org Artificial Intelligence

Governance of data, compliance, and business privacy matters, particularly for healthcare and finance businesses. Since the recent emergence of AI enterprise AI assistants enhancing business productivity, safeguarding private data and compliance is now a priority. With the implementation of AI assistants across the enterprise, the zero data retention can be achieved by implementing zero data retention policies by Large Language Model businesses like Open AI and Anthropic and Meta. In this work, we explore zero data retention policies for the Enterprise apps of large language models (LLMs). Our key contribution is defining the architectural, compliance, and usability trade-offs of such systems in parallel. In this research work, we examine the development of commercial AI assistants with two industry leaders and market titans in this arena - Salesforce and Microsoft. Both of these companies used distinct technical architecture to support zero data retention policies. Salesforce AgentForce and Microsoft Copilot are among the leading AI assistants providing much-needed push to business productivity in customer care. The purpose of this paper is to analyze the technical architecture and deployment of zero data retention policy by consuming applications as well as big language models service providers like Open Ai, Anthropic, and Meta.


DRIP: DRop unImportant data Points -- Enhancing Machine Learning Efficiency with Grad-CAM-Based Real-Time Data Prioritization for On-Device Training

arXiv.org Artificial Intelligence

Embedded Computing Systems Faculty of Informatics, TU Wien Vienna, Austria daniel.mueller-gritschneder@tuwien.ac.at Abstract --Selecting data points for model training is critical in machine learning. Effective selection methods can reduce the labeling effort, optimize on-device training for embedded systems with limited data storage, and enhance the model performance. This paper introduces a novel algorithm that uses Grad-CAM to make online decisions about retaining or discarding data points. Optimized for embedded devices, the algorithm computes a unique DRIP Score to quantify the importance of each data point. This enables dynamic decision-making on whether a data point should be stored for potential retraining or discarded without compromising model performance. Experimental evaluations on four benchmark datasets demonstrate that our approach can match or even surpass the accuracy of models trained on the entire dataset, while achieving storage savings of up to 39%. T o our knowledge, this is the first algorithm to make online decisions about data point retention without requiring access to the entire dataset. In the rapidly evolving domain of machine learning, the quantity of available data have reached unprecedented levels. While large datasets have traditionally been the bedrock of robust machine learning models, the sheer magnitude of data now available poses both opportunities and challenges. One of the primary challenges is efficient data management, especially but not only in scenarios with constrained computational and storage resources [1].


Clear Storage: The Ethics of Deletion Policies for Stored Facial Images

#artificialintelligence

Although the topic of facial recognition is uncomfortable, hearing of its application in a place associated with child-like innocence was particularly jarring to those concerned about the Disney Company violating their privacy. It should be noted that, in Disney's case, guests uncomfortable with facial scanners could opt for a less invasive ticket scan and there are no reports of the technology being implemented in a non-voluntary manner. However, not all companies allow customers to opt out of facial data collection. In July The Verge reported that Lowe's, Macy's and Ace Hardware all currently employ facial recognition algorithms, while McDonalds, Walgreens and even 7โ€“11 are considering using facial recognition in the future. Although that may sound scary, there is nothing illegal about the practice, since facial recognition techniques are unregulated in the U.S. and throughout most of the world.


AI Fear Factor - How Much Do They Know?

#artificialintelligence

Modern societies are formed on concepts of free will and self-governance. Societal authority is put on opinions and feelings of people. While people have common ideas around universal concepts like voter knows best, customer is always right etc., subconsciously, free will itself is influenced by a lot of data points - cultural, spiritual, personal etc. In this age, AI can automatically manage and manipulate these data points at a massive scale- in effect, hacking human feelings, attitudes, beliefs, behaviours and in essence, changing the very fibre of society itself. AI with deep learning neural networks improve themselves by learning on more and more data and establishing complex associations and patterns.


Data Retention: Tough Choices Ahead

#artificialintelligence

As the cost per byte of storage has declined, it has become a habit to simply store data "just in case." At a time when the overwhelming majority of data was generated by human beings, nobody thought much of it. Data was summarized, information extracted from it and the raw data points were still kept should they be needed later. Cisco tells us that as of 2008, there were more things connected to the internet than people, so we can use that as the point in time when the amount of data being generated and stored had its hockey stick moment. Now we have more sensors in more places monitoring more and more activity and generating more and more data points.


Machine Learning and Big Data: Predictive Analytics Make Big Data Meaningful

#artificialintelligence

Predictive analytics use data, statistical algorithms, and Machine Learning techniques to predict the likelihood of business trends and financial performance, based on the past. They bring together several technologies and disciplines such as statistical analysis, data mining, predictive modelling and Machine Learning to predict the future of businesses. For example, it is possible to anticipate the consequences of a decision or the reactions of consumers. Predictive analytics provide insightful insights from large datasets, allowing companies to decide where to go next and provide a better customer experience. With increased data, computing power and the development of easier-to-use AI software and analytical tools, many companies can now use predictive analytics.


Managing the tricky balance between data pooling and data retention with predictive platforms

@machinelearnbot

Likewise, spam detection is most effective when the learning algorithm has been populated with relevant examples. The steering system of a self-driving car is the same story. It won't function properly until it has learned to recognise other vehicles and road signage via traffic imaging. Similarly, a healthcare diagnostic support tool depends upon medical image matching. And an automatic translation tool will need to draw on a body of existing translated texts.


Avoiding industrial IoT digital exhaust with machine learning - IoT Agenda

#artificialintelligence

The internet of things is speeding from concept to reality, with sensors and smart connected devices feeding us a deluge of data 24/7. A study conducted by Cisco estimates that IoT devices will generate 600 zettabytes of data per year by 2020. Most of that data is likely to be generated by automotive, manufacturing, heavy industrial and energy sectors. Such massive growth in industrial IoT data suggests we're about to enter a new industrial revolution where industries undergo as radical a transformation as that of the first industrial revolution. With the Industry 4.0 factory automation trend catching on, data-driven artificial intelligence promises to create cyber-physical systems that learn as they grow, predict failures before they impact performance, and connect factories and supply chains more efficiently than we could ever have imagined.