analytical approach
An Analytical Approach to Privacy and Performance Trade-Offs in Healthcare Data Sharing
Wei, Yusi, Benson, Hande Y., Capan, Muge
The secondary use of healthcare data is vital for research and clinical innovation, but it raises concerns about patient privacy. This study investigates how to balance privacy preservation and data utility in healthcare data sharing, considering the perspectives of both data providers and data users. Using a dataset of adult patients hospitalized between 2013 and 2015, we predict whether sepsis was present at admission or developed during the hospital stay. We identify sub-populations, such as older adults, frequently hospitalized patients, and racial minorities, that are especially vulnerable to privacy attacks due to their unique combinations of demographic and healthcare utilization attributes. These groups are also critical for machine learning (ML) model performance. We evaluate three anonymization methods-$k$-anonymity, the technique by Zheng et al., and the MO-OBAM model-based on their ability to reduce re-identification risk while maintaining ML utility. Results show that $k$-anonymity offers limited protection. The methods of Zheng et al. and MO-OBAM provide stronger privacy safeguards, with MO-OBAM yielding the best utility outcomes: only a 2% change in precision and recall compared to the original dataset. This work provides actionable insights for healthcare organizations on how to share data responsibly. It highlights the need for anonymization methods that protect vulnerable populations without sacrificing the performance of data-driven models.
Throwing Objects into A Moving Basket While Avoiding Obstacles
Kasaei, Hamidreza, Kasaei, Mohammadreza
The capabilities of a robot will be increased significantly by exploiting throwing behavior. In particular, throwing will enable robots to rapidly place the object into the target basket, located outside its feasible kinematic space, without traveling to the desired location. In previous approaches, the robot often learned a parameterized throwing kernel through analytical approaches, imitation learning, or hand-coding. There are many situations in which such approaches do not work/generalize well due to various object shapes, heterogeneous mass distribution, and also obstacles that might be presented in the environment. It is obvious that a method is needed to modulate the throwing kernel through its meta parameters. In this paper, we tackle object throwing problem through a deep reinforcement learning approach that enables robots to precisely throw objects into moving baskets while there are obstacles obstructing the path. To the best of our knowledge, we are the first group that addresses throwing objects with obstacle avoidance. Such a throwing skill not only increases the physical reachability of a robot arm but also improves the execution time. In particular, the robot detects the pose of the target object, basket, and obstacle at each time step, predicts the proper grasp configuration for the target object, and then infers appropriate parameters to throw the object into the basket. Due to safety constraints, we develop a simulation environment in Gazebo to train the robot and then use the learned policy in real-robot directly. To assess the performers of the proposed approach, we perform extensive sets of experiments in both simulation and real robots in three scenarios. Experimental results showed that the robot could precisely throw a target object into the basket outside its kinematic range and generalize well to new locations and objects without colliding with obstacles.
how-is-ai-impacting-the-advertising-industry
Artificial intelligence has come a long way, especially in terms of operations. According to Salesforce, around 60% of market leaders suggest that AI can be helpful for various program campaigns. With digital transformation taking over the world, running programmatic campaigns becomes easy. According to Accenture, the communications and information industry, AI capabilities will help to generate around $4.7 trillion by 2035 for coalescence. Artificial intelligence has been bringing significant changes across the advertising industry, especially in advertising across the sales industry.
Essential data science skills that no one talks about - KDnuggets
The top results are long lists of technical terms, named hard skills. Python, algebra, statistics, and SQL are some of the most popular ones. Later, there come soft skills -- communication, business acumen, team player, etc. Let's pretend that you are a super-human possessing all the above abilities. You code from the age of five, you are a Kaggle grandmaster and your conference papers are guaranteed to get a best-paper award. There is still a very high chance that your projects struggle to reach maturity and become full-fledged commercial products. Recent studies estimate that more than 85% of data science projects fail to reach production. The studies provide numerous reasons for the failures. And I have not seen the so-called essential skills mentioned even once as a potential reason.
Essential data science skills that no one talks about.
The top results are long lists of technical terms, named hard skills. Python, algebra, statistics, and SQL are some of the most popular ones. Later, there come soft skills -- communication, business acumen, team player, etc. Let's pretend that you are a super-human possessing all the above abilities. You code from the age of five, you are a Kaggle grandmaster and your conference papers are guaranteed to get a best-paper award. There is still a very high chance that your projects struggle to reach maturity and become full-fledged commercial products. Recent studies estimate that more than 85% of data science projects fail to reach production. The studies provide numerous reasons for the failures.
How Artificial Intelligence Is Reinventing Human Resources?
Everyone knows that AI has made a great entry in our lives and now everyone is totally dependent on its services which are not less than a miracle. There is hardly any field left that is not being touched by Artificial intelligence. AI has revolutionized the way of almost all industries. Yes, all industries have started making use of AI these days in different ways. In this blog, we will discuss how it is playing an important role in reinventing Human resources.
Algorithm Selection Framework for Cyber Attack Detection
Chalรฉ, Marc, Bastian, Nathaniel D., Weir, Jeffery
The number of cyber threats against both wired and wireless computer systems and other components of the Internet of Things continues to increase annually. In this work, an algorithm selection framework is employed on the NSL-KDD data set and a novel paradigm of machine learning taxonomy is presented. The framework uses a combination of user input and meta-features to select the best algorithm to detect cyber attacks on a network. Performance is compared between a rule-of-thumb strategy and a meta-learning strategy. The framework removes the conjecture of the common trial-and-error algorithm selection method. The framework recommends five algorithms from the taxonomy. Both strategies recommend a high-performing algorithm, though not the best performing. The work demonstrates the close connectedness between algorithm selection and the taxonomy for which it is premised.
Webinar: An Analytical Approach to Financial Crimes Rules Creation and Optimization - Altair Knowledge Works
Recent studies from leading financial institutions show that financial fraud continues to rise at alarming rates. According to Experian, credit and debit card fraud has risen over 60% in the first half of 2019. Although Millennials are highly targeted (an 80% chance of being a target of fraud), no one is immune from criminal activity. Home grown fraud rules, based on in-house subject matter expertise and observations, are commonly used to counter fraud attacks as well as other financial crimes, such as Anti-Money Laundering. Altair offers an industry leading approach to augment/adjust these rules based on actual analytical models.
The Art of Data Science
With much of the latest discussion focused on the latest techniques in machine learning and in particular deep learning, the significant benefits of machine learning and deep learning are now a public reality. Yet, machine learning in effect represents the predictive analytics techniques that have been used for many years by data scientists. Furthermore, data scientists and their end users have always recognized the huge economic advantages of predictive analytics. But the significant advances of deep learning in the last 5 years have just expanded the application of predictive analytics to other areas which were technically not feasible at the time. The market for these solutions is huge and the competition is fierce.
Cross-validation in high-dimensional spaces: a lifeline for least-squares models and multi-class LDA
Least-squares models such as linear regression and Linear Discriminant Analysis (LDA) are amongst the most popular statistical learning techniques. However, since their computation time increases cubically with the number of features, they are inefficient in high-dimensional neuroimaging datasets. Fortunately, for k-fold cross-validation, an analytical approach has been developed that yields the exact cross-validated predictions in least-squares models without explicitly training the model. Its computation time grows with the number of test samples. Here, this approach is systematically investigated in the context of cross-validation and permutation testing. LDA is used exemplarily but results hold for all other least-squares methods. Furthermore, a non-trivial extension to multi-class LDA is formally derived. The analytical approach is evaluated using complexity calculations, simulations, and permutation testing of an EEG/MEG dataset. Depending on the ratio between features and samples, the analytical approach is up to 10,000x faster than the standard approach (retraining the model on each training set). This allows for a fast cross-validation of least-squares models and multi-class LDA in high-dimensional data, with obvious applications in multi-dimensional datasets, Representational Similarity Analysis, and permutation testing.