employee attrition
Mitigating Attrition: Data-Driven Approach Using Machine Learning and Data Engineering
This paper presents a novel data-driven approach to mitigating employee attrition using machine learning and data engineering techniques. The proposed framework integrates data from various human resources systems and leverages advanced feature engineering to capture a comprehensive set of factors influencing attrition. The study outlines a robust modeling approach that addresses challenges such as imbalanced datasets, categorical data handling, and model interpretation. The methodology includes careful consideration of training and testing strategies, baseline model establishment, and the development of calibrated predictive models. The research emphasizes the importance of model interpretation using techniques like SHAP values to provide actionable insights for organizations. Key design choices in algorithm selection, hyperparameter tuning, and probability calibration are discussed. This approach enables organizations to proactively identify attrition risks and develop targeted retention strategies, ultimately redu
Can Large Language Model Predict Employee Attrition?
Ma, Xiaoye, Liu, Weiheng, Zhao, Changyi, Tukhvatulina, Liliya R.
Employee attrition poses significant costs for organizations, with traditional statistical prediction methods often struggling to capture modern workforce complexities. Machine learning (ML) advancements offer more scalable and accurate solutions, but large language models (LLMs) introduce new potential in human resource management by interpreting nuanced employee communication and detecting subtle turnover cues. This study leverages the IBM HR Analytics Attrition dataset to compare the predictive accuracy and interpretability of a fine-tuned GPT-3.5 model against traditional ML classifiers, including Logistic Regression, k-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, Random Forest, AdaBoost, and XGBoost. While traditional models are easier to use and interpret, LLMs can reveal deeper patterns in employee behavior. Our findings show that the fine-tuned GPT-3.5 model outperforms traditional methods with a precision of 0.91, recall of 0.94, and an F1-score of 0.92, while the best traditional model, SVM, achieved an F1-score of 0.82, with Random Forest and XGBoost reaching 0.80. These results highlight GPT-3.5's ability to capture complex patterns in attrition risk, offering organizations improved insights for retention strategies and underscoring the value of LLMs in HR applications.
- Asia > Russia > Siberian Federal District > Tomsk Oblast > Tomsk (0.05)
- Europe > Russia (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Switzerland (0.04)
- Information Technology (0.49)
- Law (0.46)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Retention Is All You Need
Mohiuddin, Karishma, Alam, Mirza Ariful, Alam, Mirza Mohtashim, Welke, Pascal, Martin, Michael, Lehmann, Jens, Vahdati, Sahar
Skilled employees are the most important pillars of an organization. Despite this, most organizations face high attrition and turnover rates. While several machine learning models have been developed to analyze attrition and its causal factors, the interpretations of those models remain opaque. In this paper, we propose the HR-DSS approach, which stands for Human Resource (HR) Decision Support System, and uses explainable AI for employee attrition problems. The system is designed to assist HR departments in interpreting the predictions provided by machine learning models. In our experiments, we employ eight machine learning models to provide predictions. We further process the results achieved by the best-performing model by the SHAP explainability process and use the SHAP values to generate natural language explanations which can be valuable for HR. Furthermore, using "What-if-analysis", we aim to observe plausible causes for attrition of an individual employee. The results show that by adjusting the specific dominant features of each individual, employee attrition can turn into employee retention through informative business decisions.
- Europe > Austria > Vienna (0.14)
- Europe > United Kingdom > England > West Midlands > Birmingham (0.05)
- Europe > Germany > Saxony > Dresden (0.04)
- (5 more...)
"How to make them stay?" -- Diverse Counterfactual Explanations of Employee Attrition
Artelt, André, Gregoriades, Andreas
Employee attrition is an important and complex problem that can directly affect an organisation's competitiveness and performance. Explaining the reasons why employees leave an organisation is a key human resource management challenge due to the high costs and time required to attract and keep talented employees. Businesses therefore aim to increase employee retention rates to minimise their costs and maximise their performance. Machine learning (ML) has been applied in various aspects of human resource management including attrition prediction to provide businesses with insights on proactive measures on how to prevent talented employees from quitting. Among these ML methods, the best performance has been reported by ensemble or deep neural networks, which by nature constitute black box techniques and thus cannot be easily interpreted. To enable the understanding of these models' reasoning several explainability frameworks have been proposed. Counterfactual explanation methods have attracted considerable attention in recent years since they can be used to explain and recommend actions to be performed to obtain the desired outcome. However current counterfactual explanations methods focus on optimising the changes to be made on individual cases to achieve the desired outcome. In the attrition problem it is important to be able to foresee what would be the effect of an organisation's action to a group of employees where the goal is to prevent them from leaving the company. Therefore, in this paper we propose the use of counterfactual explanations focusing on multiple attrition cases from historical data, to identify the optimum interventions that an organisation needs to make to its practices/policies to prevent or minimise attrition probability for these cases.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
- Europe > Middle East > Cyprus > Limassol > Limassol (0.04)
- Europe > Germany (0.04)
Analyzing Employee Attrition in Healthcare Data and Predicting Outcomes
Many of the causes of healthcare worker attrition are related to the stressful nature of the healthcare work industry. Many employees work long hours and often experience high burnout. Employee attrition in healthcare is an issue because it exacerbates the issue of the limited supply of workers in the space. Since the healthcare space is significantly understaffed and many healthcare employees are overworked, the quality of care and the speed to care are often negatively impacted. In general, efforts towards reducing healthcare attrition involve improving the quality of the work environment.
- Information Technology > Security & Privacy (0.41)
- Health & Medicine > Consumer Health (0.41)
Why AutoML Is An Essential New Tool For Data Scientists
Machine learning (ML) is the current paradigm for modeling statistical phenomena by harnessing algorithms that exploit computer intelligence. It is common place to build ML models that predict housing prices, aggregate users by their potential marketing interests, and use image recognition techniques to identify brain tumors. However, up until now these models have required scrupulous trial and error in order to optimize model performance on unseen data. The advent of automated machine learning (AutoML) aims to curb the resources required (time and expertise) by offering well-designed pipelines that handle data preprocessing, feature selection, and model creation and evaluation. While AutoML may initially only appeal to enterprises that want to harness the power of ML without consuming precious budgets and hiring skilled data practitioners, it also contains very strong promise to become an invaluable tool for the experienced data scientist.
33 unusual problems that can be solved with data science
Automated translation, including translating one programming language into another one (for instance, SQL to Python - the converse is not possible) Spell checks, especially for people writing in multiple languages - lot's of progress to be made here, including automatically recognizing the language when you type, and stop trying to correct the same word every single time (some browsers have tried to change Ning to Nong hundreds of times, and I have no idea why after 50 failures they continue to try - I call this machine unlearning) Detection of earth-like planets - focus on planetary systems with many planets to increase odds of finding inhabitable planets, rather than stars and planets matching our Sun and Earth Distinguishing between noise and signal on millions of NASA pictures or videos, to identify patterns Automated piloting (drones, cars without pilots) Customized, patient-specific medications and diets Predicting and legally manipulating elections Predicting oil demand, oil ...
Why AutoML is An Essential New Tool For Data Scientists
Machine learning (ML) is the current paradigm for modeling statistical phenomena by harnessing algorithms that exploit computer intelligence. It is common place to build ML models that predict housing prices, aggregate users by their potential marketing interests, and use image recognition techniques to identify brain tumors. However, up until now these models have required scrupulous trial and error in order to optimize model performance on unseen data. The advent of automated machine learning (AutoML) aims to curb the resources required (time and expertise) by offering well-designed pipelines that handle data preprocessing, feature selection, and model creation and evaluation. While AutoML may initially only appeal to enterprises that want to harness the power of ML without consuming precious budgets and hiring skilled data practitioners, it also contains very strong promise to become an invaluable tool for the experienced data scientist.
- Information Technology > Communications > Mobile (1.00)
- Information Technology > Artificial Intelligence > Robots (0.73)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.73)
- Information Technology > Communications > Mobile (1.00)
- Information Technology > Artificial Intelligence > Robots (0.73)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.73)