AITopics | stratify

stratify

Stratify: Rethinking Federated Learning for Non-IID Data through Balanced Sampling

Wong, Hui Yeok, Lim, Chee Kau, Chan, Chee Seng

arXiv.org Artificial Intelligence, Apr-21-2025

Federated Learning (FL) on non-independently and identically distributed (non-IID) data remains a critical challenge, as existing approaches struggle with severe data heterogeneity. Current methods primarily address symptoms of non-IID by applying incremental adjustments to Federated Averaging (FedAvg), rather than directly resolving its inherent design limitations. Consequently, performance significantly deteriorates under highly heterogeneous conditions, as the fundamental issue of imbalanced exposure to diverse class and feature distributions remains unresolved. This paper introduces Stratify, a novel FL framework designed to systematically manage class and feature distributions throughout training, effectively tackling the root cause of non-IID challenges. Inspired by classical stratified sampling, our approach employs a Stratified Label Schedule (SLS) to ensure balanced exposure across labels, significantly reducing bias and variance in aggregated gradients. Complementing SLS, we propose a label-aware client selection strategy, restricting participation exclusively to clients possessing data relevant to scheduled labels. Additionally, Stratify incorporates a fine-grained, high-frequency update scheme, accelerating convergence and further mitigating data heterogeneity. To uphold privacy, we implement a secure client selection protocol leveraging homomorphic encryption, enabling precise global label statistics without disclosing sensitive client information. Extensive evaluations on MNIST, CIFAR-10, CIFAR-100, Tiny-ImageNet, COVTYPE, PACS, and Digits-DG demonstrate that Stratify attains performance comparable to IID baselines, accelerates convergence, and reduces client-side computation compared to state-of-the-art methods, underscoring its practical effectiveness in realistic federated learning scenarios.
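The abstract does not say how the Stratified Label Schedule is built, so the following is only a toy sketch of the general idea under stated assumptions: labels are visited in shuffled passes so that each one gets equal exposure, and only clients holding the scheduled label are eligible for a step. The function names `build_label_schedule` and `eligible_clients` are hypothetical, and the homomorphic-encryption protocol is left out entirely.

```python
import random

def build_label_schedule(labels, steps, seed=0):
    """Assumed schedule: repeat shuffled passes over all labels.

    Each full pass contains every label exactly once, so exposure is
    balanced whenever `steps` is a multiple of the number of labels.
    """
    rng = random.Random(seed)
    schedule = []
    while len(schedule) < steps:
        block = list(labels)
        rng.shuffle(block)
        schedule.extend(block)
    return schedule[:steps]

def eligible_clients(client_labels, label):
    """Label-aware selection: keep only clients that hold the scheduled label."""
    return sorted(cid for cid, held in client_labels.items() if label in held)

if __name__ == "__main__":
    # Hypothetical non-IID setup: each client holds only a subset of labels.
    client_labels = {"a": {0, 1}, "b": {2}, "c": {1, 2}}
    for step, label in enumerate(build_label_schedule([0, 1, 2], steps=6)):
        print(step, label, eligible_clients(client_labels, label))
```

In a real system the per-client label sets would not be shared in the clear, which is the gap the paper's secure selection protocol is meant to close.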

artificial intelligence, learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2504.13462

Genre: Research Report (0.84)

Industry: Information Technology > Security & Privacy (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Stratify: Unifying Multi-Step Forecasting Strategies

Green, Riku, Stevens, Grant, Abdallah, Zahraa, Filho, Telmo M. Silva

arXiv.org Machine Learning, Dec-29-2024

A key aspect of temporal domains is the ability to make predictions multiple time steps into the future, a process known as multi-step forecasting (MSF). At the core of this process is selecting a forecasting strategy; however, with no existing frameworks to map out the space of strategies, practitioners are left with ad-hoc methods for strategy selection. In this work, we propose Stratify, a parameterised framework that addresses multi-step forecasting, unifying existing strategies and introducing novel, improved strategies. We evaluate Stratify on 18 benchmark datasets, five function classes, and short to long forecast horizons (10, 20, 40, 80). In over 84% of 1080 experiments, novel strategies in Stratify improved performance compared to all existing ones. Importantly, we find that no single strategy consistently outperforms others in all task settings, highlighting the need for practitioners to explore the Stratify space to carefully search and select forecasting strategies based on task-specific requirements. Our results are the most comprehensive benchmarking of known and novel forecasting strategies. We make code available to reproduce our results.
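To make "forecasting strategy" concrete, here is a minimal sketch of two classical strategies, recursive and direct, using a plain least-squares autoregressive model. It is not the Stratify framework, which the abstract does not spell out. All function names are hypothetical, and the linear model is an assumption chosen to keep the example short.

```python
import numpy as np

def make_windows(series, window, horizon):
    """Build lag windows X and, for each window, the next `horizon` values Y."""
    n = len(series) - window - horizon + 1
    X = np.array([series[i:i + window] for i in range(n)])
    Y = np.array([series[i + window:i + window + horizon] for i in range(n)])
    return X, Y

def fit_linear(X, y):
    """Least-squares linear model with an intercept; returns the weights."""
    A = np.hstack([X, np.ones((len(X), 1))])
    w, *_ = np.linalg.lstsq(A, y, rcond=None)
    return w

def predict_linear(w, x):
    """Apply the fitted weights to a single lag window."""
    return float(np.append(x, 1.0) @ w)

def recursive_forecast(series, window, horizon):
    """Fit one one-step model and feed its own predictions back as inputs."""
    X, Y = make_windows(series, window, 1)
    w = fit_linear(X, Y[:, 0])
    history = list(series[-window:])
    forecast = []
    for _ in range(horizon):
        nxt = predict_linear(w, np.array(history[-window:]))
        forecast.append(nxt)
        history.append(nxt)
    return np.array(forecast)

def direct_forecast(series, window, horizon):
    """Fit a separate model for each step ahead, all using the last observed window."""
    X, Y = make_windows(series, window, horizon)
    last = np.asarray(series[-window:])
    return np.array(
        [predict_linear(fit_linear(X, Y[:, h]), last) for h in range(horizon)]
    )

if __name__ == "__main__":
    ramp = np.arange(50.0)  # toy series: 0, 1, 2, ..., 49
    print(recursive_forecast(ramp, window=3, horizon=3))
    print(direct_forecast(ramp, window=3, horizon=3))
```

The recursive strategy needs only one model but compounds its own errors over the horizon. The direct strategy avoids that feedback at the cost of training one model per step.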

artificial intelligence, machine learning, stratify, (17 more...)

arXiv.org Machine Learning

2412.2051

Country: Europe (0.28)

Genre: Research Report > New Finding (0.68)

Industry:

Transportation (0.46)
Energy (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Modeling & Simulation (0.67)

What is Stratify in train_test_split? With example - Dragon Forest

#artificialintelligence, Oct-9-2022, 13:10:10 GMT

To split data into a training set and a test set, you have probably used the train_test_split function from scikit-learn. train_test_split takes several parameters, such as random_state, stratify, shuffle, and test_size. Here we will talk about one of them, stratify, in a simple way. Basically, stratify keeps the class proportions of the full dataset the same in both the training set and the test set, which matters when the dataset is imbalanced. With imbalanced classes, plain random sampling can leave a class over- or under-represented in one of the splits; passing the labels to stratify avoids that problem.
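As a small illustration, the sketch below builds a synthetic dataset with a 90/10 class imbalance and compares the share of the minority class in each split, with and without stratify. The helper name `split_class_ratios` and the synthetic data are assumptions for this example and do not come from the original post.

```python
import numpy as np
from sklearn.model_selection import train_test_split

def split_class_ratios(stratify_labels, seed=0):
    """Return the minority-class share in the train and test splits."""
    # Synthetic imbalanced labels: 90 samples of class 0, 10 of class 1.
    y = np.array([0] * 90 + [1] * 10)
    X = np.arange(100).reshape(-1, 1)
    X_train, X_test, y_train, y_test = train_test_split(
        X,
        y,
        test_size=0.2,
        random_state=seed,
        # Passing the labels asks for the same class proportions in both splits.
        stratify=y if stratify_labels else None,
    )
    return y_train.mean(), y_test.mean()

if __name__ == "__main__":
    print("stratified:  ", split_class_ratios(stratify_labels=True))
    print("unstratified:", split_class_ratios(stratify_labels=False))
```

With stratify set, both splits keep the 10% minority share. Without it, the share in the 20-sample test set depends on the random seed.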

dragon forest, stratify, stratify parameter, (3 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Stratified Random Sampling Using Python and Pandas

#artificialintelligence, May-27-2021, 15:07:55 GMT

Sometimes the sample data that data scientists are given does not fit what we know about the wider population. For example, let's assume that the data science team was given survey data and noticed that the survey respondents were 60% male and 40% female. In the real world, the UK general population is closer to 49.4% male and 50.6% female (source: https://tinyurl.com/43hpe5e4). There could be many explanations for our 60% male sample data. One possibility is that the data collection method might have been flawed.
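One way to correct for such a mismatch is to draw a stratified subsample whose strata follow the population proportions. The sketch below is a minimal version of that idea and not the original article's code: the function name `stratified_sample`, the column name `sex`, and the synthetic survey of 1,000 respondents are all assumptions.

```python
import pandas as pd

def stratified_sample(df, column, proportions, n, seed=0):
    """Draw about `n` rows so that `column` follows the given proportions."""
    parts = []
    for value, share in proportions.items():
        stratum = df[df[column] == value]
        k = round(n * share)  # rows to draw from this stratum
        parts.append(stratum.sample(n=k, random_state=seed))
    return pd.concat(parts).reset_index(drop=True)

if __name__ == "__main__":
    # Synthetic survey that is 60% male and 40% female.
    survey = pd.DataFrame(
        {
            "respondent_id": range(1000),
            "sex": ["male"] * 600 + ["female"] * 400,
        }
    )
    # Target proportions taken from the UK figures quoted above.
    target = {"male": 0.494, "female": 0.506}
    sample = stratified_sample(survey, "sex", target, n=500)
    print(sample["sex"].value_counts())
```

With these inputs the subsample holds 247 male and 253 female rows. Sampling is done without replacement, so each stratum must contain at least as many rows as are requested from it.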

proportion, sample data, stratified random sampling, (9 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.76)

Pandemic Ai - iOS + Apple Watch

#artificialintelligence, Aug-8-2020, 05:15:23 GMT

Your digital vital sign dashboard will show your heart rate, heart rate variability, oxygen saturation, and will use our AI platform to assume longitudinal exacerbations or recovery of your COVID-19 infection. Our 5 meter walk test (frailty test) and 6 minute walk test (cardiovascular function) will be bound to your accelerometer and be able to trend your frailty, heart and lung reserve for fitness as well as for infection. You will be able to enter your medication and laboratory tests in our tracker. Your GPS location will be coupled to a geolocation beacon to an emergency medical service and your physician. Insights will provide links to our clinical trials, CDC, FDA websites as well as advice about anti-inflammatory diets and peer reviewed scientific journals. The Yoga Mode will also allow a proprietary therapeutic anti-inflammatory yogic breathing which will certainly change your cardiovascular health.

ai-me, artificial intelligence, machine learning, (16 more...)

#artificialintelligence

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.72)
