Oceania
Russia fires North Korean ballistic missiles in 'extremely dangerous' threat to Europe and Asia: Zelenskyy
Fox News' Alex Hogan reports on one of the largest Russian attacks on Ukraine since the war began. Fox News contributor Mike Pompeo also breaks down the Trump administration's travel ban and discusses the U.S. role in potential peace talks. North Korean ballistic missiles once again rained down over Ukraine this week as the war with Russia continues to rage, prompting President Volodymyr Zelenskyy to renew warnings that the threat posed by the Moscow-Pyongyang alliance is "extremely dangerous" for Europe and Asia alike. "The longer this war continues on our territory, the more warfare technologies evolve, and the greater the threat will be to everyone," Zelenskyy said Tuesday. "This must be addressed now, not when thousands of upgraded Shahed drones and ballistic missiles begin to threaten Seoul and Tokyo." Zelenskyy's warning came just one day after Ukraine's military intelligence chief, Kyrylo Budanov, confirmed in an interview with The War Zone that Russia has significantly improved North Korea's KN-23 ballistic missiles.
Australia has 'no alternative' but to embrace AI and seek to be a world leader in the field, industry and science minister says
Australia must "lean in hard" to the benefits of artificial intelligence or else risk ending up "on the end of somebody else's supply chain", according to the new industry and science minister, Tim Ayres, with the Labor government planning to further regulate the rapidly evolving technology. Ayres, a former official with the manufacturing union, acknowledged Australians remained sceptical about AI and stressed that employers and employees needed to have discussions about how automation could affect workplaces. The minister said Australia had "no alternative" but to embrace the new technology and seek to become a world leader in regulating and using AI. "It's the government's job to lean into the opportunity to outline that for businesses and for workers, but also to make sure that they are confident that we've got the capability to deal with the potential pitfalls," Ayres told Guardian Australia. "I think the Australian answer has got to be leaning in hard and focusing on strategy and regulation that is in the interest of Australians."
Model-Free Kernel Conformal Depth Measures Algorithm for Uncertainty Quantification in Regression Models in Separable Hilbert Spaces
Matabuena, Marcos, Ghosal, Rahul, Mozharovskyi, Pavlo, Padilla, Oscar Hernan Madrid, Onnela, Jukka-Pekka
Depth measures are powerful tools for defining level sets in emerging, non--standard, and complex random objects such as high-dimensional multivariate data, functional data, and random graphs. Despite their favorable theoretical properties, the integration of depth measures into regression modeling to provide prediction regions remains a largely underexplored area of research. To address this gap, we propose a novel, model-free uncertainty quantification algorithm based on conditional depth measures--specifically, conditional kernel mean embeddings and an integrated depth measure. These new algorithms can be used to define prediction and tolerance regions when predictors and responses are defined in separable Hilbert spaces. The use of kernel mean embeddings ensures faster convergence rates in prediction region estimation. To enhance the practical utility of the algorithms with finite samples, we also introduce a conformal prediction variant that provides marginal, non-asymptotic guarantees for the derived prediction regions. Additionally, we establish both conditional and unconditional consistency results, as well as fast convergence rates in certain homoscedastic settings. We evaluate the finite--sample performance of our model in extensive simulation studies involving various types of functional data and traditional Euclidean scenarios. Finally, we demonstrate the practical relevance of our approach through a digital health application related to physical activity, aiming to provide personalized recommendations
A Sample Efficient Conditional Independence Test in the Presence of Discretization
Sun, Boyang, Yao, Yu, Dong, Xinshuai, Liu, Zongfang, Liu, Tongliang, Qiu, Yumou, Zhang, Kun
In many real-world scenarios, interested variables are often represented as discretized values due to measurement limitations. Applying Conditional Independence (CI) tests directly to such discretized data, however, can lead to incorrect conclusions. To address this, recent advancements have sought to infer the correct CI relationship between the latent variables through binarizing observed data. However, this process inevitably results in a loss of information, which degrades the test's performance. Motivated by this, this paper introduces a sample-efficient CI test that does not rely on the binarization process. We find that the independence relationships of latent continuous variables can be established by addressing an over-identifying restriction problem with Generalized Method of Moments (GMM). Based on this insight, we derive an appropriate test statistic and establish its asymptotic distribution correctly reflecting CI by leveraging nodewise regression. Theoretical findings and Empirical results across various datasets demonstrate that the superiority and effectiveness of our proposed test. Our code implementation is provided in https://github.com/boyangaaaaa/DCT
Federated Learning: From Theory to Practice
This book offers a hands-on introduction to building and understanding federated learning (FL) systems. FL enables multiple devices -- such as smartphones, sensors, or local computers -- to collaboratively train machine learning (ML) models, while keeping their data private and local. It is a powerful solution when data cannot or should not be centralized due to privacy, regulatory, or technical reasons. The book is designed for students, engineers, and researchers who want to learn how to design scalable, privacy preserving FL systems. Our main focus is on personalization: enabling each device to train its own model while still benefiting from collaboration with relevant devices. This is achieved by leveraging similarities between (the learning tasks associated with) devices that are encoded by the weighted edges (or links) of a federated learning network (FL network). The key idea is to represent real-world FL systems as networks of devices, where nodes correspond to device and edges represent communication links and data similarities between them. The training of personalized models for these devices can be naturally framed as a distributed optimization problem. This optimization problem is referred to as generalized total variation minimization (GTVMin) and ensures that devices with similar learning tasks learn similar model parameters. Our approach is both mathematically principled and practically motivated. While we introduce some advanced ideas from optimization theory and graph-based learning, we aim to keep the book accessible. Readers are guided through the core ideas step by step, with intuitive explanations.
Spatiotemporal deep learning models for detection of rapid intensification in cyclones
Sutar, Vamshika, Singh, Amandeep, Chandra, Rohitash
Cyclone rapid intensification is the rapid increase in cyclone wind intensity, exceeding a threshold of 30 knots, within 24 hours. Rapid intensification is considered an extreme event during a cyclone, and its occurrence is relatively rare, contributing to a class imbalance in the dataset. A diverse array of factors influences the likelihood of a cyclone undergoing rapid intensification, further complicating the task for conventional machine learning models. In this paper, we evaluate deep learning, ensemble learning and data augmentation frameworks to detect cyclone rapid intensification based on wind intensity and spatial coordinates. We note that conventional data augmentation methods cannot be utilised for generating spatiotemporal patterns replicating cyclones that undergo rapid intensification. Therefore, our framework employs deep learning models to generate spatial coordinates and wind intensity that replicate cyclones to address the class imbalance problem of rapid intensification. We also use a deep learning model for the classification module within the data augmentation framework to di fferentiate between rapid and non-rapid intensification events during a cyclone. Our results show that data augmentation improves the results for rapid intensification detection in cyclones, and spatial coordinates play a critical role as input features to the given models. This paves the way for research in synthetic data generation for spatiotemporal data with extreme events. Introduction Over the past decade, the impacts of climate change have manifested in an alarming increase in the strength of tropical cyclones, characterised by elevated levels of precipitation and wind intensity, resulting in devastating consequences on a global scale [1, 2, 3]. Rappaport et al. [4] defined rapid intensification as a sudden surge in wind intensity exceeding 30 knots (35 miles / hour or 55 kilometres / hour) within 24 hours [5]. Forecasting the rapid intensification of high-category cyclones (Category 4 and 5) poses greater challenges due to their infrequent occurrence, in contrast to lower-category cyclones[6].
Employing self-supervised learning models for cross-linguistic child speech maturity classification
Zhang, Theo, Suresh, Madurya, Warlaumont, Anne S., Hitczenko, Kasia, Cristia, Alejandrina, Cychosz, Margaret
Speech technology systems struggle with many downstream tasks for child speech due to small training corpora and the difficulties that child speech pose. We apply a novel dataset, SpeechMaturity, to state-of-the-art transformer models to address a fundamental classification task: identifying child vocalizations. Unlike previous corpora, our dataset captures maximally ecologically-valid child vocalizations across an unprecedented sample, comprising children acquiring 25+ languages in the U.S., Bolivia, Vanuatu, Papua New Guinea, Solomon Islands, and France. The dataset contains 242,004 labeled vocalizations, magnitudes larger than previous work. Models were trained to distinguish between cry, laughter, mature (consonant+vowel), and immature speech (just consonant or vowel). Models trained on the dataset outperform state-of-the-art models trained on previous datasets, achieved classification accuracy comparable to humans, and were robust across rural and urban settings.
HASFL: Heterogeneity-aware Split Federated Learning over Edge Computing Systems
Lin, Zheng, Chen, Zhe, Chen, Xianhao, Ni, Wei, Gao, Yue
--Split federated learning (SFL) has emerged as a promising paradigm to democratize machine learning (ML) on edge devices by enabling layer-wise model partitioning. However, existing SFL approaches suffer significantly from the straggler effect due to the heterogeneous capabilities of edge devices. T o address the fundamental challenge, we propose adaptively controlling batch sizes (BSs) and model splitting (MS) for edge devices to overcome resource heterogeneity. We first derive a tight convergence bound of SFL that quantifies the impact of varied BSs and MS on learning performance. Based on the convergence bound, we propose HASFL, a heterogeneity-aware SFL framework capable of adaptively controlling BS and MS to balance communication-computing latency and training convergence in heterogeneous edge networks. Extensive experiments with various datasets validate the effectiveness of HASFL and demonstrate its superiority over state-of-the-art benchmarks. Conventional machine learning (ML) frameworks predominantly rely on centralized learning (CL), where raw data is gathered and processed at a central server for model training. However, CL is often impractical due to its high communication latency, increased backbone traffic, and privacy risks [1]-[4]. To address these limitations, federated learning (FL) [5], [6] has emerged as a promising alternative that allows participating devices to collaboratively train a shared model via exchanging model parameters (e.g., gradients) rather than raw data, thereby protecting data privacy and reducing communication costs [7], [8]. Despite its advantage, on-device training of FL poses a significant challenge for its deployment on resource-constrained edge devices as ML models scale up [9], [10].
Watch the mesmerizing first-ever footage of a rare Antarctic squid
Breakthroughs, discoveries, and DIY tips sent every weekday. Oceanographers on an excursion in the Southern Ocean captured a chance, unprecedented encounter with a sizable deep-sea squid. While piloting a remotely operated submersible 7,000 feet below the ocean surface from aboard the Schmidt Ocean Institute's research vessel Falkor (too), experts glimpsed a three-foot-long Gonatus antarcticus specimen. But according to National Geographic's announcement, the team wasn't even supposed to be in that location when they stumbled across the elusive cephalopod. "The ice blocks were moving so fast, it would put all the ships in danger, so we had to rearrange everything," said Manuel Novillo, a researcher at the Instituto de Diversidad y Ecología Animal.
AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
Liu, Qi, Ruan, Jingqing, Li, Hao, Zhao, Haodong, Wang, Desheng, Chen, Jiansong, Guanglu, Wan, Cai, Xunliang, Zheng, Zhi, Xu, Tong
Existing multi-objective preference alignment methods for large language models (LLMs) face limitations: (1) the inability to effectively balance various preference dimensions, and (2) reliance on auxiliary reward/reference models introduces computational complexity. To address these challenges, we propose Adaptive Multi-objective Preference Optimization (AMoPO), a novel framework that achieves dynamic balance across preference dimensions. By introducing the multi-objective optimization paradigm to use the dimension-aware generation metrics as implicit rewards, AMoPO aligns LLMs with diverse preferences without additional reward models or reference models. We introduce an adaptive weight assignment mechanism that models the generation space as a Gaussian distribution, allowing dynamic prioritization of preference dimensions. Empirical results demonstrate that AMoPO outperforms state-of-the-art baselines by 28.5%, and the experiments on 7B, 14B, and 32B models reveal the scaling ability of AMoPO. Moreover, additional analysis of multiple dimensions verifies its adaptability and effectiveness. These findings validate AMoPO's capability to achieve dimension-aware preference alignment, highlighting its superiority. Our codes and datasets are available at https://github.com/Javkonline/AMoPO.