Goto

Collaborating Authors

 Oceania


Addressing Mark Imbalance in Integration-free Neural Marked Temporal Point Processes

Neural Information Processing Systems

Marked Temporal Point Process (MTPP) has been well studied to model the event distribution in marked event streams, which can be used to predict the mark and arrival time of the next event. However, existing studies overlook that the distribution of event marks is highly imbalanced in many real-world applications, with some marks being frequent but others rare. The imbalance poses a significant challenge to the performance of the next event prediction, especially for events of rare marks. To address this issue, we propose a thresholding method, which learns thresholds to tune the mark probability normalized by the mark's prior probability to optimize mark prediction, rather than predicting the mark directly based on the mark probability as in existing studies. In conjunction with this method, we predict the mark first and then the time. In particular, we develop a novel neural MTPP model to support effective time sampling and estimation of mark probability without computationally expensive numerical improper integration. Extensive experiments on real-world datasets demonstrate the superior performance of our solution against various baselines for the next event mark and time prediction.


Representation Consistency for Accurate and Coherent LLMAnswer Aggregation

Neural Information Processing Systems

Test-time scaling improves large language models' (LLMs) performance by allocating more compute budget during inference. To achieve this, existing methods often require intricate modifications to prompting and sampling strategies. In this work, we introduce representation consistency (RC), a test-time scaling method for aggregating answers drawn from multiple candidate responses of an LLM regardless of how they were generated, including variations in prompt phrasing and sampling strategy. RC enhances answer aggregation by not only considering the number of occurrences of each answer in the candidate response set, but also the consistency of the model's internal activations while generating the set of responses leading to each answer. These activations can be either dense (raw model activations) or sparse (encoded via pretrained sparse autoencoders). Our rationale is that if the model's representations of multiple responses converging on the same answer are highly variable, this answer is more likely to be the result of incoherent reasoning and should be down-weighted during aggregation. Importantly, our method only uses cached activations and lightweight similarity computations and requires no additional model queries.


Feeding Kids Eggs Early in Life Helps Prevent Food Allergy, New Study Says

TIME - Tech

Follow this section to personalize your feed and get instant alerts. Follow Go to your personalized feed WHY FOLLOW? Smart Alerts: Get notified about major news as it happens. Follow this tag to personalize your feed and get instant alerts. Follow Go to your personalized feed WHY FOLLOW?


ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

Neural Information Processing Systems

Supervised learning relies on high-quality labeled data, but obtaining such data through human annotation is both expensive and time-consuming. Recent work explores using large language models (LLMs) for annotation, but LLM-generated labels still fall short of human-level quality. To address this problem, we propose the Annotation with Critical Thinking (ACT) data pipeline, where LLMs serve not only as annotators but also as judges to critically identify potential errors. Human effort is then directed towards reviewing only the most "suspicious" cases, significantly improving the human annotation efficiency. Our major contributions are as follows: (1) ACT is applicable to a wide range of domains, including natural language processing (NLP), computer vision (CV), and multimodal understanding, by leveraging multimodal-LLMs (MLLMs).


UMU-Bench: Closing the Modality Gap in Multimodal Unlearning Evaluation

Neural Information Processing Systems

Although Multimodal Large Language Models (MLLMs) have advanced numerous fields, their training on extensive multimodal datasets introduces significant privacy concerns, prompting the necessity for effective unlearning methods. However, current multimodal unlearning approaches often directly adapt techniques from unimodal contexts, largely overlooking the critical issue of modality alignment, i.e., consistently removing knowledge across both unimodal and multimodal settings. To close this gap, we introduce UMU-Bench, a unified benchmark specifically targeting modality misalignment in multimodal unlearning. UMU-Benchconsists of a meticulously curated dataset featuring 653 individual profiles, each described with both unimodal and multimodal knowledge. Additionally, novel tasks and evaluation metrics focusing on modality alignment are introduced, facilitating a comprehensive analysis of unimodal and multimodal unlearning effectiveness. Through extensive experimentation with state-of-the-art unlearning algorithms on UMU-Bench, we demonstrate prevalent modality misalignment issues in existing methods. These findings underscore the critical need for novel multimodal unlearning approaches explicitly considering modality alignment.


Orochi: Versatile Biomedical Image Processor

Neural Information Processing Systems

Deep learning has emerged as a pivotal tool for accelerating research in the life sciences, with the low-level processing of biomedical images (e.g., registration, fusion, restoration, super-resolution) being one of its most critical applications. Platforms such as ImageJ (Fiji) and napari have enabled the development of customized plugins for various models. However, these plugins are typically based on models that are limited to specific tasks and datasets, making them less practical for biologists. To address this challenge, we introduce Orochi, the first application-oriented, efficient, and versatile image processor designed to overcome these limitations. Orochi is pre-trained on patches/volumes extracted from the raw data of over 100 publicly available studies using our Random Multi-scale Sampling strategy.


Canada proposes teen social media ban - with workaround for tech firms

BBC News

Canada is proposing a social media ban for children and teenagers under the age of 16, mirroring a similar law passed in Australia late last year. But unlike Australia's law, tech firms could sidestep Canada's ban if they demonstrate they have policies to minimise harm to minors. The law includes sweeping measures to regulate AI chatbots and curtail harmful content online. It would create a regulator to ensure tech firms comply. Some free speech groups have warned it would expand censorship.


Orochi: Versatile Biomedical Image Processor

Neural Information Processing Systems

Deep learning has emerged as a pivotal tool for accelerating research in the life sciences, with the low-level processing of biomedical images (e.g., registration, fusion, restoration, super-resolution) being one of its most critical applications. Platforms such as ImageJ (Fiji) and napari have enabled the development of customized plugins for various models. However, these plugins are typically based on models that are limited to specific tasks and datasets, making them less practical for biologists.


Forthcoming machine learning and AI seminars: June 2026 edition

AIHub

This post contains a list of the AI-related seminars that are scheduled to take place between 1 June and 31 July 2026. All events detailed here are free and open for anyone to attend virtually. Franco Accordino and Monika Lanzenberger (European Commission) The Digital Humanism (DIGHUM) Initiative The talk will be livestreamed on YouTube here . K Madhava Krishna (IIIT Hyderabad) Robotics Café The Google Meet link is here . Gianfranco Polizzi (University of Birmingham) Raspberry PI Sign up here to join.


This model is not a real person: how AI is shaking up fashion – video

The Guardian

From digital twins to models'sculpted' by programmers, generative AI has been popping up all over the fashion industry. When an Australian e-commerce retailer started using AI-generated models to sell products, lifestyle editor Alyx Gorman had to see if the garments were more than mere pixels. The Iconic, which sells the dress worn in this video, said in a statement: 'Where AI-generated imagery is used to advertise products for sale on our platform, our expectation is that it is clearly labelled and that the product itself is represented as accurately as possible for customers.' Meanwhile, Atoir, the designer, said: 'The Australian fashion industry is highly competitive, particularly for independent brands. We believe that when used responsibly, tools like this can help smaller businesses to operate with greater agility while still maintaining the creative standards and product integrity that matter to both the brand and the customer.'