Goto

Collaborating Authors

 Personal Assistant Systems


Hisense taps new Google Home APIs to expand smart home integration

PCWorld

Google issued 100 announcements during its Google I/O developers conference this week, none of which involved the smart home. That apparent lack of enthusiasm for a topic close to our heart didn't dissuade TV and smart-appliance manufacturer Hisense from announcing plans to integrate new Google Home APIs into its own ConnectLife app, so that third-party smart home devices can be folded into that ecosystem. Hisense first announced that it would open its ConnectLife app to third-party products in December, 2024. Today, it announced it will incorporate the latest Google Home APIs into the app by the fall of 2025, Hisense says this will enable users to onboard a wide range of third-party smart home devices--including Matter and Works With Google Home-certified products--to create a more integrated smart home experience. Hisense cited two examples of how this would benefit ConnectLife users: "One-touch modes and customized automations can blend Hisense products with third-party devices to create intelligent home responses, such as air conditioners automatically adjusting based on third-party air quality sensors, or smart lights providing visual notifications when the Hisense refrigerator's VersaTemp drawer reaches the ideal temperature for chilling drinks."


Matter-enabled SwitchBot Hub 3 smart home controller is now available

PCWorld

The SwitchBot Hub 3 smart home controller is now available for purchase. The Matter-capable device is quite different than other smart home hubs we've tested, starting with its rotary knob that can adjust the target temperature on a smart thermostat, the brightness of smart lighting devices, or the volume level of a connected speaker. Another feature that makes the 120 controller so interesting is the USB-C cable that connects it to its power supply: The cable senses the ambient temperature and relative humidity in the room where the Hub 3 is installed. These readings are shown on the hub's display. We have a hands-on review of the all-new SwitchBot Ultra, which is also shipping today.


Boost your workflow for life with this 60 AI assistant

Popular Science

How often do you wish you had an assistant at work? Let Swatle be your AI-powered partner, helping you tackle your projects efficiently. And luckily, a premium lifetime subscription can be yours now for just 59.99 (reg. Think of Swatle as your right-hand tool, serving as an AI-powered productivity partner ready to help you manage projects, automate repetitive tasks, and even organize your whole team's workflow. Thanks to Swatle's artificial intelligence, it even adapts to your individual needs as you go.


Graph Foundation Models: A Comprehensive Survey

arXiv.org Artificial Intelligence

Graph-structured data pervades domains such as social networks, biological systems, knowledge graphs, and recommender systems. While foundation models have transformed natural language processing, vision, and multimodal learning through large-scale pretraining and generalization, extending these capabilities to graphs -- characterized by non-Euclidean structures and complex relational semantics -- poses unique challenges and opens new opportunities. To this end, Graph Foundation Models (GFMs) aim to bring scalable, general-purpose intelligence to structured data, enabling broad transfer across graph-centric tasks and domains. This survey provides a comprehensive overview of GFMs, unifying diverse efforts under a modular framework comprising three key components: backbone architectures, pretraining strategies, and adaptation mechanisms. We categorize GFMs by their generalization scope -- universal, task-specific, and domain-specific -- and review representative methods, key innovations, and theoretical insights within each category. Beyond methodology, we examine theoretical foundations including transferability and emergent capabilities, and highlight key challenges such as structural alignment, heterogeneity, scalability, and evaluation. Positioned at the intersection of graph learning and general-purpose AI, GFMs are poised to become foundational infrastructure for open-ended reasoning over structured data. This survey consolidates current progress and outlines future directions to guide research in this rapidly evolving field. Resources are available at https://github.com/Zehong-Wang/Awesome-Foundation-Models-on-Graphs.


Personalized Diffusion Model Reshapes Cold-Start Bundle Recommendation

arXiv.org Artificial Intelligence

Bundle recommendation aims to recommend a set of items to each user. However, the sparser interactions between users and bundles raise a big challenge, especially in cold-start scenarios. Traditional collaborative filtering methods do not work well for this kind of problem because these models rely on interactions to update the latent embedding, which is hard to work in a cold-start setting. We propose a new approach (DisCo), which relies on a personalized Diffusion backbone, enhanced by disentangled aspects for the user's interest, to generate a bundle in distribution space for each user to tackle the cold-start challenge. During the training phase, DisCo adjusts an additional objective loss term to avoid bias, a prevalent issue while using the generative model for top-$K$ recommendation purposes. Our empirical experiments show that DisCo outperforms five comparative baselines by a large margin on three real-world datasets. Thereby, this study devises a promising framework and essential viewpoints in cold-start recommendation. Our materials for reproducibility are available at: https://github.com/bt-nghia/DisCo.


AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models

arXiv.org Artificial Intelligence

Jailbreak attacks to Large audio-language models (LALMs) are studied recently, but they achieve suboptimal effectiveness, applicability, and practicability, particularly, assuming that the adversary can fully manipulate user prompts. In this work, we first conduct an extensive experiment showing that advanced text jailbreak attacks cannot be easily ported to end-to-end LALMs via text-to speech (TTS) techniques. We then propose AudioJailbreak, a novel audio jailbreak attack, featuring (1) asynchrony: the jailbreak audio does not need to align with user prompts in the time axis by crafting suffixal jailbreak audios; (2) universality: a single jailbreak perturbation is effective for different prompts by incorporating multiple prompts into perturbation generation; (3) stealthiness: the malicious intent of jailbreak audios will not raise the awareness of victims by proposing various intent concealment strategies; and (4) over-the-air robustness: the jailbreak audios remain effective when being played over the air by incorporating the reverberation distortion effect with room impulse response into the generation of the perturbations. In contrast, all prior audio jailbreak attacks cannot offer asynchrony, universality, stealthiness, or over-the-air robustness. Moreover, AudioJailbreak is also applicable to the adversary who cannot fully manipulate user prompts, thus has a much broader attack scenario. Extensive experiments with thus far the most LALMs demonstrate the high effectiveness of AudioJailbreak. We highlight that our work peeks into the security implications of audio jailbreak attacks against LALMs, and realistically fosters improving their security robustness. The implementation and audio samples are available at our website https://audiojailbreak.github.io/AudioJailbreak.


Fuck the Algorithm: Conceptual Issues in Algorithmic Bias

arXiv.org Artificial Intelligence

Algorithmic bias has been the subject of much recent controversy. To clarify what is at stake and to make progress resolving the controversy, a better understanding of the concepts involved would be helpful. The discussion here focuses on the disputed claim that algorithms themselves cannot be biased. To clarify this claim we need to know what kind of thing 'algorithms themselves' are, and to disambiguate the several meanings of 'bias' at play. This further involves showing how bias of moral import can result from statistical biases, and drawing connections to previous conceptual work about political artifacts and oppressive things. Data bias has been identified in domains like hiring, policing and medicine. Examples where algorithms themselves have been pinpointed as the locus of bias include recommender systems that influence media consumption, academic search engines that influence citation patterns, and the 2020 UK algorithmically-moderated A-level grades. Recognition that algorithms are a kind of thing that can be biased is key to making decisions about responsibility for harm, and preventing algorithmically mediated discrimination.


JIR-Arena: The First Benchmark Dataset for Just-in-time Information Recommendation

arXiv.org Artificial Intelligence

Just-in-time Information Recommendation (JIR) is a service designed to deliver the most relevant information precisely when users need it, , addressing their knowledge gaps with minimal effort and boosting decision-making and efficiency in daily life. Advances in device-efficient deployment of foundation models and the growing use of intelligent wearable devices have made always-on JIR assistants feasible. However, there has been no systematic effort to formally define JIR tasks or establish evaluation frameworks. To bridge this gap, we present the first mathematical definition of JIR tasks and associated evaluation metrics. Additionally, we introduce JIR-Arena, a multimodal benchmark dataset featuring diverse, information-request-intensive scenarios to evaluate JIR systems across critical dimensions: i) accurately inferring user information needs, ii) delivering timely and relevant recommendations, and iii) avoiding irrelevant content that may distract users. Developing a JIR benchmark dataset poses challenges due to subjectivity in estimating user information needs and uncontrollable system variables affecting reproducibility. To address these, JIR-Arena: i) combines input from multiple humans and large AI models to approximate information need distributions; ii) assesses JIR quality through information retrieval outcomes using static knowledge base snapshots; and iii) employs a multi-turn, multi-entity validation framework to improve objectivity and generality. Furthermore, we implement a baseline JIR system capable of processing real-time information streams aligned with user inputs. Our evaluation of this baseline system on JIR-Arena indicates that while foundation model-based JIR systems simulate user needs with reasonable precision, they face challenges in recall and effective content retrieval. To support future research in this new area, we fully release our code and data.


LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems

arXiv.org Artificial Intelligence

Recommender systems (RS) are increasingly vulnerable to shilling attacks, where adversaries inject fake user profiles to manipulate system outputs. Traditional attack strategies often rely on simplistic heuristics, require access to internal RS data, and overlook the manipulation potential of textual reviews. In this work, we introduce Agent4SR, a novel framework that leverages Large Language Model (LLM)-based agents to perform low-knowledge, high-impact shilling attacks through both rating and review generation. Agent4SR simulates realistic user behavior by orchestrating adversarial interactions, selecting items, assigning ratings, and crafting reviews, while maintaining behavioral plausibility. Our design includes targeted profile construction, hybrid memory retrieval, and a review attack strategy that propagates target item features across unrelated reviews to amplify manipulation. Extensive experiments on multiple datasets and RS architectures demonstrate that Agent4SR outperforms existing low-knowledge baselines in both effectiveness and stealth. Our findings reveal a new class of emergent threats posed by LLM-driven agents, underscoring the urgent need for enhanced defenses in modern recommender systems.


Wish you had an extra set of hands? Now you do thanks to this AI assistant, now 75% off

PCWorld

TL;DR: Take some things off your plate for good with a lifetime subscription to the Swatle All-in-One AI Assistant, now just 59.99 (reg. Looking for a way AI can make your life easier? Swatle is an AI-powered productivity partner that serves as your own project assistant, helping you get more things done with fewer resources. And right now, you can secure a premium lifetime subscription for just 59.99 (reg. Could you use an extra pair of hands at work? Swatle helps manage projects, automate repetitive tasks, or organize your team's workflow. It adapts to your individual needs, helping you streamline your operations and eliminate your inefficiencies so you can work smarter and faster.