AITopics

BBC NewsApr-30-2025, 05:01:18 GMT

'Bella the robot waitress won't replace our staff'

'Bella the robot waitress won't replace our staff' 4 days agoShareSaveSophie CridlandReporting fromPortlandShareSaveBBCMike Deadman, from The View Cafe and Bar, said Bella was not being used to replace staff Bella carries multiple trays packed with food and drinks, deftly swerving any obstacles and delivering orders day in and day out to her customers. This is the latest recruit at The View Cafe and Bar at Portland's Heights hotel in Dorset. But Bella is no normal member of the waiting staff - she is a state-of-the art robot programmed to serve and even interact with the eatery's patrons. And costing a little under 9,000, it is hoped it can be an economical idea, as well as a novel one. But assistant manager Mike Deadman insists Bella - built by Chinese technology company Pudu - will not result in any job losses.

artificial intelligence, bella, robot, (6 more...)

BBC News

Country:

South America (0.16)
North America > Central America (0.16)
Oceania > Australia (0.07)
(15 more...)

Industry:

Consumer Products & Services > Restaurants (0.61)
Leisure & Entertainment (0.55)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Salazar, Israfel, Burda, Manuel Fernández, Islam, Shayekh Bin, Moakhar, Arshia Soltani, Singh, Shivalika, Farestam, Fabian, Romanou, Angelika, Boiko, Danylo, Khullar, Dipika, Zhang, Mike, Krzemiński, Dominik, Novikova, Jekaterina, Shimabucoro, Luísa, Imperial, Joseph Marvin, Maheshwary, Rishabh, Duwal, Sharad, Amayuelas, Alfonso, Rajwal, Swati, Purbey, Jebish, Ruby, Ahmed, Popovič, Nicholas, Suppa, Marek, Wasi, Azmine Toushik, Kadiyala, Ram Mohan Rao, Tsymboi, Olga, Kostritsya, Maksim, Moakhar, Bardia Soltani, Merlin, Gabriel da Costa, Coletti, Otávio Ferracioli, Shiviari, Maral Jabbari, fard, MohammadAmin farahani, Fernandez, Silvia, Grandury, María, Abulkhanov, Dmitry, Sharma, Drishti, De Mitri, Andre Guarnier, Marchezi, Leticia Bossatto, Heydari, Setayesh, Obando-Ceron, Johan, Kohut, Nazar, Ermis, Beyza, Elliott, Desmond, Ferrante, Enzo, Hooker, Sara, Fadaee, Marzieh

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

arXiv.org Artificial IntelligenceApr-30-2025

The evaluation of vision-language models (VLMs) has mainly relied on English-language benchmarks, leaving significant gaps in both multilingual and multicultural coverage. While multilingual benchmarks have expanded, both in size and languages, many rely on translations of English datasets, failing to capture cultural nuances. In this work, we propose Kaleidoscope, as the most comprehensive exam benchmark to date for the multilingual evaluation of vision-language models. Kaleidoscope is a large-scale, in-language multimodal benchmark designed to evaluate VLMs across diverse languages and visual inputs. Kaleidoscope covers 18 languages and 14 different subjects, amounting to a total of 20,911 multiple-choice questions. Built through an open science collaboration with a diverse group of researchers worldwide, Kaleidoscope ensures linguistic and cultural authenticity. We evaluate top-performing multilingual vision-language models and find that they perform poorly on low-resource languages and in complex multimodal scenarios. Our results highlight the need for progress on culturally inclusive multimodal evaluation frameworks.

computational linguistic, large language model, machine learning, (19 more...)

2504.07072

Country:

Europe (1.00)
South America (0.92)
Asia > Middle East > UAE (0.46)
North America > United States > Minnesota (0.27)

Genre: Research Report > New Finding (0.65)

Industry:

Health & Medicine (0.92)
Education (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Pereira, Ingryd V. S. T., Cavalcanti, George D. C., Cruz, Rafael M. O.

Multi-view autoencoders for Fake News Detection

arXiv.org Artificial IntelligenceApr-29-2025

Given the volume and speed at which fake news spreads across social media, automatic fake news detection has become a highly important task. However, this task presents several challenges, including extracting textual features that contain relevant information about fake news. Research about fake news detection shows that no single feature extraction technique consistently outperforms the others across all scenarios. Nevertheless, different feature extraction techniques can provide complementary information about the textual data and enable a more comprehensive representation of the content. This paper proposes using multi-view autoencoders to generate a joint feature representation for fake news detection by integrating several feature extraction techniques commonly used in the literature. Experiments on fake news datasets show a significant improvement in classification performance compared to individual views (feature representations). We also observed that selecting a subset of the views instead of composing a latent space with all the views can be advantageous in terms of accuracy and computational effort. For further details, including source codes, figures, and datasets, please refer to the project's repository: https://github.com/ingrydpereira/multiview-fake-news.

artificial intelligence, machine learning, natural language, (12 more...)

doi: 10.1109/CI-NLPSoMe64976.2025.10970665

2504.08102

Country: South America > Brazil > Pernambuco (0.14)

Genre: Research Report (0.84)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Anghinoni, Luiz Antonio Nicolau, Denardin, Gustavo Weber, Gertrudes, Jadson Castro, Casanova, Dalcimar, Oliva, Jefferson Tales

The use of Multi-domain Electroencephalogram Representations in the building of Models based on Convolutional and Recurrent Neural Networks for Epilepsy Detection

arXiv.org Artificial IntelligenceApr-28-2025

This important role has led researchers to develop various methods for gathering information about brain activity, resulting in significant advancements in medical signal and image acquisition systems [2]. Among these advancements are functional neuroimaging techniques, such as functional magnetic resonance imaging, magnetoencephalography (MEG), positron emission tomography (PET), and electroencephalography [2]. Among these techniques, electroencephalography stands out due to three key advantages: it is a non-invasive method that allows data generation from any individual, has excellent temporal resolution--effectively capturing events occurring within milliseconds--and is relatively cost-effective compared to other examinations [3]. Electroencephalography monitors the brain's electrical activity through electrodes placed on the scalp, and the resulting data, known as the electroencephalogram (EEG), consists of a time series of electrical potentials that reflect neurological activity [4]. The EEG signal is widely used in the field of neuroscience and has the potential to advance brain-computer interfaces [5], facilitate emotion detection [6], enable classification of sleep stages [7] and help clinicians and researchers in identifying brain diseases, including but not limited to Alzheimer's disease [8], dyslexia [9], schizophrenia [10], Creutzfeldt-Jakob disease [11] and cognitive impairment [12]. Epilepsy, for example, is a neurological disorder characterized by abnormal brain activity that can lead to seizures, unusual behaviors, or even loss of consciousness.

data mining, data quality, machine learning, (18 more...)

2504.17908

Country:

South America > Brazil > Minas Gerais (0.04)
North America > United States > Massachusetts (0.04)
Europe > Switzerland > Geneva > Geneva (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (0.92)
Research Report > Experimental Study (0.68)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Epilepsy (0.68)
Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.54)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
(4 more...)

Ma, Haroui, Quinzan, Francesco, Willem, Theresa, Bauer, Stefan

AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis

arXiv.org Machine LearningApr-28-2025

Machine learning (ML) systems for medical imaging have demonstrated remarkable diagnostic capabilities, but their susceptibility to biases poses significant risks, since biases may negatively impact generalization performance. In this paper, we introduce a novel statistical framework to evaluate the dependency of medical imaging ML models on sensitive attributes, such as demographics. Our method leverages the concept of counterfactual invariance, measuring the extent to which a model's predictions remain unchanged under hypothetical changes to sensitive attributes. We present a practical algorithm that combines conditional latent diffusion models with statistical hypothesis testing to identify and quantify such biases without requiring direct access to counterfactual data. Through experiments on synthetic datasets and large-scale real-world medical imaging datasets, including \textsc{cheXpert} and MIMIC-CXR, we demonstrate that our approach aligns closely with counterfactual fairness principles and outperforms standard baselines. This work provides a robust tool to ensure that ML diagnostic systems generalize well, e.g., across demographic groups, offering a critical step towards AI safety in healthcare. Code: https://github.com/Neferpitou3871/AI-Alignment-Medical-Imaging.

artificial intelligence, dataset, machine learning, (17 more...)

2504.19621

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(12 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Agrawal, Shubhada, Ramdas, Aaditya

On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds

arXiv.org Machine LearningApr-28-2025

We prove two lower bounds for stopping times of sequential tests between general composite nulls and alternatives. The first lower bound is for the setting where the type-1 error level $\alpha$ approaches zero, and equals $\log(1/\alpha)$ divided by a certain infimum KL divergence, termed $\operatorname{KL_{inf}}$. The second lower bound applies to the setting where $\alpha$ is fixed and $\operatorname{KL_{inf}}$ approaches 0 (meaning that the null and alternative sets are not separated) and equals $c \operatorname{KL_{inf}}^{-1} \log \log \operatorname{KL_{inf}}^{-1}$ for a universal constant $c > 0$. We also provide a sufficient condition for matching the upper bounds and show that this condition is met in several special cases. Given past work, these upper and lower bounds are unsurprising in their form; our main contribution is the generality in which they hold, for example, not requiring reference measures or compactness of the classes.

artificial intelligence, machine learning, sequential test, (16 more...)

2504.19952

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Data Science (0.68)

Petti, Samantha, Martí-Gómez, Carlos, Kinney, Justin B., Zhou, Juannan, McCandlish, David M.

On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing

arXiv.org Machine LearningApr-26-2025

Mappings from biological sequences (DNA, RNA, protein) to quantitative measures of sequence functionality play an important role in contemporary biology. We are interested in the related tasks of (i) inferring predictive sequence-to-function maps and (ii) decomposing sequence-function maps to elucidate the contributions of individual subsequences. Because each sequence-function map can be written as a weighted sum over subsequences in multiple ways, meaningfully interpreting these weights requires "gauge-fixing," i.e., defining a unique representation for each map. Recent work has established that most existing gauge-fixed representations arise as the unique solutions to $L_2$-regularized regression in an overparameterized "weight space" where the choice of regularizer defines the gauge. Here, we establish the relationship between regularized regression in overparameterized weight space and Gaussian process approaches that operate in "function space," i.e. the space of all real-valued functions on a finite set of sequences. We disentangle how weight space regularizers both impose an implicit prior on the learned function and restrict the optimal weights to a particular gauge. We also show how to construct regularizers that correspond to arbitrary explicit Gaussian process priors combined with a wide variety of gauges. Next, we derive the distribution of gauge-fixed weights implied by the Gaussian process posterior and demonstrate that even for long sequences this distribution can be efficiently computed for product-kernel priors using a kernel trick. Finally, we characterize the implicit function space priors associated with the most common weight space regularizers. Overall, our framework unifies and extends our ability to infer and interpret sequence-function relationships.

artificial intelligence, machine learning, modeling & simulation, (20 more...)

2504.19034

Country:

North America > United States > Florida > Alachua County > Gainesville (0.14)
Europe > France (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Machine LearningApr-25-2025

CAPO: Cost-Aware Prompt Optimization

Zehle, Tom, Schlager, Moritz, Heiß, Timo, Feurer, Matthias

Large language models (LLMs) have revolutionized natural language processing by solving a wide range of tasks simply guided by a prompt. Yet their performance is highly sensitive to prompt formulation. While automated prompt optimization addresses this challenge by finding optimal prompts, current methods require a substantial number of LLM calls and input tokens, making prompt optimization expensive. We introduce CAPO (Cost-Aware Prompt Optimization), an algorithm that enhances prompt optimization efficiency by integrating AutoML techniques. CAPO is an evolutionary approach with LLMs as operators, incorporating racing to save evaluations and multi-objective optimization to balance performance with prompt length. It jointly optimizes instructions and few-shot examples while leveraging task descriptions for improved robustness. Our extensive experiments across diverse datasets and LLMs demonstrate that CAPO outperforms state-of-the-art discrete prompt optimization methods in 11/15 cases with improvements up to 21%p. Our algorithm achieves better performances already with smaller budgets, saves evaluations through racing, and decreases average prompt length via a length penalty, making it both cost-efficient and cost-aware. Even without few-shot examples, CAPO outperforms its competitors and generally remains robust to initial prompts. CAPO represents an important step toward making prompt optimization more powerful and accessible by improving cost-efficiency.

large language model, machine learning, natural language, (22 more...)

2504.16005

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Indonesia > Bali (0.04)
South America > Colombia > Meta Department > Villavicencio (0.04)
(2 more...)

Genre:

Overview (0.93)
Research Report > New Finding (0.92)

Industry:

Leisure & Entertainment (0.93)
Media (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Sridharan, Murali, Mäntylä, Mika, Rantala, Leevi

Detection, Classification and Prevalence of Self-Admitted Aging Debt

arXiv.org Artificial IntelligenceApr-25-2025

Context: Previous research on software aging is limited with focus on dynamic runtime indicators like memory and performance, often neglecting evolutionary indicators like source code comments and narrowly examining legacy issues within the TD context. Objective: We introduce the concept of Aging Debt (AD), representing the increased maintenance efforts and costs needed to keep software updated. We study AD through Self-Admitted Aging Debt (SAAD) observed in source code comments left by software developers. Method: We employ a mixed-methods approach, combining qualitative and quantitative analyses to detect and measure AD in software. This includes framing SAAD patterns from the source code comments after analysing the source code context, then utilizing the SAAD patterns to detect SAAD comments. In the process, we develop a taxonomy for SAAD that reflects the temporal aging of software and its associated debt. Then we utilize the taxonomy to quantify the different types of AD prevalent in OSS repositories. Results: Our proposed taxonomy categorizes temporal software aging into Active and Dormant types. Our extensive analysis of over 9,000+ Open Source Software (OSS) repositories reveals that more than 21% repositories exhibit signs of SAAD as observed from our gold standard SAAD dataset. Notably, Dormant AD emerges as the predominant category, highlighting a critical but often overlooked aspect of software maintenance. Conclusion: As software volume grows annually, so do evolutionary aging and maintenance challenges; our proposed taxonomy can aid researchers in detailed software aging studies and help practitioners develop improved and proactive maintenance strategies.

machine learning, natural language, programming language, (17 more...)

2504.17428

Country:

South America (0.67)
Europe > Finland (0.28)
North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (0.67)
Information Technology > Software (0.48)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)