AITopics

2503.03395

Country:

Europe > Germany (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Automobiles & Trucks (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.88)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.87)

Fontaine, Salomé A. Sepúveda, Amigó, José M.

Applications of Entropy in Data Analysis and Machine Learning: A Review

arXiv.org Machine LearningMar-4-2025

Since its origin in the thermodynamics of the 19th century, the concept of entropy has also permeated other fields of physics and mathematics, such as Classical and Quantum Statistical Mechanics, Information Theory, Probability Theory, Ergodic Theory and the Theory of Dynamical Systems. Specifically, we are referring to the classical entropies: the Boltzmann-Gibbs, von Neumann, Shannon, Kolmogorov-Sinai and topological entropies. In addition to their common name, which is historically justified (as we briefly describe in this review), other commonality of the classical entropies is the important role that they have played and are still playing in the theory and applications of their respective fields and beyond. Therefore, it is not surprising that, in the course of time, many other instances of the overarching concept of entropy have been proposed, most of them tailored to specific purposes. Following the current usage, we will refer to all of them, whether classical or new, simply as entropies. Precisely, the subject of this review is their applications in data analysis and machine learning. The reason for these particular applications is that entropies are very well suited to characterize probability mass distributions, typically generated by finite-state processes or symbolized signals. Therefore, we will focus on entropies defined as positive functionals on probability mass distributions and provide an axiomatic characterization that goes back to Shannon and Khinchin. Given the plethora of entropies in the literature, we have selected a representative group, including the classical ones. The applications summarized in this review finely illustrate the power and versatility of entropy in data analysis and machine learning.

application, data analysis and machine learning, entropy, (10 more...)

arXiv.org Machine Learning

2503.02921

Country:

Europe > Spain (0.14)
North America > United States > Massachusetts (0.14)
North America > Canada (0.14)
(2 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(3 more...)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
(4 more...)

Exploring Causality for HRI: A Case Study on Robotic Mental Well-being Coaching

Spitale, Micol, Babu, Srikar, Cakmak, Serhan, Cheong, Jiaee, Gunes, Hatice

One of the primary goals of Human-Robot Interaction (HRI) research is to develop robots that can interpret human behavior and adapt their responses accordingly. Adaptive learning models, such as continual and reinforcement learning, play a crucial role in improving robots' ability to interact effectively in real-world settings. However, these models face significant challenges due to the limited availability of real-world data, particularly in sensitive domains like healthcare and well-being. This data scarcity can hinder a robot's ability to adapt to new situations. To address these challenges, causality provides a structured framework for understanding and modeling the underlying relationships between actions, events, and outcomes. By moving beyond mere pattern recognition, causality enables robots to make more explainable and generalizable decisions. This paper presents an exploratory causality-based analysis through a case study of an adaptive robotic coach delivering positive psychology exercises over four weeks in a workplace setting. The robotic coach autonomously adapts to multimodal human behaviors, such as facial valence and speech duration. By conducting both macro- and micro-level causal analyses, this study aims to gain deeper insights into how adaptability can enhance well-being during interactions. Ultimately, this research seeks to advance our understanding of how causality can help overcome challenges in HRI, particularly in real-world applications.

coachee, interaction, robotic coach, (11 more...)

2503.11684

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Lombardy > Milan (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report > New Finding (0.94)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.34)

Kannan, Ashwin Viswanathan, Ganesan, Madhumitha

Neural Models of Task Adaptation: A Tutorial on Spiking Networks for Executive Control

The ability to adapt and switch between tasks is a fundamental Empirical studies further established the prefrontal cortex aspect of cognitive flexibility, shaping decision-making (PFC) as a key region in task-switching, with experiments such and behavioral efficiency in dynamic environments. Taskswitching as the Wisconsin Card Sorting Test (WCST) demonstrating its has been widely studied across disciplines such as role in adaptive behavior [14]-[16]. Spiking Neural Networks psychology, cognitive neuroscience, and artificial intelligence (SNNs) have emerged as a biologically realistic approach to [1], [2]. While humans often shift between tasks seamlessly, modeling neural dynamics, particularly due to their ability to performance variations arise depending on prior experience, replicate synaptic plasticity mechanisms such as Spike Timing-task familiarity, and cognitive load. Understanding these processes Dependent Plasticity (STDP) [10], [17]. Prior studies have requires computational models that can capture the successfully applied SNNs to pattern recognition and classification underlying neural mechanisms driving adaptive control and tasks [18] and have modeled sensory processing systems decision-making. Empirical studies have identified increased like the mammalian olfactory system [19]. These findings neural activity in the cognitive control network, particularly in establish a computational foundation for implementing taskswitching the prefrontal cortex (PFC), when engaging in task-switching models with biologically grounded learning dynamics.

mechanism, neuron, plasticity, (16 more...)

2503.03784

Country:

North America > United States > Wisconsin (0.24)
North America > United States > Oklahoma > Payne County > Stillwater (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Cognitive Science > Neuroscience (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.67)
Information Technology > Artificial Intelligence > Cognitive Science > Cognitive Architectures (0.47)

RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition

Zheng, Jinhui, Liu, Zhiquan, Si, Yain-Whar, Li, Jianqing, Zhang, Xinyuan, Li, Xiaofan, Huang, Haozhi, Gong, Xueyuan

Handwritten Paragraph Text Recognition (HPTR) is a challenging task in Computer Vision, requiring the transformation of a paragraph text image, rich in handwritten text, into text encoding sequences. One of the most advanced models for this task is Vertical Attention Network (VAN), which utilizes a Vertical Attention Module (VAM) to implicitly segment paragraph text images into text lines, thereby reducing the difficulty of the recognition task. However, from a network structure perspective, VAM is a single-branch module, which is less effective in learning compared to multi-branch modules. In this paper, we propose a new module, named Re-parameterizing Vertical Attention Fusion Module (RVAFM), which incorporates structural re-parameterization techniques. RVAFM decouples the structure of the module during training and inference stages. During training, it uses a multi-branch structure for more effective learning, and during inference, it uses a single-branch structure for faster processing. The features learned by the multi-branch structure are fused into the single-branch structure through a special fusion method named Re-parameterization Fusion (RF) without any loss of information. As a result, we achieve a Character Error Rate (CER) of 4.44% and a Word Error Rate (WER) of 14.37% on the IAM paragraph-level test set. Additionally, the inference speed is slightly faster than VAN.

dual-parameter layer, recognition, rvafm, (15 more...)

2503.03104

Country:

Asia > Macao (0.04)
Asia > China > Guangdong Province (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Text Recognition (0.63)

Generative assimilation and prediction for weather and climate

Yang, Shangshang, Nai, Congyi, Liu, Xinyan, Li, Weidong, Chao, Jie, Wang, Jingnan, Wang, Leyi, Li, Xichen, Chen, Xi, Lu, Bo, Xiao, Ziniu, Boers, Niklas, Yuan, Huiling, Pan, Baoxiang

Machine learning models have shown great success in predicting weather up to two weeks ahead, outperforming process-based benchmarks. However, existing approaches mostly focus on the prediction task, and do not incorporate the necessary data assimilation. Moreover, these models suffer from error accumulation in long roll-outs, limiting their applicability to seasonal predictions or climate projections. Here, we introduce Generative Assimilation and Prediction (GAP), a unified deep generative framework for assimilation and prediction of both weather and climate. By learning to quantify the probabilistic distribution of atmospheric states under observational, predictive, and external forcing constraints, GAP excels in a broad range of weather-climate related tasks, including data assimilation, seamless prediction, and climate simulation. In particular, GAP is competitive with state-of-the-art ensemble assimilation, probabilistic weather forecast and seasonal prediction, yields stable millennial simulations, and reproduces climate variability from daily to decadal time scales.

climatological distribution, constraint, prediction, (13 more...)

2503.03038

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China > Beijing > Beijing (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry:

Energy (0.93)
Government > Regional Government (0.46)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
(3 more...)

arXiv.org Artificial IntelligenceMar-3-2025

An Approach for Air Drawing Using Background Subtraction and Contour Extraction

Acharya, Ramkrishna

--In this paper, we propose a novel approach for air drawing that uses image processing techniques to draw on the screen by moving fingers in the air . This approach benefits a wide range of applications such as sign language, in-air drawing, and'writing' in the air as a new way of input. The approach starts with preparing ROI (Region of Interest) background images by taking a running average in initial camera frames and later subtracting it from the live camera frames to get a binary mask image. We calculate the pointer's position as the top of the contour on the binary image. When drawing a circle on the canvas in that position, it simulates the drawing. Furthermore, we combine the pre-trained T esseract model for OCR purposes. T o address the false contours, we perform hand detection based on the haar cascade before performing the background subtraction. In an experimental setup, we achieved a latency of only 100ms in air drawing.

background image, background subtraction, contour, (11 more...)

2503.01497

Country:

North America > United States (0.05)
Europe > Germany (0.05)

Genre: Research Report > Promising Solution (0.35)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.74)
Information Technology > Artificial Intelligence > Vision (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.30)

Joshi, Rishikesh, Sattar, Junaed

One-Shot Gesture Recognition for Underwater Diver-To-Robot Communication

arXiv.org Artificial IntelligenceMar-1-2025

Reliable human-robot communication is essential for underwater human-robot interaction (U-HRI), yet traditional methods such as acoustic signaling and predefined gesture-based models suffer from limitations in adaptability and robustness. In this work, we propose One-Shot Gesture Recognition (OSG), a novel method that enables real-time, pose-based, temporal gesture recognition underwater from a single demonstration, eliminating the need for extensive dataset collection or model retraining. OSG leverages shape-based classification techniques, including Hu moments, Zernike moments, and Fourier descriptors, to robustly recognize gestures in visually-challenging underwater environments. Our system achieves high accuracy on real-world underwater data and operates efficiently on embedded hardware commonly found on autonomous underwater vehicles (AUVs), demonstrating its feasibility for deployment on-board robots. Compared to deep learning approaches, OSG is lightweight, computationally efficient, and highly adaptable, making it ideal for diver-to-robot communication. We evaluate OSG's performance on an augmented gesture dataset and real-world underwater video data, comparing its accuracy against deep learning methods. Our results show OSG's potential to enhance U-HRI by enabling the immediate deployment of user-defined gestures without the constraints of predefined gesture languages.

gesture language, keypoint, recognition, (15 more...)

2503.00676

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
(4 more...)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Walters, Ben, Bethi, Yeshwanth, Kergan, Taylor, Nguyen, Binh, Amirsoleimani, Amirali, Eshraghian, Jason K., Afshar, Saeed, Azghadi, Mostafa Rahimi

NeuroMorse: A Temporally Structured Dataset For Neuromorphic Computing

arXiv.org Artificial IntelligenceFeb-28-2025

Neuromorphic engineering aims to advance computing by mimicking the brain's efficient processing, where data is encoded as asynchronous temporal events. This eliminates the need for a synchronisation clock and minimises power consumption when no data is present. However, many benchmarks for neuromorphic algorithms primarily focus on spatial features, neglecting the temporal dynamics that are inherent to most sequence-based tasks. This gap may lead to evaluations that fail to fully capture the unique strengths and characteristics of neuromorphic systems. In this paper, we present NeuroMorse, a temporally structured dataset designed for benchmarking neuromorphic learning systems. NeuroMorse converts the top 50 words in the English language into temporal Morse code spike sequences. Despite using only two input spike channels for Morse dots and dashes, complex information is encoded through temporal patterns in the data. The proposed benchmark contains feature hierarchy at multiple temporal scales that test the capacity of neuromorphic algorithms to decompose input patterns into spatial and temporal hierarchies. We demonstrate that our training set is challenging to categorise using a linear classifier and that identifying keywords in the test set is difficult using conventional methods.

dataset, neuromorphic computing, spike sequence, (11 more...)

2502.20729

Country:

Oceania > Australia > Queensland > Townsville (0.04)
North America > United States > California > Santa Cruz County > Santa Cruz (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.48)

Chire, Josimar, Mahmood, Khalid, Liang, Zhao

Complex Networks for Pattern-Based Data Classification

arXiv.org Artificial IntelligenceFeb-25-2025

Data classification techniques partition the data or feature space into smaller sub-spaces, each corresponding to a specific class. To classify into subspaces, physical features e.g., distance and distributions are utilized. This approach is challenging for the characterization of complex patterns that are embedded in the dataset. However, complex networks remain a powerful technique for capturing internal relationships and class structures, enabling High-Level Classification. Although several complex network-based classification techniques have been proposed, high-level classification by leveraging pattern formation to classify data has not been utilized. In this work, we present two network-based classification techniques utilizing unique measures derived from the Minimum Spanning Tree and Single Source Shortest Path. These network measures are evaluated from the data patterns represented by the inherent network constructed from each class. We have applied our proposed techniques to several data classification scenarios including synthetic and real-world datasets. Compared to the existing classic high-level and machine-learning classification techniques, we have observed promising numerical results for our proposed approaches. Furthermore, the proposed models demonstrate the following distinguished features in comparison to the previous high-level classification techniques: (1) A single network measure is introduced to characterize the data pattern, eliminating the need to determine weight parameters among network measures. Therefore, the model is largely simplified, while obtaining better classification results. (2) The metrics proposed are sensitive and used for classification with competitive results.

algorithm, dataset, sssp, (11 more...)

2503.05772

Country:

Europe > Sweden > Uppsala County > Uppsala (0.04)
Antarctica (0.04)
South America > Brazil > São Paulo (0.04)
(6 more...)

Genre:

Research Report (1.00)
Overview (0.68)

Industry:

Health & Medicine > Therapeutic Area (0.49)
Health & Medicine > Diagnostic Medicine > Imaging (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.67)