repeatability
- Europe > Germany > Saarland > Saarbrücken (0.14)
- North America > United States > Arizona > Maricopa County > Tempe (0.04)
- Asia (0.04)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- Research Report > New Finding (0.46)
- Research Report > Experimental Study (0.46)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Speech (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Asia > China > Beijing > Beijing (0.04)
- North America > Canada > Newfoundland and Labrador > Labrador (0.04)
- Asia > China > Shaanxi Province > Xi'an (0.04)
- Asia > China > Guangdong Province (0.04)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
- Information Technology > Sensing and Signal Processing > Image Processing (0.68)
Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer
A good supervised embedding for a specific machine learning task is only sensitive to changes in the label of interest and is invariant to other confounding factors. We leverage the concept of repeatability from measurement theory to describe this property and propose to use the intra-class correlation coefficient (ICC) to evaluate the repeatability of embeddings. We then propose a novel regularizer, the ICC regularizer, as a complementary component for contrastive losses to guide deep neural networks to produce embeddings with higher repeatability. We use simulated data to explain why the ICC regularizer works better on minimizing the intra-class variance than the contrastive loss alone. We implement the ICC regularizer and apply it to three speech tasks: speaker verification, voice style conversion, and a clinical application for detecting dysphonic voice. The experimental results demonstrate that adding an ICC regularizer can improve the repeatability of learned embeddings compared to only using the contrastive loss; further, these embeddings lead to improved performance in these downstream tasks.
Evaluating Magic Leap 2 Tool Tracking for AR Sensor Guidance in Industrial Inspections
Masuhr, Christian, Koch, Julian, Schüppstuhl, Thorsten
Rigorous evaluation of commercial Augmented Reality (AR) hardware is crucial, yet public benchmarks for tool tracking on modern Head-Mounted Displays (HMDs) are limited. This paper addresses this gap by systematically assessing the Magic Leap 2 (ML2) controllers tracking performance. Using a robotic arm for repeatable motion (EN ISO 9283) and an optical tracking system as ground truth, our protocol evaluates static and dynamic performance under various conditions, including realistic paths from a hydrogen leak inspection use case. The results provide a quantitative baseline of the ML2 controller's accuracy and repeatability and present a robust, transferable evaluation methodology. The findings provide a basis to assess the controllers suitability for the inspection use case and similar industrial sensor-based AR guidance tasks.
- North America > United States (0.04)
- Europe > Switzerland (0.04)
- Europe > Spain > Galicia > Madrid (0.04)
- (2 more...)
- Energy > Renewable (0.68)
- Health & Medicine > Health Care Technology (0.67)
NetGent: Agent-Based Automation of Network Application Workflows
Daneshamooz, Jaber, Vuong, Eugene, Koduru, Laasya, Chandrasekaran, Sanjay, Gupta, Arpit
We present NetGent, an AI-agent framework for automating complex application workflows to generate realistic network traffic datasets. Developing generalizable ML models for networking requires data collection from network environments with traffic that results from a diverse set of real-world web applications. However, using existing browser automation tools that are diverse, repeatable, realistic, and efficient remains fragile and costly. NetGent addresses this challenge by allowing users to specify workflows as natural-language rules that define state-dependent actions. These abstract specifications are compiled into nondeterministic finite automata (NFAs), which a state synthesis component translates into reusable, executable code. This design enables deterministic replay, reduces redundant LLM calls through state caching, and adapts quickly when application interfaces change. In experiments, NetGent automated more than 50+ workflows spanning video-on-demand streaming, live video streaming, video conferencing, social media, and web scraping, producing realistic traffic traces while remaining robust to UI variability. By combining the flexibility of language-based agents with the reliability of compiled execution, NetGent provides a scalable foundation for generating the diverse, repeatable datasets needed to advance ML in networking.
- Media (0.99)
- Information Technology > Security & Privacy (0.68)
- Government > Regional Government > North America Government > United States Government (0.46)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.84)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
- South America > Brazil > Paraná > Curitiba (0.04)
- North America > Canada (0.04)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
- Europe > Germany > Saarland > Saarbrücken (0.14)
- North America > United States > Arizona > Maricopa County > Tempe (0.04)
- Asia (0.04)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- Research Report > New Finding (0.46)
- Research Report > Experimental Study (0.46)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Speech (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Constrained Natural Language Action Planning for Resilient Embodied Systems
Byrd, Grayson, Rivera, Corban, Kemp, Bethany, Booker, Meghan, Schmidt, Aurora, de Melo, Celso M, Seenivasan, Lalithkumar, Unberath, Mathias
Corresponding Author Abstract--Replicating human-level intelligence in the execution of embodied tasks remains challenging due to the unconstrained nature of real-world environments. Novel use of large language models (LLMs) for task planning seeks to address the previously intractable state/action space of complex planning tasks, but hallucinations limit their reliability, and thus, viability beyond a research context. Additionally, the prompt engineering required to achieve adequate system performance lacks transparency, and thus, repeatability. In contrast to LLM planning, symbolic planning methods offer strong reliability and repeatability guarantees, but struggle to scale to the complexity and ambiguity of real-world tasks. We introduce a new robotic planning method that augments LLM planners with symbolic planning oversight to improve reliability and repeatability, and provide a transparent approach to defining hard constraints with considerably stronger clarity than traditional prompt engineering. Importantly, these augmentations preserve the reasoning capabilities of LLMs and retain impressive generalization in open-world environments. We demonstrate our approach in simulated and real-world environments. On the ALFWorld planning benchmark, our approach outperforms current state-of-the-art methods, achieving a near-perfect 99% success rate. Deployment of our method to a real-world quadruped robot resulted in 100% task success compared to 50% and 30% for pure LLM and symbolic planners across embodied pick and place tasks. Our approach presents an effective strategy to enhance the reliability, repeatability and transparency of LLM-based robot planners while retaining their key strengths: flexibility and generalizability to complex real-world environments. We hope that this work will contribute to the broad goal of building resilient embodied intelligent systems. By leveraging the strengths of both emerging Large Language Model (LLM)-based and well-understood symbolic planning components, we present a novel, hybrid approach to high-level embodied planning that allows for explicit and rigid constraint definition while preserving the adaptability and common-sense reasoning of its LLM components to enable a highly adaptable, reliable, repeatable, and transparent planning solution for use in unconstrained open-world environments. NABLING the reliable autonomy of embodied agents in complex and potentially unknown environments is a long-standing goal in robotics. Achieving this goal requires seamless interaction and understanding between a variety of system components including perception, control, navigation, and high-level planning.
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
- Asia > China > Beijing > Beijing (0.04)
- North America > Canada > Newfoundland and Labrador > Labrador (0.04)
- Asia > China > Shaanxi Province > Xi'an (0.04)
- Asia > China > Guangdong Province (0.04)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
- Information Technology > Sensing and Signal Processing > Image Processing (0.68)
Learning to Orient Surfaces by Self-supervised Spherical CNNs (Supplementary Material)
In this section, we study how the data augmentation carried out while training on local surface patches improves the robustness of Compass against self-occlusions and missing parts. Results for 3DMatch are shown in Table 1: the performance gain achieved by Compass when deploying the proposed data augmentation validates its importance. Indeed, without the proposed augmentation FLARE performs better than Compass on this dataset. Compass on 3DMatch and 3DMatch rotated. CNNs, we are able to achieve a similar performance on both datasets.
- South America > Brazil > Paraná > Curitiba (0.04)
- North America > Canada (0.04)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)