AITopics | Islam, Md Jahidul

Collaborating Authors

Islam, Md Jahidul

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ClipRover: Zero-shot Vision-Language Exploration and Target Discovery by Mobile Robots

Zhang, Yuxuan, Abdullah, Adnan, Koppal, Sanjeev J., Islam, Md Jahidul

arXiv.org Artificial IntelligenceFeb-12-2025

Vision-language navigation (VLN) has emerged as a promising paradigm, enabling mobile robots to perform zero-shot inference and execute tasks without specific pre-programming. However, current systems often separate map exploration and path planning, with exploration relying on inefficient algorithms due to limited (partially observed) environmental information. In this paper, we present a novel navigation pipeline named ''ClipRover'' for simultaneous exploration and target discovery in unknown environments, leveraging the capabilities of a vision-language model named CLIP. Our approach requires only monocular vision and operates without any prior map or knowledge about the target. For comprehensive evaluations, we design the functional prototype of a UGV (unmanned ground vehicle) system named ''Rover Master'', a customized platform for general-purpose VLN tasks. We integrate and deploy the ClipRover pipeline on Rover Master to evaluate its throughput, obstacle avoidance capability, and trajectory performance across various real-world scenarios. Experimental results demonstrate that ClipRover consistently outperforms traditional map traversal algorithms and achieves performance comparable to path-planning methods that depend on prior map and target knowledge. Notably, ClipRover offers real-time active navigation without requiring pre-captured candidate images or pre-built node graphs, addressing key limitations of existing VLN pipelines.

artificial intelligence, natural language, planning & scheduling, (15 more...)

arXiv.org Artificial Intelligence

2502.08791

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Robots (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.73)
Information Technology > Artificial Intelligence > Natural Language (0.53)

Add feedback

Demonstrating CavePI: Autonomous Exploration of Underwater Caves by Semantic Guidance

Gupta, Alankrit, Abdullah, Adnan, Li, Xianyao, Ramesh, Vaishnav, Rekleitis, Ioannis, Islam, Md Jahidul

arXiv.org Artificial IntelligenceFeb-7-2025

Enabling autonomous robots to safely and efficiently navigate, explore, and map underwater caves is of significant importance to water resource management, hydrogeology, archaeology, and marine robotics. In this work, we demonstrate the system design and algorithmic integration of a visual servoing framework for semantically guided autonomous underwater cave exploration. We present the hardware and edge-AI design considerations to deploy this framework on a novel AUV (Autonomous Underwater Vehicle) named CavePI. The guided navigation is driven by a computationally light yet robust deep visual perception module, delivering a rich semantic understanding of the environment. Subsequently, a robust control mechanism enables CavePI to track the semantic guides and navigate within complex cave structures. We evaluate the system through field experiments in natural underwater caves and spring-water sites and further validate its ROS (Robot Operating System)-based digital twin in a simulation environment. Our results highlight how these integrated design choices facilitate reliable navigation under feature-deprived, GPS-denied, and low-visibility conditions.

artificial intelligence, caveline, cavepi, (15 more...)

arXiv.org Artificial Intelligence

2502.05384

Country: North America > United States > Delaware > New Castle County > Newark (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Energy (0.93)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.48)

Add feedback

BlueME: Robust Underwater Robot-to-Robot Communication Using Compact Magnetoelectric Antennas

Talebi, Mehron, Mahmud, Sultan, Khalifa, Adam, Islam, Md Jahidul

arXiv.org Artificial IntelligenceDec-28-2024

We present the design, development, and experimental validation of BlueME, a compact magnetoelectric (ME) antenna array system for underwater robot-to-robot communication. BlueME employs ME antennas operating at their natural mechanical resonance frequency to efficiently transmit and receive very-low-frequency (VLF) electromagnetic signals underwater. We outline the design, simulation, fabrication, and integration of the proposed system on low-power embedded platforms focusing on portable and scalable applications. For performance evaluation, we deployed BlueME on an autonomous surface vehicle (ASV) and a remotely operated vehicle (ROV) in open-water field trials. Our tests demonstrate that BlueME maintains reliable signal transmission at distances beyond 200 meters while consuming only 1 watt of power. Field trials show that the system operates effectively in challenging underwater conditions such as turbidity, obstacles, and multipath interference -- that generally affect acoustics and optics. Our analysis also examines the impact of complete submersion on system performance and identifies key deployment considerations. This work represents the first practical underwater deployment of ME antennas outside the laboratory, and implements the largest VLF ME array system to date. BlueME demonstrates significant potential for marine robotics and automation in multi-robot cooperative systems and remote sensor networks.

antenna, artificial intelligence, communication, (16 more...)

arXiv.org Artificial Intelligence

2411.09241

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Industry:

Energy > Oil & Gas > Upstream (1.00)
Electrical Industrial Apparatus (0.87)
Telecommunications (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Communications > Networks > Sensor Networks (0.34)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.34)

Add feedback

Human-Machine Interfaces for Subsea Telerobotics: From Soda-straw to Natural Language Interactions

Abdullah, Adnan, Chen, Ruo, Blow, David, Uthai, Thanakon, Du, Eric Jing, Islam, Md Jahidul

arXiv.org Artificial IntelligenceDec-2-2024

This review explores the evolution of human-machine interfaces (HMIs) for subsea telerobotics, tracing back the transition from traditional first-person "soda-straw" consoles (narrow field-of-view camera feed) to advanced interfaces powered by gesture recognition, virtual reality, and natural language models. First, we discuss various forms of subsea telerobotics applications, current state-of-the-art (SOTA) interface systems, and the challenges they face in robust underwater sensing, real-time estimation, and low-latency communication. Through this analysis, we highlight how advanced HMIs facilitate intuitive interactions between human operators and robots to overcome these challenges. A detailed review then categorizes and evaluates the cutting-edge HMI systems based on their offered features from both human perspectives (e.g., enhancing operator control and situational awareness) and machine perspectives (e.g., improving safety, mission accuracy, and task efficiency). Moreover, we examine the literature on bidirectional interaction and intelligent collaboration in terms of sensory feedback and intuitive control mechanisms for both physical and virtual interfaces. The paper concludes by identifying critical challenges, open research questions, and future directions, emphasizing the need for multidisciplinary collaboration in subsea telerobotics. Key words: Subsea telerobotics; marine robotics; human-machine interface; shared autonomy.

artificial intelligence, natural language, survey article, (19 more...)

arXiv.org Artificial Intelligence

2412.01753

Country:

Europe (0.67)
North America > United States > Florida (0.27)

Genre:

Research Report > Promising Solution (1.00)
Overview (1.00)
Research Report > New Finding (0.87)

Industry:

Health & Medicine > Consumer Health (1.00)
Government > Military (1.00)
Energy > Oil & Gas > Upstream (1.00)
Government > Regional Government > North America Government > United States Government (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.68)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.67)

Add feedback

Optimized Vessel Segmentation: A Structure-Agnostic Approach with Small Vessel Enhancement and Morphological Correction

Song, Dongning, Huang, Weijian, Liu, Jiarun, Islam, Md Jahidul, Yang, Hao, Wang, Shanshan

arXiv.org Artificial IntelligenceNov-22-2024

Accurate segmentation of blood vessels is essential for various clinical assessments and postoperative analyses. However, the inherent challenges of vascular imaging, such as sparsity, fine granularity, low contrast, data distribution variability, and the critical need for preserving topological structure, making generalized vessel segmentation particularly complex. While specialized segmentation methods have been developed for specific anatomical regions, their over-reliance on tailored models hinders broader applicability and generalization. General-purpose segmentation models introduced in medical imaging often fail to address critical vascular characteristics, including the connectivity of segmentation results. To overcome these limitations, we propose an optimized vessel segmentation framework: a structure-agnostic approach incorporating small vessel enhancement and morphological correction for multi-modality vessel segmentation. To train and validate this framework, we compiled a comprehensive multi-modality dataset spanning 17 datasets and benchmarked our model against six SAM-based methods and 17 expert models. The results demonstrate that our approach achieves superior segmentation accuracy, generalization, and a 34.6% improvement in connectivity, underscoring its clinical potential. An ablation study further validates the effectiveness of the proposed improvements. We will release the code and dataset at github following the publication of this work.

artificial intelligence, machine learning, segmentation, (16 more...)

arXiv.org Artificial Intelligence

2411.15251

Country:

Asia (0.46)
Europe (0.46)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

AquaFuse: Waterbody Fusion for Physics Guided View Synthesis of Underwater Scenes

Siddique, Md Abu Bakr, Wu, Jiayi, Rekleitis, Ioannis, Islam, Md Jahidul

arXiv.org Artificial IntelligenceNov-1-2024

We introduce the idea of AquaFuse, a physics-based method for synthesizing waterbody properties in underwater imagery. We formulate a closed-form solution for waterbody fusion that facilitates realistic data augmentation and geometrically consistent underwater scene rendering. AquaFuse leverages the physical characteristics of light propagation underwater to synthesize the waterbody from one scene to the object contents of another. Unlike data-driven style transfer, AquaFuse preserves the depth consistency and object geometry in an input scene. We validate this unique feature by comprehensive experiments over diverse underwater scenes. We find that the AquaFused images preserve over 94% depth consistency and 90-95% structural similarity of the input scenes. We also demonstrate that it generates accurate 3D view synthesis by preserving object geometry while adapting to the inherent waterbody fusion process. AquaFuse opens up a new research direction in data augmentation by geometry-preserving style transfer for underwater imaging and robot vision applications.

aquafuse, artificial intelligence, data augmentation, (15 more...)

arXiv.org Artificial Intelligence

2411.01119

Country:

North America > United States > South Carolina (0.28)
North America > United States > Maryland (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots

Chen, Ruo, Blow, David, Abdullah, Adnan, Islam, Md Jahidul

arXiv.org Artificial IntelligenceSep-26-2024

This paper explores the design and development of a language-based interface for dynamic mission programming of autonomous underwater vehicles (AUVs). The proposed 'Word2Wave' (W2W) framework enables interactive programming and parameter configuration of AUVs for remote subsea missions. The W2W framework includes: (i) a set of novel language rules and command structures for efficient language-to-mission mapping; (ii) a GPT-based prompt engineering module for training data generation; (iii) a small language model (SLM)-based sequence-to-sequence learning pipeline for mission command generation from human speech or text; and (iv) a novel user interface for 2D mission map visualization and human-machine interfacing. The proposed learning pipeline adapts an SLM named T5-Small that can learn language-to-mission mapping from processed language data effectively, providing robust and efficient performance. In addition to a benchmark evaluation with state-of-the-art, we conduct a user interaction study to demonstrate the effectiveness of W2W over commercial AUV programming interfaces. Across participants, W2W-based programming required less than 10% time for mission programming compared to traditional interfaces; it is deemed to be a simpler and more natural paradigm for subsea mission programming with a usability score of 76.25. W2W opens up promising future research opportunities on hands-free AUV mission programming for efficient subsea deployments.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2409.18405

Country: North America > United States (0.68)

Genre: Research Report (0.82)

Industry: Government > Military (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Weakly Supervised Caveline Detection For AUV Navigation Inside Underwater Caves

Yu, Boxiao, Tibbetts, Reagan, Barua, Titon, Morales, Ailani, Rekleitis, Ioannis, Islam, Md Jahidul

arXiv.org Artificial IntelligenceJun-28-2023

Underwater caves are challenging environments that are crucial for water resource management, and for our understanding of hydro-geology and history. Mapping underwater caves is a time-consuming, labor-intensive, and hazardous operation. For autonomous cave mapping by underwater robots, the major challenge lies in vision-based estimation in the complete absence of ambient light, which results in constantly moving shadows due to the motion of the camera-light setup. Thus, detecting and following the caveline as navigation guidance is paramount for robots in autonomous cave mapping missions. In this paper, we present a computationally light caveline detection model based on a novel Vision Transformer (ViT)-based learning pipeline. We address the problem of scarce annotated training data by a weakly supervised formulation where the learning is reinforced through a series of noisy predictions from intermediate sub-optimal models. We validate the utility and effectiveness of such weak supervision for caveline detection and tracking in three different cave locations: USA, Mexico, and Spain. Experimental results demonstrate that our proposed model, CL-ViT, balances the robustness-efficiency trade-off, ensuring good generalization performance while offering 10+ FPS on single-board (Jetson TX2) devices.

artificial intelligence, caveline, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2303.0367

Country: North America > United States (0.89)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback