
Collaborating Authors: Cladera, Fernando


4D Metric-Semantic Mapping for Persistent Orchard Monitoring: Method and Dataset

arXiv.org Artificial Intelligence

Automated, persistent, and fine-grained monitoring of orchards at the individual tree or fruit level helps maximize crop yield and optimize resources such as water, fertilizers, and pesticides while preventing agricultural waste. Towards this goal, we present a 4D spatio-temporal metric-semantic mapping method that fuses data from multiple sensors, including LiDAR, an RGB camera, and an IMU, to monitor the fruits in an orchard across their growth season. We design a LiDAR-RGB fusion module for 3D fruit tracking and localization that first segments fruits using a deep neural network and then tracks them using the Hungarian assignment algorithm. Additionally, the 4D data association module aligns data from different growth stages into a common reference frame and tracks fruits spatio-temporally, providing information such as fruit counts, sizes, and positions. We demonstrate our method's accuracy in 4D metric-semantic mapping using data collected from a real orchard under natural, uncontrolled conditions with seasonal variations. We achieve a 3.1 percent error in total fruit count estimation for over 1,790 fruits across 60 apple trees, along with accurate size estimation, with a mean error of 1.1 cm. The datasets, consisting of LiDAR, RGB, and IMU data for five fruit species captured across their growth seasons, together with corresponding ground truth, will be made publicly available at: https://4d-metric-semantic-mapping.org/
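As a concrete illustration of the tracking step described above, the sketch below matches fruit detections between consecutive frames with the Hungarian algorithm via SciPy's linear_sum_assignment. The centroid-distance cost and the gating threshold are illustrative assumptions, not the paper's exact formulation.

```python
# Illustrative frame-to-frame fruit association via the Hungarian
# algorithm. The distance cost and gating threshold are assumptions.
import numpy as np
from scipy.optimize import linear_sum_assignment

def associate_fruits(prev_centroids, curr_centroids, max_dist=0.05):
    """Match 3D fruit centroids (meters) between consecutive frames.

    Returns (prev_idx, curr_idx) pairs; in a full tracker, unmatched
    detections would start new tracks.
    """
    # Pairwise Euclidean distances form the assignment cost matrix.
    cost = np.linalg.norm(
        prev_centroids[:, None, :] - curr_centroids[None, :, :], axis=-1
    )
    rows, cols = linear_sum_assignment(cost)
    # Gate matches: drop pairs farther apart than a fruit could move.
    return [(r, c) for r, c in zip(rows, cols) if cost[r, c] < max_dist]

# Example: three fruits, each moved about 1 cm between frames.
prev = np.array([[0.00, 0.0, 1.0], [0.50, 0.1, 1.2], [1.0, 0.00, 0.9]])
curr = np.array([[0.01, 0.0, 1.0], [0.51, 0.1, 1.2], [1.0, 0.01, 0.9]])
print(associate_fruits(prev, curr))  # [(0, 0), (1, 1), (2, 2)]
```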


EvMAPPER: High Altitude Orthomapping with Event Cameras

arXiv.org Artificial Intelligence

Traditionally, unmanned aerial vehicles (UAVs) rely on CMOS-based cameras to collect images of the world below. One of the most successful applications of UAVs is the generation of orthomosaics or orthomaps, in which a series of images is stitched together into a larger map. However, the use of CMOS-based cameras with global or rolling shutters means that orthomaps are vulnerable to challenging light conditions, motion blur, and the high-speed motion of independently moving objects under the camera. Event cameras are less sensitive to these issues, as their pixels trigger asynchronously on brightness changes. This work introduces the first orthomosaic approach using event cameras. In contrast to existing methods that rely only on CMOS cameras, our approach enables map generation even in challenging light conditions, including direct sunlight and after sunset.
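For readers unfamiliar with the sensor, the sketch below shows the usual first step for consuming event-camera output: binning asynchronous (x, y, t, polarity) events into a 2D frame over a time window. This is a generic illustration of the sensor model, not EvMAPPER's pipeline; shapes and names are assumptions.

```python
# Accumulate asynchronous events into a signed 2D frame; a generic
# preprocessing step for illustration, not EvMAPPER's method.
import numpy as np

def accumulate_events(events, height, width, t0, t1):
    """Sum signed event polarities over [t0, t1) into an image."""
    frame = np.zeros((height, width), dtype=np.float32)
    for x, y, t, polarity in events:
        if t0 <= t < t1:
            frame[y, x] += 1.0 if polarity else -1.0
    return frame

# Example: a few synthetic events on a 4x4 sensor.
events = [(1, 2, 0.001, True), (1, 2, 0.002, True), (3, 0, 0.003, False)]
print(accumulate_events(events, 4, 4, 0.0, 0.01))
```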


AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions

arXiv.org Artificial Intelligence

Neural Radiance Fields (NeRFs) have shown significant promise in 3D scene reconstruction and novel view synthesis. In agricultural settings, NeRFs can serve as digital twins, providing critical information for fruit detection, yield estimation, and other metrics important to farmers. However, traditional NeRFs are not robust to challenging lighting conditions, such as low light, extremely bright light, and varying illumination. To address these issues, this work leverages three different sensors: an RGB camera, an event camera, and a thermal camera. Our RGB scene reconstruction improves PSNR and SSIM by +2.06 dB and +8.3%, respectively. Our cross-spectral scene reconstruction enhances downstream fruit detection by +43.0% in mAP50 and +61.1% in mAP50-95. The integration of additional sensors leads to a more robust and informative NeRF. We demonstrate that our multi-modal system yields high-quality, photo-realistic reconstructions under various tree canopy covers and at different times of day. This work results in a resilient NeRF capable of performing well in visibly degraded scenarios, as well as a learned cross-spectral representation that is used for automated fruit detection.
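For reference, PSNR, the reconstruction metric quoted above, is 10 log10(MAX^2 / MSE), so a +2.06 dB gain corresponds to a mean-squared-error reduction by a factor of roughly 10^0.206 ≈ 1.6. A minimal sketch of the metric itself, independent of the paper's pipeline:

```python
# Reference implementation of PSNR for images scaled to [0, 1].
import numpy as np

def psnr(reference, reconstruction, max_val=1.0):
    """Peak signal-to-noise ratio in dB: 10 * log10(MAX^2 / MSE)."""
    mse = np.mean((reference - reconstruction) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)

# Example: a clean image versus a mildly noisy reconstruction.
rng = np.random.default_rng(0)
ref = rng.random((64, 64, 3))
noisy = np.clip(ref + rng.normal(0, 0.05, ref.shape), 0.0, 1.0)
print(f"PSNR: {psnr(ref, noisy):.2f} dB")
```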


Air-Ground Collaboration with SPOMP: Semantic Panoramic Online Mapping and Planning

arXiv.org Artificial Intelligence

Mapping and navigation have gone hand-in-hand since long before robots existed. Maps are a key form of communication, allowing someone who has never been somewhere to nonetheless navigate that area successfully. In the context of multi-robot systems, the maps and information that flow between robots are necessary for effective collaboration, whether those robots are operating concurrently, sequentially, or completely asynchronously. In this paper, we argue that maps must go beyond encoding purely geometric or visual information to enable increasingly complex autonomy, particularly between robots. We propose a framework for multi-robot autonomy, focusing in particular on air and ground robots operating in outdoor 2.5D environments. We show that semantic maps can enable the specification, planning, and execution of complex collaborative missions, including localization in GPS-denied settings. A distinguishing characteristic of this work is its strong emphasis on field experiments and testing, which demonstrates that these ideas can work at scale in the real world. We also perform extensive simulation experiments to validate our ideas at even larger scales. We believe these experiments and their results constitute a significant step toward advancing the state of the art in large-scale, collaborative multi-robot systems operating with real communication, navigation, and perception constraints.
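A minimal sketch of the kind of map this argument calls for: a 2.5D grid whose cells carry both elevation and a semantic label, so a ground robot can plan over traversability rather than geometry alone. The class list, cell layout, and slope check are illustrative assumptions, not SPOMP's actual representation.

```python
# Toy 2.5D semantic grid: elevation plus a per-cell class label.
# Layout and class list are assumptions for illustration only.
from dataclasses import dataclass
import numpy as np

CLASSES = ["road", "grass", "vegetation", "building", "unknown"]

@dataclass
class SemanticGrid2p5D:
    height: np.ndarray   # (H, W) elevation in meters
    label: np.ndarray    # (H, W) integer index into CLASSES
    resolution: float    # meters per cell

    def traversable(self, max_step=0.2):
        """Cells a ground robot may enter: a drivable class and a
        bounded local elevation change (a crude slope check)."""
        drivable = np.isin(self.label, [CLASSES.index("road"),
                                        CLASSES.index("grass")])
        dz = np.maximum(np.abs(np.gradient(self.height, axis=0)),
                        np.abs(np.gradient(self.height, axis=1)))
        return drivable & (dz < max_step)

# Example: a flat grassy field is fully traversable.
grid = SemanticGrid2p5D(height=np.zeros((10, 10)),
                        label=np.full((10, 10), CLASSES.index("grass")),
                        resolution=0.5)
print(grid.traversable().all())  # True
```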


Enabling Large-scale Heterogeneous Collaboration with Opportunistic Communications

arXiv.org Artificial Intelligence

Multi-robot collaboration in large-scale environments with limited-size teams and without external infrastructure is challenging, since the software framework required to support complex tasks must be robust to unreliable and intermittent communication links. In this work, we present MOCHA (Multi-robot Opportunistic Communication for Heterogeneous Collaboration), a framework for resilient multi-robot collaboration that enables large-scale exploration in the absence of continuous communication. MOCHA is based on a gossip communication protocol that allows robots to interact opportunistically whenever communication links are available, propagating information on a peer-to-peer basis. We demonstrate the performance of MOCHA through real-world experiments with commercial off-the-shelf (COTS) communication hardware. We further explore the system's scalability in simulation, evaluating the performance of our approach as the number of robots increases and communication ranges vary. Finally, we demonstrate how MOCHA can be tightly integrated with the planning stack of autonomous robots. We show a communication-aware planning algorithm for a high-altitude aerial robot executing a collaborative task while maximizing the amount of information shared with ground robots. The source code for MOCHA and the high-altitude UAV planning system is available open source at http://github.com/KumarRobotics/MOCHA and http://github.com/KumarRobotics/air_router.
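The sketch below illustrates the gossip pattern described above: each robot keeps a timestamped message store, and on opportunistic contact two peers merge their stores, each side keeping the newest copy of every entry. Data structures and field names are assumptions for illustration; the real protocol lives in the linked repositories.

```python
# Toy peer-to-peer gossip merge; an illustration of the pattern, not
# MOCHA's actual protocol (see the linked repositories for that).
from dataclasses import dataclass, field

@dataclass
class Robot:
    name: str
    store: dict = field(default_factory=dict)  # topic -> (stamp, payload)

    def publish(self, topic, stamp, payload):
        self.store[topic] = (stamp, payload)

    def sync(self, peer):
        """On contact, both sides keep the newest copy of each topic."""
        for topic in set(self.store) | set(peer.store):
            mine = self.store.get(topic, (-1.0, None))
            theirs = peer.store.get(topic, (-1.0, None))
            newest = max(mine, theirs, key=lambda entry: entry[0])
            self.store[topic] = peer.store[topic] = newest

# Example: a UAV and a UGV exchange everything on a chance encounter.
uav, ugv = Robot("uav"), Robot("ugv")
uav.publish("map/sector3", stamp=12.0, payload="<map tiles>")
ugv.publish("pose/ugv", stamp=15.5, payload="<pose estimate>")
uav.sync(ugv)
print(uav.store == ugv.store)  # True: stores converged
```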


SEER: Safe Efficient Exploration for Aerial Robots using Learning to Predict Information Gain

arXiv.org Artificial Intelligence

We address the problem of efficient 3D exploration in indoor environments for micro aerial vehicles with limited sensing capabilities and payload/power constraints. We develop an indoor exploration framework that uses learning to predict the occupancy of unseen areas, extracts semantic features, samples viewpoints to predict information gains for different exploration goals, and plans informative trajectories to enable safe and smart exploration. Extensive experimentation in simulated and real-world environments shows that the proposed approach outperforms the state-of-the-art exploration framework, reducing total path length by 24% in a structured indoor environment while achieving a higher success rate during exploration.
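The quantity SEER learns to predict, the information gain of a viewpoint, is commonly formalized as the total Shannon entropy of the occupancy cells that viewpoint would observe. A minimal sketch under that common formulation (the occupancy-grid interface is an assumption, not the paper's implementation):

```python
# Entropy-based information gain for a candidate viewpoint over an
# occupancy grid; a generic formulation, not SEER's learned predictor.
import numpy as np

def cell_entropy(p):
    """Shannon entropy (bits) of an occupancy probability in [0, 1]."""
    p = np.clip(p, 1e-6, 1 - 1e-6)
    return -(p * np.log2(p) + (1 - p) * np.log2(1 - p))

def information_gain(occupancy, visible_cells):
    """Score a viewpoint by the total entropy of the cells its sensor
    would observe; unknown cells (p = 0.5) contribute the most."""
    return float(sum(cell_entropy(occupancy[c]) for c in visible_cells))

# Example: an unknown cell plus a mostly-known one.
grid = {(0, 0): 0.5, (0, 1): 0.9, (1, 0): 0.5, (1, 1): 0.1}
print(information_gain(grid, [(0, 0), (0, 1)]))  # ~1.47 bits
```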


Active Metric-Semantic Mapping by Multiple Aerial Robots

arXiv.org Artificial Intelligence

Traditional approaches for active mapping focus on building geometric maps. For most real-world applications, however, actionable information is related to semantically meaningful objects in the environment. We propose an approach to the active metric-semantic mapping problem that enables multiple heterogeneous robots to collaboratively build a map of the environment. The robots actively explore to minimize the uncertainties in both semantic (object classification) and geometric (object modeling) information. We represent the environment using informative but sparse object models, each consisting of a basic shape and a semantic class label, and characterize uncertainties empirically using a large amount of real-world data. Given a prior map, we use this model to select actions for each robot to minimize uncertainties. The performance of our algorithm is demonstrated through multi-robot experiments in diverse real-world environments. The proposed framework is applicable to a wide range of real-world problems, such as precision agriculture, infrastructure inspection, and asset mapping in factories. A demo video can be found at https://youtu.be/S86SgXi54oU.
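A minimal sketch of the uncertainty-driven action selection described above: score each object model by its semantic (classification) entropy plus a weighted geometric (shape) variance, then greedily pick the action whose predicted posterior uncertainty is lowest. The scoring model, weights, and interfaces are illustrative assumptions, not the paper's empirically characterized uncertainty models.

```python
# Greedy action selection minimizing combined semantic and geometric
# uncertainty; the scoring model here is an assumption for illustration.
import numpy as np

def object_uncertainty(class_probs, shape_var, w_sem=1.0, w_geo=1.0):
    """Semantic entropy of the class distribution plus weighted
    variance of the object's shape estimate."""
    p = np.clip(np.asarray(class_probs, dtype=float), 1e-9, 1.0)
    semantic_entropy = -np.sum(p * np.log(p))
    return w_sem * semantic_entropy + w_geo * shape_var

def best_action(actions, predict):
    """Pick the action with the lowest predicted total uncertainty.
    predict(a) returns (class_probs, shape_var) per object after a."""
    return min(actions,
               key=lambda a: sum(object_uncertainty(p, v)
                                 for p, v in predict(a)))

# Example: action "b" yields a more confident class and tighter shape.
outcomes = {"a": [([0.5, 0.5], 0.20)], "b": [([0.9, 0.1], 0.05)]}
print(best_action(["a", "b"], lambda a: outcomes[a]))  # "b"
```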