AITopics | Indian Ocean

Collaborating Authors

Indian Ocean

MAGIC-VQA: Multimodal And Grounded Inference with Commonsense Knowledge for Visual Question Answering

Yang, Shuo, Luo, Siwen, Han, Soyeon Caren, Hovy, Eduard

arXiv.org Artificial IntelligenceMar-24-2025

Visual Question Answering (VQA) requires reasoning across visual and textual modalities, yet Large Vision-Language Models (LVLMs) often lack integrated commonsense knowledge, limiting their robustness in real-world scenarios. To address this, we introduce MAGIC-VQA, a novel framework that enhances VQA by systematically integrating commonsense knowledge with LVLMs. MAGIC-VQA employs a three-stage process: (1) Explicit Knowledge Integration from external sources, (2) By-Type Post-Processing for contextual refinement, and (3) Implicit Knowledge Augmentation using a Graph Neural Network (GNN) for structured reasoning. While GNNs bring greater depth to structured inference, they enable superior relational inference beyond LVLMs. MAGIC-VQA bridges a key gap by unifying commonsensse knowledge with LVLM-driven reasoning, eliminating the need for extensive pre-training or complex prompt tuning. Our framework achieves state-of-the-art performance on benchmark datasets, significantly improving commonsense reasoning in VQA.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2503.18491

Country:

Atlantic Ocean (0.04)
North America > United States > Virginia (0.04)
Pacific Ocean (0.04)
(4 more...)

Genre: Research Report (0.64)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

Fish Mouth Inspired Origami Gripper for Robust Multi-Type Underwater Grasping

Guo, Honghao, Huang, Junda, Zhang, Ian, Liang, Boyuan, Ma, Xin, Liu, Yunhui, Zhou, Jianshu

arXiv.org Artificial IntelligenceMar-20-2025

Robotic grasping and manipulation in underwater environments present unique challenges for robotic hands traditionally used on land. These challenges stem from dynamic water conditions, a wide range of object properties from soft to stiff, irregular object shapes, and varying surface frictions. One common approach involves developing finger-based hands with embedded compliance using underactuation and soft actuators. This study introduces an effective alternative solution that does not rely on finger-based hand designs. We present a fish mouth inspired origami gripper that utilizes a single degree of freedom to perform a variety of robust grasping tasks underwater. The innovative structure transforms a simple uniaxial pulling motion into a grasping action based on the Yoshimura crease pattern folding. The origami gripper offers distinct advantages, including scalable and optimizable design, grasping compliance, and robustness, with four grasping types: pinch, power grasp, simultaneous grasping of multiple objects, and scooping from the seabed. In this work, we detail the design, modeling, fabrication, and validation of a specialized underwater gripper capable of handling various marine creatures, including jellyfish, crabs, and abalone. By leveraging an origami and bio-inspired approach, the presented gripper demonstrates promising potential for robotic grasping and manipulation in underwater environments.

artificial intelligence, gripper, origami gripper, (16 more...)

arXiv.org Artificial Intelligence

2503.11049

Country:

Asia > China > Hong Kong (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Indian Ocean > Red Sea (0.04)
(7 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.72)

Add feedback

CNCast: Leveraging 3D Swin Transformer and DiT for Enhanced Regional Weather Forecasting

Liang, Hongli, Zhang, Yuanting, Meng, Qingye, He, Shuangshuang, Yuan, Xingyuan

arXiv.org Artificial IntelligenceMar-16-2025

This study introduces a cutting-edge regional weather forecasting model based on the SwinTransformer 3D architecture. This model is specifically designed to deliver precise hourly weather predictions ranging from 1 hour to 5 days, significantly improving the reliability and practicality of short-term weather forecasts. Our model has demonstrated generally superior performance when compared to Pangu, a well-established global model. The evaluation indicates that our model excels in predicting most weather variables, highlighting its potential as a more effective alternative in the field of limited area modeling. A noteworthy feature of this model is the integration of enhanced boundary conditions, inspired by traditional numerical weather prediction (NWP) techniques. This integration has substantially improved the model's predictive accuracy. Additionally, the model includes an innovative approach for diagnosing hourly total precipitation at a high spatial resolution of approximately 5 kilometers. This is achieved through a latent diffusion model, offering an alternative method for generating high-resolution precipitation data.

artificial intelligence, machine learning, precipitation, (18 more...)

arXiv.org Artificial Intelligence

2503.13546

Country:

Asia > China (0.06)
North America > United States (0.04)
Indian Ocean (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.66)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

Climate land use and other drivers impacts on island ecosystem services: a global review

Moustakas, Aristides, Zemah-Shamir, Shiri, Tase, Mirela, Zotos, Savvas, Demirel, Nazli, Zoumides, Christos, Christoforidi, Irene, Dindaroglu, Turgay, Albayrak, Tamer, Ayhan, Cigdem Kaptan, Fois, Mauro, Manolaki, Paraskevi, Sandor, Attila D., Sieber, Ina, Stamatiadou, Valentini, Tzirkalli, Elli, Vogiatzakis, Ioannis N., Zemah-Shamir, Ziv, Zittis, George

arXiv.org Artificial IntelligenceMar-13-2025

Islands are diversity hotspots and vulnerable to environmental degradation, climate variations, land use changes and societal crises. These factors can exhibit interactive impacts on ecosystem services. The study reviewed a large number of papers on the climate change-islands-ecosystem services topic worldwide. Potential inclusion of land use changes and other drivers of impacts on ecosystem services were sequentially also recorded. The study sought to investigate the impacts of climate change, land use change, and other non-climatic driver changes on island ecosystem services. Explanatory variables examined were divided into two categories: environmental variables and methodological ones. Environmental variables include sea zone geographic location, ecosystem, ecosystem services, climate, land use, other driver variables, Methodological variables include consideration of policy interventions, uncertainty assessment, cumulative effects of climate change, synergistic effects of climate change with land use change and other anthropogenic and environmental drivers, and the diversity of variables used in the analysis. Machine learning and statistical methods were used to analyze their effects on island ecosystem services. Negative climate change impacts on ecosystem services are better quantified by land use change or other non-climatic driver variables than by climate variables. The synergy of land use together with climate changes is modulating the impact outcome and critical for a better impact assessment. Analyzed together, there is little evidence of more pronounced for a specific sea zone, ecosystem, or ecosystem service. Climate change impacts may be underestimated due to the use of a single climate variable deployed in most studies. Policy interventions exhibit low classification accuracy in quantifying impacts indicating insufficient efficacy or integration in the studies.

climate change, ecosystem service, island, (11 more...)

arXiv.org Artificial Intelligence

2503.10278

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
Oceania > Marshall Islands (0.04)
(49 more...)

Genre: Research Report > New Finding (0.46)

Industry: Law > Real Estate Law (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Predicting Tropical Cyclone Track Forecast Errors using a Probabilistic Neural Network

Fernandez, M. A., Barnes, Elizabeth A., Barnes, Randal J., DeMaria, Mark, McGraw, Marie, Chirokova, Galina, Lu, Lixin

arXiv.org Artificial IntelligenceMar-12-2025

A new method for estimating tropical cyclone track uncertainty is presented and tested. This method uses a neural network to predict a bivariate normal distribution, which serves as an estimate for track uncertainty. We train the network and make predictions on forecasts from the National Hurricane Center (NHC), which currently uses static error distributions based on forecasts from the past five years for most applications. The neural network-based method produces uncertainty estimates that are dynamic and probabilistic. Further, the neural network-based method allows for probabilistic statements about tropical cyclone trajectories, including landfall probability, which we highlight. We show that our predictions are well calibrated using multiple metrics, that our method produces better uncertainty estimates than current NHC approaches, and that our method achieves similar performance to the Global Ensemble Forecast System. Once trained, the computational cost of predictions using this method is negligible, making it a strong candidate to improve the NHC's operational estimations of tropical cyclone track uncertainty.

forecast, lead time, prediction, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1175/AIES-D-24-0066.1

2503.0984

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Colorado > Larimer County > Fort Collins (0.04)
North America > United States > Texas (0.04)
(7 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

XAI4Extremes: An interpretable machine learning framework for understanding extreme-weather precursors under climate change

Wei, Jiawen, Bora, Aniruddha, Oommen, Vivek, Dong, Chenyu, Yang, Juntao, Adie, Jeff, Chen, Chen, See, Simon, Karniadakis, George, Mengaldo, Gianmarco

arXiv.org Artificial IntelligenceMar-11-2025

Extreme weather events are increasing in frequency and intensity due to climate change. This, in turn, is exacting a significant toll in communities worldwide. While prediction skills are increasing with advances in numerical weather prediction and artificial intelligence tools, extreme weather still present challenges. More specifically, identifying the precursors of such extreme weather events and how these precursors may evolve under climate change remain unclear. In this paper, we propose to use post-hoc interpretability methods to construct relevance weather maps that show the key extreme-weather precursors identified by deep learning models. We then compare this machine view with existing domain knowledge to understand whether deep learning models identified patterns in data that may enrich our understanding of extreme-weather precursors. We finally bin these relevant maps into different multi-year time periods to understand the role that climate change is having on these precursors. The experiments are carried out on Indochina heatwaves, but the methodology can be readily extended to other extreme weather events worldwide.

heatwave, machine learning, tackling climate change, (10 more...)

arXiv.org Artificial Intelligence

2503.08163

Country:

North America > United States (0.28)
Asia > Singapore (0.05)
Europe (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Kr\'eyoLID From Language Identification Towards Language Mining

Dent, Rasul, Suarez, Pedro Ortiz, Clérice, Thibault, Sagot, Benoît

arXiv.org Artificial IntelligenceMar-9-2025

Automatic language identification is frequently framed as a multi-class classification problem. However, when creating digital corpora for less commonly written languages, it may be more appropriate to consider it a data mining problem. For these varieties, one knows ahead of time that the vast majority of documents are of little interest. By minimizing resources spent on classifying such documents, we can create corpora much faster and with better coverage than using established pipelines. To demonstrate the effectiveness of the language mining perspective, we introduce a new pipeline and corpora for several French-based Creoles.

corpora, creole, threshold, (13 more...)

arXiv.org Artificial Intelligence

2503.06547

Country:

Indian Ocean (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(19 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Add feedback

ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy

Sun, Jianwen, Feng, Yukang, Li, Chuanhao, Zhang, Fanrui, Li, Zizhen, Ai, Jiaxin, Zhou, Sizhuo, Dai, Yu, Zhang, Shenglin, Zhang, Kaipeng

arXiv.org Artificial IntelligenceMar-9-2025

Unified models (UniMs) for multimodal understanding and generation have recently received much attention in the area of vision and language. Existing UniMs are designed to simultaneously learn both multimodal understanding and generation capabilities, demanding substantial computational resources, and often struggle to generate interleaved text-image. We present ARMOR, a resource-efficient and pure autoregressive framework that achieves both understanding and generation by fine-tuning existing multimodal large language models (MLLMs). Specifically, ARMOR extends existing MLLMs from three perspectives: (1) For model architecture, an asymmetric encoder-decoder architecture with a forward-switching mechanism is introduced to unify embedding space integrating textual and visual modalities for enabling natural text-image interleaved generation with minimal computational overhead. (2) For training data, a meticulously curated, high-quality interleaved dataset is collected for fine-tuning MLLMs. (3) For the training algorithm, we propose a ``what or how to generate" algorithm to empower existing MLLMs with multimodal generation capabilities while preserving their multimodal understanding capabilities, through three progressive training stages based on the collected dataset. Experimental results demonstrate that ARMOR upgrades existing MLLMs to UniMs with promising image generation capabilities, using limited training resources. Our code will be released soon at https://armor.github.io.

dataset, generation capability, mllm, (15 more...)

arXiv.org Artificial Intelligence

2503.06542

Country:

Asia > China > Shanghai > Shanghai (0.04)
Indian Ocean > Red Sea (0.04)
Asia > Middle East > Yemen (0.04)
(8 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

Zhao, Eric, Awasthi, Pranjal, Haghtalab, Nika

arXiv.org Artificial IntelligenceMar-7-2025

Finetuning provides a scalable and cost-effective means of customizing language models for specific tasks or response styles, with greater reliability than prompting or in-context learning. In contrast, the conventional wisdom is that injecting knowledge via finetuning results in brittle performance and poor generalization. We argue that the dichotomy of "task customization" (e.g., instruction tuning) and "knowledge injection" (e.g., teaching new facts) is a distinction without a difference. We instead identify concrete factors that explain the heterogeneous effectiveness observed with finetuning. To this end, we conduct a large-scale experimental study of finetuning the frontier Gemini v1.5 model family on a spectrum of datasets that are artificially engineered to interpolate between the strengths and failure modes of finetuning. Our findings indicate that question-answer training data formats provide much stronger knowledge generalization than document/article-style training data, numerical information can be harder for finetuning to retain than categorical information, and models struggle to apply finetuned knowledge during multi-step reasoning even when trained on similar examples -- all factors that render "knowledge injection" to be especially difficult, even after controlling for considerations like data augmentation and information volume. On the other hand, our findings also indicate that it is not fundamentally more difficult to finetune information about a real-world event than information about what a model's writing style should be.

evaluation task, figure 4, information, (12 more...)

arXiv.org Artificial Intelligence

2503.05919

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > District of Columbia > Washington (0.04)
(27 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government (0.93)
Leisure & Entertainment > Sports > Soccer (0.93)
Law (0.68)
Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)

Add feedback

Iran showcases new weapons as it prepares for a rocky 2025

Al JazeeraMar-6-2025, 14:11:26 GMT

Tehran, Iran – Iran's army and Islamic Revolutionary Guard Corps (IRGC) have been showcasing and testing new defensive and offensive weapons in large-scale military exercises for the past three months. The country is preparing for another tumultuous year amid threats by the United States and Israel to bomb Iranian nuclear facilities, critical energy infrastructure, and military sites. Iran is also promising a third iteration of its major military strikes on Israel, in retaliation for Israeli attacks amid the devastating war on Gaza. The exercises – Eqtedar, Zolfaqar and Great Prophet – have been held across Iran, the Sea of Oman and the northern Indian Ocean. The weapons tested show Iran intends to maintain its defiance of Israel and the West, refusing to negotiate with US President Donald Trump under his "maximum pressure" policy and continuing to advance its nuclear programme.

artificial intelligence, drone, iran, (15 more...)

Al Jazeera

Country:

North America > United States (0.90)
Asia > Middle East > Israel (0.68)
Asia > Middle East > Oman (0.26)
(6 more...)

Industry:

Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.55)
Government > Regional Government > Asia Government > Middle East Government > Iran Government (0.35)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.97)

Add feedback