Goto

Collaborating Authors

 Caspian Sea


KazQAD: Kazakh Open-Domain Question Answering Dataset

arXiv.org Artificial Intelligence

We introduce KazQAD -- a Kazakh open-domain question answering (ODQA) dataset -- that can be used in both reading comprehension and full ODQA settings, as well as for information retrieval experiments. KazQAD contains just under 6,000 unique questions with extracted short answers and nearly 12,000 passage-level relevance judgements. We use a combination of machine translation, Wikipedia search, and in-house manual annotation to ensure annotation efficiency and data quality. The questions come from two sources: translated items from the Natural Questions (NQ) dataset (only for training) and the original Kazakh Unified National Testing (UNT) exam (for development and testing). The accompanying text corpus contains more than 800,000 passages from the Kazakh Wikipedia. As a supplementary dataset, we release around 61,000 question-passage-answer triples from the NQ dataset that have been machine-translated into Kazakh. We develop baseline retrievers and readers that achieve reasonable scores in retrieval (NDCG@10 = 0.389 MRR = 0.382), reading comprehension (EM = 38.5 F1 = 54.2), and full ODQA (EM = 17.8 F1 = 28.7) settings. Nevertheless, these results are substantially lower than state-of-the-art results for English QA collections, and we think that there should still be ample room for improvement. We also show that the current OpenAI's ChatGPTv3.5 is not able to answer KazQAD test questions in the closed-book setting with acceptable quality. The dataset is freely available under the Creative Commons licence (CC BY-SA) at https://github.com/IS2AI/KazQAD.


Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages

arXiv.org Artificial Intelligence

Large, curated, web-crawled corpora play a vital role in training language models (LMs). They form the lion's share of the training data in virtually all recent LMs, such as the well-known GPT, LLaMA and XLM-RoBERTa models. However, despite this importance, relatively little attention has been given to the quality of these corpora. In this paper, we compare four of the currently most relevant large, web-crawled corpora (CC100, MaCoCu, mC4 and OSCAR) across eleven lower-resourced European languages. Our approach is two-fold: first, we perform an intrinsic evaluation by performing a human evaluation of the quality of samples taken from different corpora; then, we assess the practical impact of the qualitative differences by training specific LMs on each of the corpora and evaluating their performance on downstream tasks. We find that there are clear differences in quality of the corpora, with MaCoCu and OSCAR obtaining the best results. However, during the extrinsic evaluation, we actually find that the CC100 corpus achieves the highest scores. We conclude that, in our experiments, the quality of the web-crawled corpora does not seem to play a significant role when training LMs.


SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection

arXiv.org Artificial Intelligence

Synthetic Aperture Radar (SAR) object detection has gained significant attention recently due to its irreplaceable all-weather imaging capabilities. However, this research field suffers from both limited public datasets (mostly comprising <2K images with only mono-category objects) and inaccessible source code. To tackle these challenges, we establish a new benchmark dataset and an open-source method for large-scale SAR object detection. Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets, providing a large-scale and diverse dataset for research purposes. To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created. With this high-quality dataset, we conducted comprehensive experiments and uncovered a crucial challenge in SAR object detection: the substantial disparities between the pretraining on RGB datasets and finetuning on SAR datasets in terms of both data domain and model structure. To bridge these gaps, we propose a novel Multi-Stage with Filter Augmentation (MSFA) pretraining framework that tackles the problems from the perspective of data input, domain transition, and model migration. The proposed MSFA method significantly enhances the performance of SAR object detection models while demonstrating exceptional generalizability and flexibility across diverse models. This work aims to pave the way for further advancements in SAR object detection. The dataset and code is available at https://github.com/zcablii/SARDet_100K.


Saving the legacy of Hero Ibash: Evaluating Four Language Models for Aminoacian

arXiv.org Artificial Intelligence

This study assesses four cutting-edge language models in the underexplored Aminoacian language. Through evaluation, it scrutinizes their adaptability, effectiveness, and limitations in text generation, semantic coherence, and contextual understanding. Uncovering insights into these models' performance in a low-resourced language, this research pioneers pathways to bridge linguistic gaps. By offering benchmarks and understanding challenges, it lays groundwork for future advancements in natural language processing, aiming to elevate the applicability of language models in similar linguistic landscapes, marking a significant step toward inclusivity and progress in language technology.


Can AI Assistants Know What They Don't Know?

arXiv.org Artificial Intelligence

Recently, AI assistants based on large language models (LLMs) show surprising performance in many tasks, such as dialogue, solving math problems, writing code, and using tools. Although LLMs possess intensive world knowledge, they still make factual errors when facing some knowledge intensive tasks, like open-domain question answering. These untruthful responses from the AI assistant may cause significant risks in practical applications. We believe that an AI assistant's refusal to answer questions it does not know is a crucial method for reducing hallucinations and making the assistant truthful. Therefore, in this paper, we ask the question "Can AI assistants know what they don't know and express them through natural language?" To answer this question, we construct a model-specific "I don't know" (Idk) dataset for an assistant, which contains its known and unknown questions, based on existing open-domain question answering datasets. Then we align the assistant with its corresponding Idk dataset and observe whether it can refuse to answer its unknown questions after alignment. Experimental results show that after alignment with Idk datasets, the assistant can refuse to answer most its unknown questions. For questions they attempt to answer, the accuracy is significantly higher than before the alignment.


Language Models Represent Space and Time

arXiv.org Artificial Intelligence

The capabilities of large language models (LLMs) have sparked debate over whether such systems just learn an enormous collection of superficial statistics or a coherent model of the data generation process -- a world model. We find preliminary evidence for the latter by analyzing the learned representations of three spatial datasets (world, US, NYC places) and three temporal datasets (historical figures, artworks, news headlines) in the Llama-2 family of models. We discover that LLMs learn linear representations of space and time across multiple scales. These representations are robust to prompting variations and unified across different entity types (e.g. cities and landmarks). In addition, we identify individual ``space neurons'' and ``time neurons'' that reliably encode spatial and temporal coordinates. While further investigation is needed, our results suggest modern LLMs learn rich spatiotemporal representations of the real world and possess basic ingredients of a world model.


Russia-Ukraine war: List of key events, day 540

Al Jazeera

Here is the situation on Thursday, August 17, 2023. Ukraine said Russia carried out a series of drone attacks on grain silos and warehouses at a Danube River port near the border with Romania. Kyiv said its forces liberated the settlement of Urozhaine in the southeast, but top general Oleksandr Syrskyi warned the situation around Kupiansk on the northeastern front was deteriorating amid Russian counterattacks. Video obtained by Al Jazeera suggests a controversial unit of Chechen troops has been policing the town of Enerhodar near the Russian-occupied Zaporizhzhia Nuclear Power Plant. Russia's Ministry of Defence said it shot down three Ukrainian drones southwest of Moscow and one over Crimea.


Exceedance Probability Forecasting via Regression for Significant Wave Height Prediction

arXiv.org Artificial Intelligence

Significant wave height forecasting is a key problem in ocean data analytics. Predicting the significant wave height is crucial for estimating the energy production from waves. Moreover, the timely prediction of large waves is important to ensure the safety of maritime operations, e.g. passage of vessels. We frame the task of predicting extreme values of significant wave height as an exceedance probability forecasting problem. Accordingly, we aim at estimating the probability that the significant wave height will exceed a predefined threshold. This task is usually solved using a probabilistic binary classification model. Instead, we propose a novel approach based on a forecasting model. The method leverages the forecasts for the upcoming observations to estimate the exceedance probability according to the cumulative distribution function. We carried out experiments using data from a buoy placed in the coast of Halifax, Canada. The results suggest that the proposed methodology is better than state-of-the-art approaches for exceedance probability forecasting.


US, European allies demand action to end Russia's use of Iranian drones in Ukraine

FOX News

A joint statement from the U.S. Representative to the United Nations on behalf of a coalition of European countries has urged the U.N. to investigate Russia's use of Iranian drones in Ukraine. "Earlier this month, the United States released further information documenting how Iran has provided Russia with hundreds of one-way attack UAVs (unmanned aerial vehicles), as well as UAV production-related equipment. Ukraine and the U.K. also submitted evidence to the U.N. of Iranian UAVs recovered by the Ukrainian armed forces," Linda Thomas-Greenfield, United States Ambassador to the United Nations, told reporters. "Russia has not only procured hundreds of Mohajer and Shahed series UAVs from Iran in clear violation of Resolution 2231, but it is also now working with Iran to produce these weapons inside Russia," she continued, reading a statement on behalf of the U.S., the U.K., France, Ukraine and Albania. "Russia has been using these UAVs in recent weeks to strike Kyiv, destroy Ukrainian infrastructure, and kill and terrorize Ukrainian civilians. Media reports indicate just this week Russia targeted Kyiv and other Ukrainian cities with dozens of Iranian-made drones," she said, adding, "The United Nations must respond to growing calls from the international community to investigate these violations."


US says Iran is helping Russia build drone manufacturing facility

Al Jazeera

The United States has accused the Iranian government of helping Russia to build a drone manufacturing plant near Moscow, in an escalation of their defence cooperation. In a statement on Friday, White House National Security Council spokesman John Kirby cited US intelligence findings that indicated Iran had provided material support for the plant, which could be operational by early next year. US officials also double-downed on claims that Iran has sent hundreds of drones -- or unmanned aerial vehicles (UAVs) -- to Russia for use in Ukraine, where a full-scale invasion was launched in 2022. "Russia has been using Iranian UAVs in recent weeks to strike Kyiv and terrorize the Ukrainian population, and the Russia-Iran military partnership appears to be deepening," Kirby said in Friday's statement. "We are also concerned that Russia is working with Iran to produce Iranian UAVs from inside Russia."