Bucharest
Romania claims parts of possible Russian drone fell on its territory
Parts of what could be a Russian drone fell on Romanian territory, Romania's Defence Minister Angel Tilvar says, two days after Ukraine said Russian drones had detonated on the NATO member's land. Romanian officials had earlier denied reports of drones falling on Romanian territory and said Russian attacks in neighbouring Ukraine did not cause a direct threat. Tilvar told local news channel Antena 3 CNN on Wednesday that parts of what was most likely a drone were discovered in the eastern Tulcea county, an area of the Danube that forms a natural border between Romania and war-torn Ukraine. "I confirm that in this area, pieces that may be of a drone were found," he said, adding that the pieces did not pose a threat. He said the area had not been evacuated because there was nothing to suggest that the parts were dangerous and said the pieces would be analysed to confirm their origin.
An Open Hyperspectral Dataset with Sea-Land-Cloud Ground-Truth from the HYPSO-1 Satellite
Justo, Jon A., Garrett, Joseph, Langer, Dennis D., Henriksen, Marie B., Ionescu, Radu T., Johansen, Tor A.
Datasets from airborne sensors like AVIRIS [2], ROSIS [3], and HYDICE [4] contain Hyperspectral Imaging, employed in satellites for space remote labeled images with diverse land cover categories like urban sensing, like HYPSO-1, faces constraints due to few and agricultural areas. Additionally, other popular datasets labeled data sets, affecting the training of AI models demanding such as the Kennedy Space Center and Jasper Ridge have limited these ground-truth annotations. In this work, we water coverage due to the capture over small geographic introduce The HYPSO-1 Sea-Land-Cloud-Labeled Dataset, extents using airborne platforms. Despite the widespread use an open dataset with 200 diverse hyperspectral images from of these datasets in HSI classification, each set consists of a the HYPSO-1 mission, available in both raw and calibrated single labeled image, inadequate for training emerging dataintensive forms for scientific research in Earth observation.
ACTI at EVALITA 2023: Overview of the Conspiracy Theory Identification Task
Russo, Giuseppe, Stoehr, Niklas, Ribeiro, Manoel Horta
Automatic Conspiracy Theory Identification (ACTI) is a new shared task proposed for the first time at the EVALITA 2023 evaluation campaign. ACTI is based on a new, manually labeled dataset of comments scraped from conspiratorial Telegram channels and consists of two subtasks: (1) identifying conspiratorial content (conspiratorial content classification); and (2) classifying content into specific conspiracy theories (conspiratorial category classification). A total of 15 teams participated in the task with 81 submissions. In this task summary, we discuss the data and task, and outline the bestperforming approaches that are largely based on large language models. We conclude with a brief discussion of the application of large language models to counter the spread of misinformation on online platforms.
Learning Diverse Features in Vision Transformers for Improved Generalization
Nicolicioiu, Armand Mihai, Nicolicioiu, Andrei Liviu, Alexe, Bogdan, Teney, Damien
Deep learning models often rely only on a small set of features even when there is a rich set of predictive signals in the training data. This makes models brittle and sensitive to distribution shifts. In this work, we first examine vision transformers (ViTs) and find that they tend to extract robust and spurious features with distinct attention heads. As a result of this modularity, their performance under distribution shifts can be significantly improved at test time by pruning heads corresponding to spurious features, which we demonstrate using an "oracle selection" on validation data. Second, we propose a method to further enhance the diversity and complementarity of the learned features by encouraging orthogonality of the attention heads' input gradients. We observe improved out-of-distribution performance on diagnostic benchmarks (MNIST-CIFAR, Waterbirds) as a consequence of the enhanced diversity of features and the pruning of undesirable heads.
Semantic Change Detection for the Romanian Language
Truică, Ciprian-Octavian, Tudose, Victor, Apostol, Elena-Simona
Language is in a continuous process of change that occurs permanently, language change being the phenomenon that drives language evolution, as a process of adaptation to the environment and the ways other speakers use the language [3, 2]. The various instances of language change are classified into different categories, such as regular phonetic changes, changes in word usage, and changes in the way words appear together, i.e., syntactic changes. Although it is usually a continuous process that follows regular patterns, very abrupt changes in the meanings of words can still occur, usually motivated by a change in the context a community lives in [15, 9, 7]. Semantic change, as a phenomenon permanently present in language evolution, is an important aspect that should be taken into account when working with historical data[1]. Historical linguists, lexical typologists, and other humanities and social science experts have studied the meaning of words and how it changes over time.
Multi-Task Hypergraphs for Semi-supervised Learning using Earth Observations
Pirvu, Mihai, Marcu, Alina, Dobrescu, Alexandra, Belbachir, Nabil, Leordeanu, Marius
There are many ways of interpreting the world and they are highly interdependent. We exploit such complex dependencies and introduce a powerful multi-task hypergraph, in which every node is a task and different paths through the hypergraph reaching a given task become unsupervised teachers, by forming ensembles that learn to generate reliable pseudolabels for that task. Each hyperedge is part of an ensemble teacher for a given task and it is also a student of the self-supervised hypergraph system. We apply our model to one of the most important problems of our times, that of Earth Observation, which is highly multi-task and it often suffers from missing ground-truth data. By performing extensive experiments on the NASA NEO Dataset, spanning a period of 22 years, we demonstrate the value of our multi-task semi-supervised approach, by consistent improvements over strong baselines and recent work. We also show that the hypergraph can adapt unsupervised to gradual data distribution shifts and reliably recover, through its multi-task self-supervision process, the missing data for several observational layers for up to seven years.
GaitPT: Skeletons Are All You Need For Gait Recognition
Catruna, Andy, Cosma, Adrian, Radoi, Emilian
The analysis of patterns of walking is an important area of research that has numerous applications in security, healthcare, sports and human-computer interaction. Lately, walking patterns have been regarded as a unique fingerprinting method for automatic person identification at a distance. In this work, we propose a novel gait recognition architecture called Gait Pyramid Transformer (GaitPT) that leverages pose estimation skeletons to capture unique walking patterns, without relying on appearance information. GaitPT adopts a hierarchical transformer architecture that effectively extracts both spatial and temporal features of movement in an anatomically consistent manner, guided by the structure of the human skeleton. Our results show that GaitPT achieves state-of-the-art performance compared to other skeleton-based gait recognition works, in both controlled and in-the-wild scenarios. GaitPT obtains 82.6% average accuracy on CASIA-B, surpassing other works by a margin of 6%. Moreover, it obtains 52.16% Rank-1 accuracy on GREW, outperforming both skeleton-based and appearance-based approaches.
SkinDistilViT: Lightweight Vision Transformer for Skin Lesion Classification
Lungu-Stan, Vlad-Constantin, Cercel, Dumitru-Clementin, Pop, Florin
Skin cancer is a treatable disease if discovered early. We provide a production-specific solution to the skin cancer classification problem that matches human performance in melanoma identification by training a vision transformer on melanoma medical images annotated by experts. Since inference cost, both time and memory wise is important in practice, we employ knowledge distillation to obtain a model that retains 98.33% of the teacher's balanced multi-class accuracy, at a fraction of the cost. Memory-wise, our model is 49.60% smaller than the teacher. Time-wise, our solution is 69.25% faster on GPU and 97.96% faster on CPU. By adding classification heads at each level of the transformer and employing a cascading distillation process, we improve the balanced multi-class accuracy of the base model by 2.1%, while creating a range of models of various sizes but comparable performance.
CMISR: Circular Medical Image Super-Resolution
Li, Honggui, Trocan, Maria, Galayko, Dimitri, Sawan, Mohamad
Classical methods of medical image super-resolution (MISR) utilize open-loop architecture with implicit under-resolution (UR) unit and explicit super-resolution (SR) unit. The UR unit can always be given, assumed, or estimated, while the SR unit is elaborately designed according to various SR algorithms. The closed-loop feedback mechanism is widely employed in current MISR approaches and can efficiently improve their performance. The feedback mechanism may be divided into two categories: local and global feedback. Therefore, this paper proposes a global feedback-based closed-cycle framework, circular MISR (CMISR), with unambiguous UR and SR elements. Mathematical model and closed-loop equation of CMISR are built. Mathematical proof with Taylor-series approximation indicates that CMISR has zero recovery error in steady-state. In addition, CMISR holds plug-and-play characteristic which can be established on any existing MISR algorithms. Five CMISR algorithms are respectively proposed based on the state-of-the-art open-loop MISR algorithms. Experimental results with three scale factors and on three open medical image datasets show that CMISR is superior to MISR in reconstruction performance and is particularly suited to medical images with strong edges or intense contrast.
Synthesizing Political Zero-Shot Relation Classification via Codebook Knowledge, NLI, and ChatGPT
Hu, Yibo, Parolin, Erick Skorupa, Khan, Latifur, Brandt, Patrick T., Osorio, Javier, D'Orazio, Vito J.
Recent supervised models for event coding vastly outperform pattern-matching methods. However, their reliance solely on new annotations disregards the vast knowledge within expert databases, hindering their applicability to fine-grained classification. To address these limitations, we explore zero-shot approaches for political event ontology relation classification, by leveraging knowledge from established annotation codebooks. Our study encompasses both ChatGPT and a novel natural language inference (NLI) based approach named ZSP. ZSP adopts a tree-query framework that deconstructs the task into context, modality, and class disambiguation levels. This framework improves interpretability, efficiency, and adaptability to schema changes. By conducting extensive experiments on our newly curated datasets, we pinpoint the instability issues within ChatGPT and highlight the superior performance of ZSP. ZSP achieves an impressive 40% improvement in F1 score for fine-grained Rootcode classification. ZSP demonstrates competitive performance compared to supervised BERT models, positioning it as a valuable tool for event record validation and ontology development. Our work underscores the potential of leveraging transfer learning and existing expertise to enhance the efficiency and scalability of research in the field.