AITopics | tiny paper


Crypto-ncRNA: Non-coding RNA (ncRNA) Based Encryption Algorithm

Wang, Xu, Wang, Yiquan, Huang, Tin-yeh

arXiv.org Artificial Intelligence, Apr-28-2025

In the looming post-quantum era, traditional cryptographic systems are increasingly vulnerable to quantum computing attacks that can compromise their mathematical foundations. To address this critical challenge, we propose crypto-ncRNA, a bio-convergent cryptographic framework that leverages the dynamic folding properties of non-coding RNA (ncRNA) to generate high-entropy, quantum-resistant keys and produce unpredictable ciphertexts. The framework employs a novel, multi-stage process: encoding plaintext into RNA sequences, predicting and manipulating RNA secondary structures using advanced algorithms, and deriving cryptographic keys through the intrinsic physical unclonability of RNA molecules. Experimental evaluations indicate that, although crypto-ncRNA's encryption speed is marginally lower than that of AES, it significantly outperforms RSA in terms of efficiency and scalability while achieving a 100% pass rate on the NIST SP 800-22 randomness tests. These results demonstrate that crypto-ncRNA offers a promising and robust approach for securing digital infrastructures against the evolving threats posed by quantum computing. Moreover, with the rapid advancement of artificial intelligence, RNA-based research has gradually unfolded into a new realm of innovation (Townshend et al., 2021). Recent studies showed that the dynamic folding processes of RNA molecules intrinsically exhibit the characteristics of physical unclonable functions (PUFs) (Herder et al., 2014; Li et al., 2022; Luescher et al., 2024; Zhou et al., 2021), thereby establishing a pathway for designing post-quantum cryptography (PQC) systems (Arapinis et al., 2021; Cambou et al., 2021).
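The first stage named in the abstract, encoding plaintext into RNA sequences, can be illustrated with a minimal sketch. The 2-bit-per-nucleotide codebook below is an assumption chosen for illustration; the excerpt does not state the mapping the authors use, and this step alone is a reversible encoding, not encryption.

```python
# Hypothetical codebook: two bits per nucleotide. The paper's actual
# mapping is not given in the excerpt above.
BITS_TO_BASE = {0b00: "A", 0b01: "C", 0b10: "G", 0b11: "U"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def encode_to_rna(plaintext: bytes) -> str:
    """Map each byte to four nucleotides, most significant bit pair first."""
    bases = []
    for byte in plaintext:
        for shift in (6, 4, 2, 0):
            bases.append(BITS_TO_BASE[(byte >> shift) & 0b11])
    return "".join(bases)


def decode_from_rna(rna: str) -> bytes:
    """Invert encode_to_rna; expects a length that is a multiple of four."""
    if len(rna) % 4 != 0:
        raise ValueError("RNA length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(rna), 4):
        byte = 0
        for base in rna[i:i + 4]:
            byte = (byte << 2) | BASE_TO_BITS[base]
        out.append(byte)
    return bytes(out)
```

Under this assumed codebook, every byte becomes exactly four bases, so the RNA string is four times as long as the plaintext and decoding recovers the input exactly.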

ai4na workshop, artificial intelligence, data length, (15 more...)

arXiv.org Artificial Intelligence

2504.17878

Country:

North America > United States (0.46)
Asia > China (0.28)

Genre: Research Report > New Finding (0.54)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.93)
Banking & Finance > Trading (0.86)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence (1.00)


Enhancing Downstream Analysis in Genome Sequencing: Species Classification While Basecalling

Kodra, Riselda, Benmeziane, Hadjer, Boybat, Irem, Simon, William Andrew

arXiv.org Artificial Intelligence, Apr-10-2025

The ability to quickly and accurately identify microbial species in a sample, known as metagenomic profiling, is critical across various fields, from healthcare to environmental science. This paper introduces a novel method to profile signals coming from sequencing devices in parallel with determining their nucleotide sequences, a process known as basecalling, via a multi-objective deep neural network for simultaneous basecalling and multi-class genome classification. We introduce a new loss strategy where losses for basecalling and classification are back-propagated separately, with model weights combined for the shared layers, and a pre-configured ranking strategy allowing top-K species accuracy, giving users flexibility to choose between higher accuracy or higher speed at identifying the species. We achieve state-of-the-art basecalling accuracies, while classification accuracies meet and exceed the results of state-of-the-art binary classifiers, attaining an average of 92.5%/98.9% accuracy at identifying the top-1/3 species among a total of 17 genomes in the Wick bacterial dataset. The work presented here has implications for future studies in metagenomic profiling by accelerating the bottleneck step of matching the DNA sequence to the correct genome.
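The top-1/3 figures quoted above are top-K accuracies: a read counts as correct if its true species appears among the K highest-scoring classes. A minimal sketch of that metric follows; the function name and the toy scores are illustrative assumptions, not the authors' evaluation code.

```python
def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest scores.

    scores: list of per-class score lists, one per sample (hypothetical input).
    labels: list of true class indices.
    """
    hits = 0
    for row, label in zip(scores, labels):
        # Rank class indices from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy example: four reads, three candidate species.
toy_scores = [
    [0.7, 0.2, 0.1],
    [0.1, 0.6, 0.3],
    [0.5, 0.3, 0.2],
    [0.2, 0.1, 0.7],
]
toy_labels = [0, 2, 1, 1]
```

Raising K can only keep or increase the hit rate, which is the accuracy-versus-speed flexibility the abstract describes: a larger K leaves more candidate genomes for the downstream matching step to check.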

accuracy, artificial intelligence, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2504.07065

Country: Europe > Switzerland (0.28)

Genre: Research Report (0.84)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.71)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)


LongProLIP: A Probabilistic Vision-Language Model with Long Context Text

Chun, Sanghyuk, Yun, Sangdoo

arXiv.org Artificial Intelligence, Mar-13-2025

Recently, Probabilistic Language-Image Pre-Training (ProLIP) has been proposed to tackle the multiplicity issue of vision-language (VL) tasks. Despite their success in probabilistic representation learning at scale, the ProLIP models cannot handle texts longer than a context length of 64 tokens, which limits their ability to capture rich contextual information from longer text sequences. To address this issue, this paper proposes a fine-tuning strategy for ProLIP to accept longer texts, e.g., 256 text tokens. Experimental results on Urban-1k and the DataComp evaluation suite show that the proposed LongProLIP recipe can improve understanding of long contexts while minimizing the negative effect of fine-tuning. We also observe a trade-off between long context understanding (measured by Urban-1k) and general zero-shot capability (measured by the DataComp evaluation datasets). Code is available at https://github.com/naver-ai/prolip

computer vision, dataset, longprolip, (11 more...)

arXiv.org Artificial Intelligence

2503.08048

Country: Europe > Spain > Andalusia > Granada Province > Granada (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)


Causal Covariate Shift Correction using Fisher information penalty

Khan, Behraj, Mirza, Behroz, Syed, Tahir

arXiv.org Artificial Intelligence, Feb-11-2025

We also present the baselines, dataset details, C^3 batchwise performance, λ selection details, and experimental setup.

A.1 Representing the Current Derivative with the Fisher Information Matrix

Let us consider a model with parameter $\theta$ and a likelihood function $p(X \mid \theta)$, where $X$ is the observed data. The true parameter $\theta$ can be estimated using an estimator $\hat{\theta}$. The Fisher information $I(\theta)$ can be defined as the expected value of the negative Hessian of the log-likelihood function:

$$I(\theta) = -\mathbb{E}\left[\frac{\partial^{2} \log p(X \mid \theta)}{\partial \theta \, \partial \theta^{T}}\right] \tag{4}$$

The Cramér-Rao Lower Bound (CRLB) states that for any unbiased estimator $\hat{\theta}$, the variance-covariance matrix $V(\hat{\theta})$ satisfies the inequality:

$$V(\hat{\theta}) \succeq I^{-1}(\theta) \tag{5}$$

The symbol $\succeq$ denotes the matrix inequality under which $V(\hat{\theta}) - I^{-1}(\theta)$ is positive semi-definite.
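As a worked illustration of the Fisher information and the Cramér-Rao bound described above, consider a scalar case that is not taken from the paper: $n$ i.i.d. samples $X_1, \dots, X_n \sim \mathcal{N}(\mu, \sigma^2)$ with known $\sigma^2$ and unknown mean $\mu$. The choice of model and the symbols $\mu$, $\sigma^2$, $n$, $\bar{X}$ are assumptions made only for this example.

```latex
\begin{align*}
\log p(X \mid \mu) &= -\frac{n}{2}\log\!\left(2\pi\sigma^{2}\right)
  - \frac{1}{2\sigma^{2}}\sum_{i=1}^{n}(X_i - \mu)^{2} \\
\frac{\partial^{2}}{\partial \mu^{2}} \log p(X \mid \mu) &= -\frac{n}{\sigma^{2}} \\
I(\mu) &= -\mathbb{E}\!\left[-\frac{n}{\sigma^{2}}\right] = \frac{n}{\sigma^{2}} \\
V(\bar{X}) &= \frac{\sigma^{2}}{n} = I^{-1}(\mu),
  \qquad \bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i
\end{align*}
```

In this case the sample mean is unbiased and its variance equals the inverse Fisher information, so the bound holds with equality; more samples or less noise raise $I(\mu)$ and tighten the bound.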

covariate shift, kuzushiji-mnist 75, permuted-mnist 95, (16 more...)

arXiv.org Artificial Intelligence

2502.15756

Country:

Asia > Pakistan > Sindh > Karachi Division > Karachi (0.04)
Europe > Hungary > Budapest > Budapest (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


Region Mixup

Saha, Saptarshi, Garain, Utpal

arXiv.org Artificial Intelligence, Sep-23-2024

This paper introduces a simple extension of mixup (Zhang et al., 2018) data augmentation to enhance generalization in visual recognition tasks. Unlike the vanilla mixup method, which blends entire images, our approach focuses on combining regions from multiple images. Mixup (Zhang et al., 2018) is a data augmentation method that trains models on weighted averages of randomly paired training points. The averaging weights are typically sampled from a beta distribution with parameter α, where α ensures that the generated training set remains close to the original dataset. Mixup-generated perturbations may adhere only to the direction towards any arbitrary data point, potentially resulting in suboptimal regularization (Guo et al., 2019).
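The vanilla mixup step described above (weighted averages of randomly paired training points, with the weight drawn from a beta distribution) can be sketched as follows. This shows only the baseline of Zhang et al. (2018), not the region-based variant the paper proposes; the function names, the flattened-list image representation, and the use of one weight per batch are assumptions made for illustration.

```python
import random


def mixup_pair(x_i, y_i, x_j, y_j, lam):
    """Blend two samples and their one-hot labels with weight lam."""
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_i, x_j)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_i, y_j)]
    return x, y


def mixup_batch(xs, ys, alpha, rng):
    """Mix each sample with a randomly chosen partner from the same batch.

    xs: list of flattened images (lists of floats); ys: one-hot labels.
    A single lam ~ Beta(alpha, alpha) is shared by the whole batch here.
    """
    lam = rng.betavariate(alpha, alpha)
    partners = list(range(len(xs)))
    rng.shuffle(partners)
    mixed = [
        mixup_pair(xs[i], ys[i], xs[j], ys[j], lam)
        for i, j in enumerate(partners)
    ]
    return mixed, lam
```

Because the labels are blended with the same weight as the inputs, each mixed label still sums to one when the originals are one-hot.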

international conference, region mixup, semanticscholar, (12 more...)

arXiv.org Artificial Intelligence

2409.15028

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.05)
North America > United States > California > San Diego County > San Diego (0.05)
Asia > India > West Bengal > Kolkata (0.05)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)


Is Watermarking LLM-Generated Code Robust?

Suresh, Tarun, Ugare, Shubham, Singh, Gagandeep, Misailovic, Sasa

arXiv.org Artificial Intelligence, Jun-28-2024

We present the first study of the robustness of existing watermarking techniques on Python code generated by large language models. Although existing works showed that watermarking can be robust for natural language, we show that it is easy to remove these watermarks on code by semantic-preserving transformations.
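To make "semantic-preserving transformation" concrete, the sketch below renames variables in a small Python function using the standard `ast` module (Python 3.9+ for `ast.unparse`). It is one illustrative transformation chosen here as an assumption; the abstract does not list which transformations the authors apply, and no watermark detector is modeled.

```python
import ast


class RenameVariables(ast.NodeTransformer):
    """Rename selected identifiers; behavior of the code is unchanged."""

    def __init__(self, mapping):
        self.mapping = mapping

    def visit_Name(self, node):
        # Variable reads and writes.
        if node.id in self.mapping:
            node.id = self.mapping[node.id]
        return node

    def visit_arg(self, node):
        # Function parameters.
        if node.arg in self.mapping:
            node.arg = self.mapping[node.arg]
        return node


def rename(source, mapping):
    """Parse source, apply the renaming, and regenerate the code text."""
    tree = RenameVariables(mapping).visit(ast.parse(source))
    return ast.unparse(tree)


original = "def add(a, b):\n    total = a + b\n    return total\n"
transformed = rename(original, {"a": "x", "b": "y", "total": "s"})
```

The transformed function computes the same result, yet its token sequence differs from the original, which is the kind of change a token-level watermark would have to survive.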

language model, transformation, watermark, (15 more...)

arXiv.org Artificial Intelligence

2403.17983

Country:

North America > United States > New Jersey (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)


CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning

Hira, Medha, Goel, Arnav, Gupta, Anubha

arXiv.org Artificial Intelligence, Jun-18-2024

This paper presents CrossVoice, a novel cascade-based Speech-to-Speech Translation (S2ST) system employing advanced ASR, MT, and TTS technologies with cross-lingual prosody preservation through transfer learning. We conducted comprehensive experiments comparing CrossVoice with direct-S2ST systems, showing improved BLEU scores on tasks such as Fisher Es-En and VoxPopuli Fr-En, and improved prosody preservation on the benchmark datasets CVSS-T and IndicTTS. With an average mean opinion score of 3.6 out of 4, speech synthesized by CrossVoice closely rivals human speech on the benchmark, highlighting the efficacy of cascade-based systems and transfer learning in multilingual S2ST with prosody transfer. Transformer-based models (Vaswani et al., 2017) have revolutionized speech processing, leading to significant advancements in automatic speech recognition and text-to-speech technologies (Latif et al., 2023; Prabhavalkar et al., 2023). This shift towards end-to-end systems has opened new avenues in Speech-to-Speech Translation (S2ST) for translating speech across languages.

bleu score, crossvoice, translation, (15 more...)

arXiv.org Artificial Intelligence

2406.00021

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
Asia > India > NCT > New Delhi (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)


Multilingual Prosody Transfer: Comparing Supervised & Transfer Learning

Goel, Arnav, Hira, Medha, Gupta, Anubha

arXiv.org Artificial Intelligence, Jun-18-2024

The field of prosody transfer in speech synthesis systems is rapidly advancing. This research evaluates two learning methods for adapting pre-trained monolingual text-to-speech (TTS) models to multilingual conditions: Supervised Fine-Tuning (SFT) and Transfer Learning (TL). The comparison uses three distinct metrics: Mean Opinion Score (MOS), Recognition Accuracy (RA), and Mel Cepstral Distortion (MCD). Results demonstrate that, in comparison to SFT, TL leads to significantly enhanced performance, with an average MOS higher by 1.53 points, a 37.5% increase in RA, and an approximately 7.8-point improvement in MCD. These findings are instrumental in helping build TTS models for low-resource languages.
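Of the three metrics, MCD is the one with a closed-form definition, so a short sketch may help. The code below uses the commonly cited form, (10 / ln 10) · sqrt(2 · Σ (c − c')²) averaged over frames; whether the paper uses exactly this variant is an assumption, as are the function name and the requirement that frames are already time-aligned with the energy coefficient removed.

```python
import math


def mel_cepstral_distortion(ref_frames, syn_frames):
    """Average MCD in dB between two aligned mel-cepstral sequences.

    ref_frames, syn_frames: lists of equal length, each frame a list of
    mel-cepstral coefficients (hypothetical, pre-aligned inputs).
    """
    if len(ref_frames) != len(syn_frames):
        raise ValueError("sequences must be time-aligned to equal length")
    const = 10.0 / math.log(10.0)
    total = 0.0
    for ref, syn in zip(ref_frames, syn_frames):
        squared_diff = sum((r - s) ** 2 for r, s in zip(ref, syn))
        total += const * math.sqrt(2.0 * squared_diff)
    return total / len(ref_frames)
```

Lower values mean the synthesized spectrum is closer to the reference, so an improvement in MCD corresponds to a decrease in this number.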

iclr 2024, prosody transfer, transfer learning, (11 more...)

arXiv.org Artificial Intelligence

2406.00022

Country:

Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.05)
Asia > India > NCT > New Delhi (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)


Solar Panel Segmentation: Self-Supervised Learning Solutions for Imperfect Datasets

Sagaram, Sankarshanaa, Didwania, Krish, Srivastava, Laven, Kasliwal, Aditya, Kailas, Pallavi, Verma, Ujjwal

arXiv.org Artificial Intelligence, Jun-2-2024

The increasing adoption of solar energy necessitates advanced methodologies for monitoring and maintenance to ensure optimal performance of solar panel installations. A critical component in this context is the accurate segmentation of solar panels from aerial or satellite imagery, which is essential for identifying operational issues and assessing efficiency. This paper addresses the significant challenges in panel segmentation, particularly the scarcity of annotated data and the labour-intensive nature of manual annotation for supervised learning. We explore and apply Self-Supervised Learning (SSL) to solve these challenges. We demonstrate that SSL significantly enhances model generalization under various conditions and reduces dependency on manually annotated data, paving the way for robust and adaptable solar panel segmentation solutions.

dataset, learning, segmentation, (11 more...)

arXiv.org Artificial Intelligence

2402.12843

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Asia > India > Karnataka > Bengaluru (0.04)
Asia > China > Jiangsu Province (0.04)

Genre: Research Report (0.64)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.82)


An Evaluation Benchmark for Autoformalization in Lean4

Gulati, Aryan, Ladsaria, Devanshu, Mishra, Shubhra, Sidhu, Jasdeep, Miranda, Brando

arXiv.org Artificial Intelligence, Jun-1-2024

Large Language Models (LLMs) hold the potential to revolutionize autoformalization. The introduction of Lean4, a mathematical programming language, presents an unprecedented opportunity to rigorously assess the autoformalization capabilities of LLMs. This paper introduces a novel evaluation benchmark designed for Lean4, applying it to test the abilities of state-of-the-art LLMs, including GPT-3.5, GPT-4, and Gemini Pro. Our comprehensive analysis reveals that, despite recent advancements, these LLMs still exhibit limitations in autoformalization, particularly in more complex areas of mathematics. These findings underscore the need for further development in LLMs to fully harness their potential in scientific research and development. This study not only benchmarks current LLM capabilities but also sets the stage for future enhancements in autoformalization.
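For readers unfamiliar with the task, autoformalization means turning an informal mathematical statement into a formal one that a proof assistant can check. The Lean 4 snippet below is a deliberately simple illustration written for this listing, not an item from the benchmark; the theorem name is hypothetical.

```lean
-- Informal statement: "Addition of natural numbers is commutative."
-- A formal Lean 4 counterpart, proved with the core library lemma.
theorem add_comm_nat (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```

A benchmark of this kind would typically supply the informal statement and judge whether the model's formal statement captures the same meaning, which becomes harder as the mathematics grows more complex.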

autoformalization, lean4, obj, (14 more...)

arXiv.org Artificial Intelligence

2406.06555

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.78)
