AITopics

doi: 10.1145/3706598.3713295

2504.06771

Country:

North America > United States (1.00)
Europe > United Kingdom (0.67)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Overview (0.92)

Industry:

Health & Medicine (1.00)
Energy (1.00)
Banking & Finance > Trading (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Human Computer Interaction > Interfaces (0.93)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)
(4 more...)

Yanampally, Abhiram Reddy

High-Resource Translation: Turning Abundance into Accessibility

Yanampally Abhiram Reddy, ABV-IIITM Gwalior, MP, India. Abstract: This paper presents a novel approach to constructing an English-to-Telugu translation model by leveraging transfer learning techniques and addressing the challenges associated with low-resource languages. Utilizing the Bharat Parallel Corpus Collection (BPCC) as the primary dataset, the model incorporates iterative backtranslation to generate synthetic parallel data, effectively augmenting the training dataset and enhancing the model's translation capabilities. The focus of this research extends beyond mere translation accuracy; it encompasses a comprehensive strategy for improving model performance through data augmentation, optimization of training parameters, and the effective utilization of pre-trained models. By adopting these methodologies, we aim to create a more robust translation system that can handle a diverse range of sentence structures and linguistic nuances inherent to both English and Telugu. This research highlights the significance of innovative data handling techniques and the potential of transfer learning in overcoming the limitations posed by sparse datasets in low-resource languages. This research not only contributes to the field of machine translation but also aims to facilitate better communication and understanding between English and Telugu speakers in real-world contexts. Future work will concentrate on further enhancing the model's robustness and expanding its applicability to more complex sentence structures, ultimately ensuring its practical usability across various domains and applications. Introduction: Machine translation (MT) is a significant subfield of natural language processing (NLP) that focuses on automatically translating text from one language to another.

machine learning, natural language, translation, (21 more...)

2504.05914

Country:

Asia > India (0.24)
Europe > Finland > Uusimaa > Helsinki (0.04)
Europe > United Kingdom > Scotland (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)

Genre:

Research Report (0.70)
Overview (0.48)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Machine Learning Apr-9-2025

Probabilistic QoS Metric Forecasting in Delay-Tolerant Networks Using Conditional Diffusion Models on Latent Dynamics

Zhang, Enming, Liu, Zheng, Xiang, Yu, Qu, Yanwen

Enming Zhang, School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, China, b20060123@njupt.edu.cn; Zheng Liu, School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, China, zliu@njupt.edu.cn; Yu Xiang, School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, China, 1221045920@njupt.edu.cn. Abstract: Active QoS metric prediction, commonly employed in the maintenance and operation of DTN, could enhance network performance regarding latency, throughput, energy consumption, and dependability. Naturally formulated as a multivariate time series forecasting problem, it attracts substantial research efforts. Traditional mean regression methods for time series forecasting cannot capture the data complexity adequately, resulting in deteriorated performance in operational tasks in DTNs such as routing. This paper formulates the prediction of QoS metrics in DTN as a probabilistic forecasting problem on multivariate time series, where one could quantify the uncertainty of forecasts by characterizing the distribution of these samples. The proposed approach employs diffusion models and incorporates the latent temporal dynamics of non-stationary and multi-mode data into them.
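
As a rough illustration of what "quantifying the uncertainty of forecasts" from samples can look like, the Python sketch below summarizes a set of sampled forecasts for a single QoS metric by its median and an empirical interval. It does not implement the paper's diffusion model; the function name `summarize_samples`, the 5%/95% bounds, and the latency values are all hypothetical and chosen only for illustration.

```python
import statistics

def summarize_samples(samples, lower=0.05, upper=0.95):
    """Summarize sampled forecasts by their median and an empirical interval."""
    ordered = sorted(samples)
    n = len(ordered)

    def quantile(q):
        # Linear interpolation between neighbouring order statistics.
        pos = q * (n - 1)
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        return ordered[lo] * (1 - frac) + ordered[hi] * frac

    return {
        "median": statistics.median(ordered),
        "lower": quantile(lower),
        "upper": quantile(upper),
    }

# Hypothetical latency forecasts (ms) for one future time step,
# standing in for samples drawn from any generative forecaster.
latency_samples = [110.0, 95.0, 130.0, 105.0, 120.0]
summary = summarize_samples(latency_samples)
print(summary)
```

A wider gap between `lower` and `upper` indicates a less certain forecast, which is information a single mean prediction cannot convey.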

artificial intelligence, diffusion model, machine learning, (15 more...)

arXiv.org Machine Learning

2504.08821

Country:

Asia > China > Jiangsu Province > Nanjing (1.00)
Oceania > Australia (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(2 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Industry:

Information Technology (0.68)
Energy (0.48)
Telecommunications (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

An experimental survey and Perspective View on Meta-Learning for Automated Algorithms Selection and Parametrization

Garouani, Moncef

Considerable progress has been made in the recent literature to tackle the Algorithms Selection and Parametrization (ASP) problem, which is diversified in multiple meta-learning setups. Yet there is a lack of surveys and comparative evaluations that critically analyze, summarize and assess the performance of existing methods. In this paper, we provide an overview of the state of the art in this continuously evolving field. The survey sheds light on the motivational reasons for pursuing classifiers selection through meta-learning. In this regard, Automated Machine Learning (AutoML) is usually treated as an ASP problem under the umbrella of the democratization of machine learning. Accordingly, AutoML makes machine learning techniques accessible to domain scientists who are interested in applying advanced analytics but lack the required expertise. It can ease the task of manually selecting ML algorithms and tuning related hyperparameters. We comprehensively discuss the different phases of classifiers selection based on a generic framework that is formed as an outcome of reviewing prior works. Subsequently, we propose a benchmark knowledge base of 4 million previously learned models and present extensive comparative evaluations of the prominent methods for classifiers selection based on 8 classification algorithms and 400 benchmark datasets. The comparative study quantitatively assesses the performance of algorithms selection methods while emphasizing the strengths and limitations of existing studies.

data mining, evolutionary algorithm, machine learning, (18 more...)

2504.06207

Country: Europe (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.93)

Industry:

Information Technology (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
(6 more...)

Zhao, Xufang, Tsimhoni, Omer

Real-Time Pitch/F0 Detection Using Spectrogram Images and Convolutional Neural Networks

Pitch (also called F0 or fundamental frequency) is a very important voice feature for smart mobility features, such as driver's emotion detection, vehicle personalized profiles, and secured speaker identification. This paper presents a novel approach to detect F0 through Convolutional Neural Networks (CNN) and image processing techniques to directly estimate pitch from spectrogram images. Our new approach demonstrates a very good detection accuracy; a total of 92% of predicted pitch contours have strong or moderate correlations to the true pitch contours. Furthermore, the experimental comparison between our new approach and other state-of-the-art CNN methods reveals that our approach can enhance the detection rate by approximately 5% across various Signal-to-Noise Ratio (SNR) conditions. Pitch detection is very widely used for smart mobility features. For example, as shown in Fig. 1, pitch contour can be used to train a deep learning neural network for driver's emotion detection, which can alert road rage.
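
The abstract reports accuracy in terms of the correlation between predicted and true pitch contours. The following Python sketch shows one common way such a correlation could be computed, using the Pearson coefficient. This is an assumption: the abstract names neither the correlation measure nor the thresholds for "strong" or "moderate", so none are applied here, and the contour values are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("contours must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant contour")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical F0 contours in Hz, one value per analysis frame.
true_f0 = [120.0, 125.0, 130.0, 128.0, 122.0]
pred_f0 = [118.0, 126.0, 133.0, 127.0, 121.0]
r = pearson(true_f0, pred_f0)
print(f"correlation: {r:.3f}")
```

Repeating this for every utterance and counting how many coefficients exceed a chosen threshold would give a percentage of the kind quoted in the abstract.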

artificial intelligence, detection, machine learning, (16 more...)

2504.06165

Country: North America > United States > Michigan (0.14)

Genre:

Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Industry:

Transportation > Ground > Road (0.48)
Automobiles & Trucks (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Moreira, Gustavo, Bogucka, Edyta Paulina, Constantinides, Marios, Quercia, Daniele

The Hall of AI Fears and Hopes: Comparing the Views of AI Influencers and those of Members of the U.S. Public Through an Interactive Platform

AI development is shaped by academics and industry leaders - let us call them "influencers" - but it is unclear how their views align with those of the public. To address this gap, we developed an interactive platform that served as a data collection tool for exploring public views on AI, including their fears, hopes, and overall sense of hopefulness. We made the platform available to 330 participants representative of the U.S. population in terms of age, sex, ethnicity, and political leaning, and compared their views with those of 100 AI influencers identified by Time magazine. The public fears AI getting out of control, while influencers emphasize regulation, seemingly to deflect attention from their alleged focus on monetizing AI's potential. Interestingly, the views of AI influencers from underrepresented groups such as women and people of color often differ from the views of underrepresented groups in the public.

artificial intelligence, machine learning, natural language, (17 more...)

2504.06016

Country:

North America > United States (1.00)
Europe (1.00)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Overview (0.92)
Personal > Interview (0.67)

Industry:

Law (1.00)
Education (1.00)
Banking & Finance (1.00)
(4 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
(2 more...)

Wagner, Eitan, Keydar, Renana, Abend, Omri

Unsupervised Location Mapping for Narrative Corpora

This work presents the task of unsupervised location mapping, which seeks to map the trajectory of an individual narrative on a spatial map of locations in which a large set of narratives take place. Despite the fundamentality and generality of the task, very little work addressed the spatial mapping of narrative texts. The task consists of two parts: (1) inducing a "map" with the locations mentioned in a set of texts, and (2) extracting a trajectory from a single narrative and positioning it on the map. Following recent advances in increasing the context length of large language models, we propose a pipeline for this task in a completely unsupervised manner without predefining the set of labels. We test our method on two different domains: (1) Holocaust testimonies and (2) Lake District writing, namely multi-century literature on travels in the English Lake District. We perform both intrinsic and extrinsic evaluations for the task, with encouraging results, thereby setting a benchmark and evaluation practices for the task, as well as highlighting challenges.

large language model, machine learning, trajectory, (21 more...)

2504.05954

Country:

North America > United States (0.94)
Asia (0.68)
Europe > Poland > Lesser Poland Province > Kraków (0.14)

Genre:

Research Report (0.82)
Overview (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM

Zha, Jirong, Fan, Yuxuan, Yang, Xiao, Gao, Chen, Chen, Xinlei

3D spatial understanding is essential in real-world applications such as robotics, autonomous vehicles, virtual reality, and medical imaging. Recently, Large Language Models (LLMs), having demonstrated remarkable success across various domains, have been leveraged to enhance 3D understanding tasks, showing potential to surpass traditional computer vision methods. In this survey, we present a comprehensive review of methods integrating LLMs with 3D spatial understanding. We propose a taxonomy that categorizes existing methods into three branches: image-based methods deriving 3D understanding from 2D visual data, point cloud-based methods working directly with 3D representations, and hybrid modality-based methods combining multiple data streams. We systematically review representative methods along these categories, covering data representations, architectural modifications, and training strategies that bridge textual and 3D modalities. Finally, we discuss current limitations, including dataset scarcity and computational challenges, while highlighting promising research directions in spatial perception, multi-modal fusion, and real-world applications.

arxiv preprint arxiv, large language model, natural language, (14 more...)

2504.05786

Genre: Overview (1.00)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

AI-Driven Prognostics for State of Health Prediction in Li-ion Batteries: A Comprehensive Analysis with Validation

Ding, Tianqi, Xiang, Dawei, Sun, Tianyao, Qi, YiJiashum, Zhao, Zunduo

This paper presents a comprehensive review of AI-driven prognostics for State of Health (SoH) prediction in lithium-ion batteries. We compare the effectiveness of various AI algorithms, including FFNN, LSTM, and BiLSTM, across multiple datasets (CALCE, NASA, UDDS) and scenarios (e.g., varying temperatures and driving conditions). Additionally, we analyze the factors influencing SoH fluctuations, such as temperature and charge-discharge rates, and validate our findings through simulations. The results demonstrate that BiLSTM achieves the highest accuracy, with an average RMSE reduction of 15% compared to LSTM, highlighting its robustness in real-world applications.
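
To make the "RMSE reduction" figure concrete, here is a small Python sketch of how RMSE and a relative reduction between two models could be computed. The SoH values and the two prediction lists are hypothetical toy numbers, not data from the paper, and they are deliberately chosen so that the result differs from the reported 15%.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

def relative_reduction(baseline_error, candidate_error):
    """Fraction by which the candidate error is lower than the baseline."""
    return (baseline_error - candidate_error) / baseline_error

# Hypothetical state-of-health values (fraction of nominal capacity).
soh_true = [1.00, 0.95, 0.90, 0.85]
pred_baseline = [1.04, 0.91, 0.94, 0.81]   # every prediction off by 0.04
pred_candidate = [1.03, 0.92, 0.93, 0.82]  # every prediction off by 0.03

rmse_baseline = rmse(soh_true, pred_baseline)
rmse_candidate = rmse(soh_true, pred_candidate)
reduction = relative_reduction(rmse_baseline, rmse_candidate)
print(f"RMSE reduction: {reduction:.0%}")
```

An "average" reduction, as stated in the abstract, would presumably be obtained by averaging such values over datasets or scenarios; the abstract does not say how the average was taken.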

artificial intelligence, deep learning, machine learning, (15 more...)

2504.05728

Country:

North America > United States > Michigan (0.28)
North America > United States > Connecticut (0.28)

Genre:

Overview (0.87)
Research Report > New Finding (0.54)

Industry:

Energy > Energy Storage (1.00)
Electrical Industrial Apparatus (1.00)
Transportation > Ground > Road (0.96)
Government > Regional Government > North America Government > United States Government (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Maity, Subhankar, Deroy, Aniket

Leveraging Prompt-Tuning for Bengali Grammatical Error Explanation Using Large Language Models

We propose a novel three-step prompt-tuning method for Bengali Grammatical Error Explanation (BGEE) using state-of-the-art large language models (LLMs) such as GPT-4, GPT-3.5 Turbo, and Llama-2-70b. Our approach involves identifying and categorizing grammatical errors in Bengali sentences, generating corrected versions of the sentences, and providing natural language explanations for each identified error. We evaluate the performance of our BGEE system using both automated evaluation metrics and human evaluation conducted by experienced Bengali language experts. Our proposed prompt-tuning approach shows that GPT-4, the best performing LLM, surpasses the baseline model in automated evaluation metrics, with a 5.26% improvement in F1 score and a 6.95% improvement in exact match. Furthermore, compared to the previous baseline, GPT-4 demonstrates a decrease of 25.51% in wrong error type and a decrease of 26.27% in wrong error explanation. However, the results still lag behind the human baseline.
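
For readers unfamiliar with the two automated metrics named above, the Python sketch below shows one plausible way to compute them. The abstract does not specify the unit over which F1 is calculated, so the set-based definition here is an assumption, and the function names, placeholder sentences, and error-type labels are hypothetical.

```python
def exact_match(predictions, references):
    """Share of predictions identical to their reference after trimming."""
    if len(predictions) != len(references) or not references:
        raise ValueError("inputs must be non-empty and of equal length")
    matches = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return matches / len(references)

def f1_score(predicted, gold):
    """F1 between a set of predicted items and a set of gold items."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)

# Placeholder corrected sentences (English stand-ins for Bengali text).
system_output = ["she goes to school", "they was late"]
reference = ["she goes to school", "they were late"]
em = exact_match(system_output, reference)

# Hypothetical error-type labels for one sentence.
predicted_types = {"spelling", "case", "tense"}
gold_types = {"spelling", "tense", "agreement", "punctuation"}
f1 = f1_score(predicted_types, gold_types)
print(f"exact match: {em:.2f}, F1: {f1:.4f}")
```

Exact match rewards only fully correct outputs, while F1 gives partial credit, which is why the two metrics can improve by different amounts for the same system.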

explanation, large language model, machine learning, (17 more...)

2504.05642

Country:

Europe (1.00)
North America > United States (0.46)

Genre:

Research Report (0.50)
Overview (0.46)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)