AITopics | Ajaccio

Collaborating Authors

Ajaccio

U.N. calls for probe after alleged drone attack on Gaza-bound aid flotilla

The Japan TimesSep-25-2025, 02:25:00 GMT

U.N. calls for probe after alleged drone attack on Gaza-bound aid flotilla Activists wave Palestinian flags as they gather to support a flotilla carrying humanitarian aid in Ajaccio, on the French Mediterranean island of Corsica, on Sept 12. | AFP-JIJI Rome - The United Nations called Wednesday for an investigation into alleged drone attacks against a Gaza-bound aid flotilla that prompted Italy and Spain to send naval ships to help. The Global Sumud Flotilla, carrying activists including Swedish environmentalist Greta Thunberg, blamed Israel for more than a dozen explosions heard around its vessels off Greece late on Tuesday. U.N. Human Rights Office spokesperson Thameen Al-Kheetan said anyone responsible for the violations should be held accountable, and called for an independent, impartial and thorough investigation. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right. With your current subscription plan you can comment on stories.

crime & legal science, drone attack, gaza-bound aid flotilla, (7 more...)

The Japan Times

Country:

Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.83)
Europe > Spain (0.25)
Europe > Italy (0.25)
(8 more...)

Industry: Government > Military > Navy (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.83)
Information Technology > Communications > Social Media (0.79)

Add feedback

NICE^k Metrics: Unified and Multidimensional Framework for Evaluating Deterministic Solar Forecasting Accuracy

Voyant, Cyril, Despotovic, Milan, Garcia-Gutierrez, Luis, Silva, Rodrigo Amaro e, Lauret, Philippe, Soubdhan, Ted, Bailek, Nadjem

arXiv.org Machine LearningAug-5-2025

Accurate solar energy output prediction is key for integrating renewables into grids, maintaining stability, and improving energy management. However, standard error metrics such as Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Skill Scores (SS) fail to capture the multidimensional nature of solar irradiance forecasting. These metrics lack sensitivity to forecastability, rely on arbitrary baselines (e.g., clear-sky models), and are poorly suited for operational use. To address this, we introduce the NICEk framework (Normalized Informed Comparison of Errors, with k = 1, 2, 3, Sigma), offering a robust and interpretable evaluation of forecasting models. Each NICEk score corresponds to an Lk norm: NICE1 targets average errors, NICE2 emphasizes large deviations, NICE3 highlights outliers, and NICESigma combines all. Using Monte Carlo simulations and data from 68 stations in the Spanish SIAR network, we evaluated methods including autoregressive models, extreme learning, and smart persistence. Theoretical and empirical results align when assumptions hold (e.g., R^2 ~ 1.0 for NICE2). Most importantly, NICESigma consistently shows higher discriminative power (p < 0.05), outperforming traditional metrics (p > 0.05). The NICEk metrics exhibit stronger statistical significance (e.g., p-values from 10^-6 to 0.004 across horizons) and greater generalizability. They offer a unified and operational alternative to standard error metrics in deterministic solar forecasting.

artificial intelligence, forecasting, machine learning, (18 more...)

arXiv.org Machine Learning

2508.01457

Country:

Europe > Portugal > Coimbra > Coimbra (0.04)
Africa > Middle East > Algeria > Adrar Province > Adrar (0.04)
Europe > Serbia > Šumadija and Western Serbia > Šumadija District > Kragujevac (0.04)
(7 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

On the Importance of Clearsky Model in Short-Term Solar Radiation Forecasting

Voyant, Cyril, Despotovic, Milan, Notton, Gilles, Saint-Drenan, Yves-Marie, Asloune, Mohammed, Garcia-Gutierrez, Luis

arXiv.org Artificial IntelligenceMar-6-2025

Clearsky models are widely used in solar energy for many applications such as quality control, resource assessment, satellite-base irradiance estimation and forecasting. However, their use in forecasting and nowcasting is associated with a number of challenges. Synchronization errors, reliance on the Clearsky index (ratio of the global horizontal irradiance to its cloud-free counterpart) and high sensitivity of the clearsky model to errors in aerosol optical depth at low solar elevation limit their added value in real-time applications. This paper explores the feasibility of short-term forecasting without relying on a clearsky model. We propose a Clearsky-Free forecasting approach using Extreme Learning Machine (ELM) models. ELM learns daily periodicity and local variability directly from raw Global Horizontal Irradiance (GHI) data. It eliminates the need for Clearsky normalization, simplifying the forecasting process and improving scalability. Our approach is a non-linear adaptative statistical method that implicitely learns the irradiance in cloud-free conditions removing the need for an clear-sky model and the related operational issues. Deterministic and probabilistic results are compared to traditional benchmarks, including ARMA with McClear-generated Clearsky data and quantile regression for probabilistic forecasts. ELM matches or outperforms these methods, providing accurate predictions and robust uncertainty quantification. This approach offers a simple, efficient solution for real-time solar forecasting. By overcoming the stationarization process limitations based on usual multiplicative scheme Clearsky models, it provides a flexible and reliable framework for modern energy systems.

forecast, forecasting, irradiance, (15 more...)

arXiv.org Artificial Intelligence

2503.07647

Country:

Europe > Spain (0.04)
Europe > Serbia > Šumadija and Western Serbia > Šumadija District > Kragujevac (0.04)
North America > United States > Colorado > Denver County > Denver (0.04)
(5 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

RisingBALLER: A player is a token, a match is a sentence, A path towards a foundational model for football players data analytics

Adjileye, Akedjou Achraff

arXiv.org Artificial IntelligenceOct-1-2024

In this paper, I introduce RisingBALLER, the first publicly available approach that leverages a transformer model trained on football match data to learn matchspecific player representations. Drawing inspiration from advances in language modeling, RisingBALLER treats each football match as a unique sequence in which players serve as tokens, with their embeddings shaped by the specific context of the match. Through the use of masked player prediction (MPP) as a pre-training task, RisingBALLER learns foundational features for football player representations, similar to how language models learn semantic features for text representations. As a downstream task, I introduce next match statistics prediction (NMSP) to showcase the effectiveness of the learned player embeddings. The NMSP model surpasses a strong baseline commonly used for performance forecasting within the community. Furthermore, I conduct an in-depth analysis to demonstrate how RisingBALLER's learned embeddings can be used in various football analytics tasks, such as producing meaningful positional features that capture the essence and variety of player roles beyond rigid x,y coordinates, team cohesion estimation, and similar player retrieval for more effective data-driven scouting. More than a simple machine learning model, RisingBALLER is a comprehensive framework designed to transform football data analytics by learning high-level foundational features for players, taking into account the context of each match. It offers a deeper understanding of football players beyond individual statistics. In recent years, the field of machine learning has been revolutionized by the introduction of the transformer architecture [1], which initially gained prominence in natural language processing (NLP) with models like BERT [2], RoBERTa [3], and more recently, the widespread use of large language models (LLMs). These models, often trained on seemingly simple tasks such as next token prediction or masked token prediction, have demonstrated remarkable performance in learning high-level features that effectively represent each word and model language intricately. They are capable of learning nuanced representations of the multiple meanings a word can have depending on its context.

representation, risingballer, statistics, (13 more...)

arXiv.org Artificial Intelligence

2410.00943

Country:

Europe > Spain > Galicia > Madrid (0.05)
Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
Europe > Germany > Rheinland-Pfalz > Mainz (0.04)
(13 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Football (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness

Wang, Wenxuan

arXiv.org Artificial IntelligenceAug-31-2024

Large language models (LLMs), such as ChatGPT, have rapidly penetrated into people's work and daily lives over the past few years, due to their extraordinary conversational skills and intelligence. ChatGPT has become the fastest-growing software in terms of user numbers in human history and become an important foundational model for the next generation of artificial intelligence applications. However, the generations of LLMs are not entirely reliable, often producing content with factual errors, biases, and toxicity. Given their vast number of users and wide range of application scenarios, these unreliable responses can lead to many serious negative impacts. This thesis introduces the exploratory works in the field of language model reliability during the PhD study, focusing on the correctness, non-toxicity, and fairness of LLMs from both software testing and natural language processing perspectives. First, to measure the correctness of LLMs, we introduce two testing frameworks, FactChecker and LogicAsker, to evaluate factual knowledge and logical reasoning accuracy, respectively. Second, for the non-toxicity of LLMs, we introduce two works for red-teaming LLMs. Third, to evaluate the fairness of LLMs, we introduce two evaluation frameworks, BiasAsker and XCulturalBench, to measure the social bias and cultural bias of LLMs, respectively.

38th ieee acm international conference, commercial software and research model, software engineering conference and symposium, (15 more...)

arXiv.org Artificial Intelligence

2409.00551

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
Europe > United Kingdom (0.13)
Asia > Russia (0.13)
(26 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Overview (1.00)
Workflow (0.92)

Industry:

Media (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Know When To Stop: A Study of Semantic Drift in Text Generation

Spataru, Ava, Hambro, Eric, Voita, Elena, Cancedda, Nicola

arXiv.org Artificial IntelligenceApr-8-2024

In this work, we explicitly show that modern LLMs tend to generate correct facts first, then "drift away" and generate incorrect facts later: this was occasionally observed but never properly measured. We develop a semantic drift score that measures the degree of separation between correct and incorrect facts in generated texts and confirm our hypothesis when generating Wikipedia-style biographies. This correct-then-incorrect generation pattern suggests that factual accuracy can be improved by knowing when to stop generation. Therefore, we explore the trade-off between information quantity and factual accuracy for several early stopping methods and manage to improve factuality by a large margin. We further show that reranking with semantic similarity can further improve these results, both compared to the baseline and when combined with early stopping. Finally, we try calling external API to bring the model back to the right generation path, but do not get positive results. Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.

paragraph, sd score, semantic drift, (14 more...)

arXiv.org Artificial Intelligence

2404.05411

Country:

North America > United States > California (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > France > Corsica > Ajaccio (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Media (0.68)
Leisure & Entertainment > Sports > Rugby > Rugby League (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

The Earth is Flat? Unveiling Factual Errors in Large Language Models

Wang, Wenxuan, Shi, Juluan, Tu, Zhaopeng, Yuan, Youliang, Huang, Jen-tse, Jiao, Wenxiang, Lyu, Michael R.

arXiv.org Artificial IntelligenceJan-1-2024

Large Language Models (LLMs) like ChatGPT are foundational in various applications due to their extensive knowledge from pre-training and fine-tuning. Despite this, they are prone to generating factual and commonsense errors, raising concerns in critical areas like healthcare, journalism, and education to mislead users. Current methods for evaluating LLMs' veracity are limited by test data leakage or the need for extensive human labor, hindering efficient and accurate error detection. To tackle this problem, we introduce a novel, automatic testing framework, FactChecker, aimed at uncovering factual inaccuracies in LLMs. This framework involves three main steps: First, it constructs a factual knowledge graph by retrieving fact triplets from a large-scale knowledge database. Then, leveraging the knowledge graph, FactChecker employs a rule-based approach to generates three types of questions (Yes-No, Multiple-Choice, and WH questions) that involve single-hop and multi-hop relations, along with correct answers. Lastly, it assesses the LLMs' responses for accuracy using tailored matching strategies for each question type. Our extensive tests on six prominent LLMs, including text-davinci-002, text-davinci-003, ChatGPT~(gpt-3.5-turbo, gpt-4), Vicuna, and LLaMA-2, reveal that FactChecker can trigger factual errors in up to 45\% of questions in these models. Moreover, we demonstrate that FactChecker's test cases can improve LLMs' factual accuracy through in-context learning and fine-tuning (e.g., llama-2-13b-chat's accuracy increase from 35.3\% to 68.5\%). We are making all code, data, and results available for future research endeavors.

factchecker, llm, triplet, (15 more...)

arXiv.org Artificial Intelligence

2401.00761

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Russia (0.14)
North America > United States > District of Columbia > Washington (0.05)
(12 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology (1.00)
Health & Medicine (1.00)
Media (0.87)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Modelling and Detection of Driver's Fatigue using Ontology

Lambert, Alexandre, Hina, Manolo Dulva, Barth, Celine, Soukane, Assia, Ramdane-Cherif, Amar

arXiv.org Artificial IntelligenceAug-31-2022

Road accidents have become the eight leading cause of death all over the world. Lots of these accidents are due to a driver's inattention or lack of focus, due to fatigue. Various factors cause driver's fatigue. This paper considers all the measureable data that manifest driver's fatigue, namely those manifested in the vehicle measureable data while driving as well as the driver's physical and physiological data. Each of the three main factors are further subdivided into smaller details. For example, the vehicle's data is composed of the values obtained from the steering wheel's angle, yaw angle, the position on the lane, and the speed and acceleration of the vehicle while moving. Ontological knowledge and rules for driver fatigue detection are to be integrated into an intelligent system so that on the first sign of dangerous level of fatigue is detected, a warning notification is sent to the driver. This work is intended to contribute to safe road driving.

intelligent system, ontology, vehicle, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.5220/0010689700003064

2208.14694

Country:

Europe > Latvia > Riga Municipality > Riga (0.04)
North America > United States > Virginia (0.04)
North America > United States > Montana (0.04)
(11 more...)

Genre: Research Report (0.82)

Industry:

Transportation > Ground > Road (1.00)
Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)

Add feedback

Bisecting for selecting: using a Laplacian eigenmaps clustering approach to create the new European football Super League

Bond, A. J., Beggs, C. B.

arXiv.org Machine LearningApr-20-2021

We use European football performance data to select teams to form the proposed European football Super League, using only unsupervised techniques. We first used random forest regression to select important variables predicting goal difference, which we used to calculate the Euclidian distances between teams. Creating a Laplacian eigenmap, we bisected the Fielder vector to identify the five major European football leagues' natural clusters. Our results showed how an unsupervised approach could successfully identify four clusters based on five basic performance metrics: shots, shots on target, shots conceded, possession, and pass success. The top two clusters identify those teams who dominate their respective leagues and are the best candidates to create the most competitive elite super league. Keywords: OR in sports; Selection; Unsupervised; Spectral clustering; Laplacian Eigenmap; Machine Learning 1. Introduction Operational research (OR) has a long history of using sport to explore operational insights and methodologies (see Wright, 2009 for a review).

goal difference, laplacian eigenmap, serie, (14 more...)

arXiv.org Machine Learning

2104.10125

Country:

Europe > Spain > Galicia > Madrid (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(26 more...)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Sports > Soccer (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback