Ajaccio
U.N. calls for probe after alleged drone attack on Gaza-bound aid flotilla
U.N. calls for probe after alleged drone attack on Gaza-bound aid flotilla Activists wave Palestinian flags as they gather to support a flotilla carrying humanitarian aid in Ajaccio, on the French Mediterranean island of Corsica, on Sept 12. | AFP-JIJI Rome - The United Nations called Wednesday for an investigation into alleged drone attacks against a Gaza-bound aid flotilla that prompted Italy and Spain to send naval ships to help. The Global Sumud Flotilla, carrying activists including Swedish environmentalist Greta Thunberg, blamed Israel for more than a dozen explosions heard around its vessels off Greece late on Tuesday. U.N. Human Rights Office spokesperson Thameen Al-Kheetan said anyone responsible for the violations should be held accountable, and called for an independent, impartial and thorough investigation. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right. With your current subscription plan you can comment on stories.
- Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.83)
- Europe > Spain (0.25)
- Europe > Italy (0.25)
- (8 more...)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.83)
- Information Technology > Communications > Social Media (0.79)
Short-Term Forecasting of Energy Production and Consumption Using Extreme Learning Machine: A Comprehensive MIMO based ELM Approach
Voyant, Cyril, Despotovic, Milan, Garcia-Gutierrez, Luis, Asloune, Mohammed, Saint-Drenan, Yves-Marie, Duchaud, Jean-Laurent, Faggianelli, hjuvan Antone, Magliaro, Elena
A novel methodology for short-term energy forecasting using an Extreme Learning Machine ($\mathtt{ELM}$) is proposed. Using six years of hourly data collected in Corsica (France) from multiple energy sources (solar, wind, hydro, thermal, bioenergy, and imported electricity), our approach predicts both individual energy outputs and total production (including imports, which closely follow energy demand, modulo losses) through a Multi-Input Multi-Output ($\mathtt{MIMO}$) architecture. To address non-stationarity and seasonal variability, sliding window techniques and cyclic time encoding are incorporated, enabling dynamic adaptation to fluctuations. The $\mathtt{ELM}$ model significantly outperforms persistence-based forecasting, particularly for solar and thermal energy, achieving an $\mathtt{nRMSE}$ of $17.9\%$ and $5.1\%$, respectively, with $\mathtt{R^2} > 0.98$ (1-hour horizon). The model maintains high accuracy up to five hours ahead, beyond which renewable energy sources become increasingly volatile. While $\mathtt{MIMO}$ provides marginal gains over Single-Input Single-Output ($\mathtt{SISO}$) architectures and offers key advantages over deep learning methods such as $\mathtt{LSTM}$, it provides a closed-form solution with lower computational demands, making it well-suited for real-time applications, including online learning. Beyond predictive accuracy, the proposed methodology is adaptable to various contexts and datasets, as it can be tuned to local constraints such as resource availability, grid characteristics, and market structures.
- Europe > Serbia > Šumadija and Western Serbia > Šumadija District > Kragujevac (0.04)
- Europe > Italy (0.04)
- Europe > France > Corsica > Ajaccio (0.04)
- (3 more...)
NICE^k Metrics: Unified and Multidimensional Framework for Evaluating Deterministic Solar Forecasting Accuracy
Voyant, Cyril, Despotovic, Milan, Garcia-Gutierrez, Luis, Silva, Rodrigo Amaro e, Lauret, Philippe, Soubdhan, Ted, Bailek, Nadjem
Accurate solar energy output prediction is key for integrating renewables into grids, maintaining stability, and improving energy management. However, standard error metrics such as Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Skill Scores (SS) fail to capture the multidimensional nature of solar irradiance forecasting. These metrics lack sensitivity to forecastability, rely on arbitrary baselines (e.g., clear-sky models), and are poorly suited for operational use. To address this, we introduce the NICEk framework (Normalized Informed Comparison of Errors, with k = 1, 2, 3, Sigma), offering a robust and interpretable evaluation of forecasting models. Each NICEk score corresponds to an Lk norm: NICE1 targets average errors, NICE2 emphasizes large deviations, NICE3 highlights outliers, and NICESigma combines all. Using Monte Carlo simulations and data from 68 stations in the Spanish SIAR network, we evaluated methods including autoregressive models, extreme learning, and smart persistence. Theoretical and empirical results align when assumptions hold (e.g., R^2 ~ 1.0 for NICE2). Most importantly, NICESigma consistently shows higher discriminative power (p < 0.05), outperforming traditional metrics (p > 0.05). The NICEk metrics exhibit stronger statistical significance (e.g., p-values from 10^-6 to 0.004 across horizons) and greater generalizability. They offer a unified and operational alternative to standard error metrics in deterministic solar forecasting.
- Europe > Portugal > Coimbra > Coimbra (0.04)
- Africa > Middle East > Algeria > Adrar Province > Adrar (0.04)
- Europe > Serbia > Šumadija and Western Serbia > Šumadija District > Kragujevac (0.04)
- (7 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Energy > Renewable > Solar (1.00)
- Energy > Power Industry (1.00)
RisingBALLER: A player is a token, a match is a sentence, A path towards a foundational model for football players data analytics
In this paper, I introduce RisingBALLER, the first publicly available approach that leverages a transformer model trained on football match data to learn matchspecific player representations. Drawing inspiration from advances in language modeling, RisingBALLER treats each football match as a unique sequence in which players serve as tokens, with their embeddings shaped by the specific context of the match. Through the use of masked player prediction (MPP) as a pre-training task, RisingBALLER learns foundational features for football player representations, similar to how language models learn semantic features for text representations. As a downstream task, I introduce next match statistics prediction (NMSP) to showcase the effectiveness of the learned player embeddings. The NMSP model surpasses a strong baseline commonly used for performance forecasting within the community. Furthermore, I conduct an in-depth analysis to demonstrate how RisingBALLER's learned embeddings can be used in various football analytics tasks, such as producing meaningful positional features that capture the essence and variety of player roles beyond rigid x,y coordinates, team cohesion estimation, and similar player retrieval for more effective data-driven scouting. More than a simple machine learning model, RisingBALLER is a comprehensive framework designed to transform football data analytics by learning high-level foundational features for players, taking into account the context of each match. It offers a deeper understanding of football players beyond individual statistics. In recent years, the field of machine learning has been revolutionized by the introduction of the transformer architecture [1], which initially gained prominence in natural language processing (NLP) with models like BERT [2], RoBERTa [3], and more recently, the widespread use of large language models (LLMs). These models, often trained on seemingly simple tasks such as next token prediction or masked token prediction, have demonstrated remarkable performance in learning high-level features that effectively represent each word and model language intricately. They are capable of learning nuanced representations of the multiple meanings a word can have depending on its context.
- Europe > Spain > Galicia > Madrid (0.05)
- Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
- Europe > Germany > Rheinland-Pfalz > Mainz (0.04)
- (13 more...)
- Leisure & Entertainment > Sports > Soccer (1.00)
- Leisure & Entertainment > Sports > Football (1.00)
Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness
Large language models (LLMs), such as ChatGPT, have rapidly penetrated into people's work and daily lives over the past few years, due to their extraordinary conversational skills and intelligence. ChatGPT has become the fastest-growing software in terms of user numbers in human history and become an important foundational model for the next generation of artificial intelligence applications. However, the generations of LLMs are not entirely reliable, often producing content with factual errors, biases, and toxicity. Given their vast number of users and wide range of application scenarios, these unreliable responses can lead to many serious negative impacts. This thesis introduces the exploratory works in the field of language model reliability during the PhD study, focusing on the correctness, non-toxicity, and fairness of LLMs from both software testing and natural language processing perspectives. First, to measure the correctness of LLMs, we introduce two testing frameworks, FactChecker and LogicAsker, to evaluate factual knowledge and logical reasoning accuracy, respectively. Second, for the non-toxicity of LLMs, we introduce two works for red-teaming LLMs. Third, to evaluate the fairness of LLMs, we introduce two evaluation frameworks, BiasAsker and XCulturalBench, to measure the social bias and cultural bias of LLMs, respectively.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
- Europe > United Kingdom (0.13)
- Asia > Russia (0.13)
- (26 more...)
- Research Report > New Finding (1.00)
- Questionnaire & Opinion Survey (1.00)
- Overview (1.00)
- Workflow (0.92)
- Media (1.00)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- (6 more...)
Know When To Stop: A Study of Semantic Drift in Text Generation
Spataru, Ava, Hambro, Eric, Voita, Elena, Cancedda, Nicola
In this work, we explicitly show that modern LLMs tend to generate correct facts first, then "drift away" and generate incorrect facts later: this was occasionally observed but never properly measured. We develop a semantic drift score that measures the degree of separation between correct and incorrect facts in generated texts and confirm our hypothesis when generating Wikipedia-style biographies. This correct-then-incorrect generation pattern suggests that factual accuracy can be improved by knowing when to stop generation. Therefore, we explore the trade-off between information quantity and factual accuracy for several early stopping methods and manage to improve factuality by a large margin. We further show that reranking with semantic similarity can further improve these results, both compared to the baseline and when combined with early stopping. Finally, we try calling external API to bring the model back to the right generation path, but do not get positive results. Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.
- North America > United States > California (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > France > Corsica > Ajaccio (0.04)
- (12 more...)
- Media (0.68)
- Leisure & Entertainment > Sports > Rugby > Rugby League (0.46)
The Earth is Flat? Unveiling Factual Errors in Large Language Models
Wang, Wenxuan, Shi, Juluan, Tu, Zhaopeng, Yuan, Youliang, Huang, Jen-tse, Jiao, Wenxiang, Lyu, Michael R.
Large Language Models (LLMs) like ChatGPT are foundational in various applications due to their extensive knowledge from pre-training and fine-tuning. Despite this, they are prone to generating factual and commonsense errors, raising concerns in critical areas like healthcare, journalism, and education to mislead users. Current methods for evaluating LLMs' veracity are limited by test data leakage or the need for extensive human labor, hindering efficient and accurate error detection. To tackle this problem, we introduce a novel, automatic testing framework, FactChecker, aimed at uncovering factual inaccuracies in LLMs. This framework involves three main steps: First, it constructs a factual knowledge graph by retrieving fact triplets from a large-scale knowledge database. Then, leveraging the knowledge graph, FactChecker employs a rule-based approach to generates three types of questions (Yes-No, Multiple-Choice, and WH questions) that involve single-hop and multi-hop relations, along with correct answers. Lastly, it assesses the LLMs' responses for accuracy using tailored matching strategies for each question type. Our extensive tests on six prominent LLMs, including text-davinci-002, text-davinci-003, ChatGPT~(gpt-3.5-turbo, gpt-4), Vicuna, and LLaMA-2, reveal that FactChecker can trigger factual errors in up to 45\% of questions in these models. Moreover, we demonstrate that FactChecker's test cases can improve LLMs' factual accuracy through in-context learning and fine-tuning (e.g., llama-2-13b-chat's accuracy increase from 35.3\% to 68.5\%). We are making all code, data, and results available for future research endeavors.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Russia (0.14)
- North America > United States > District of Columbia > Washington (0.05)
- (12 more...)
- Information Technology (1.00)
- Health & Medicine (1.00)
- Media (0.87)
- Government > Regional Government > North America Government > United States Government (0.68)
Modelling and Detection of Driver's Fatigue using Ontology
Lambert, Alexandre, Hina, Manolo Dulva, Barth, Celine, Soukane, Assia, Ramdane-Cherif, Amar
Road accidents have become the eight leading cause of death all over the world. Lots of these accidents are due to a driver's inattention or lack of focus, due to fatigue. Various factors cause driver's fatigue. This paper considers all the measureable data that manifest driver's fatigue, namely those manifested in the vehicle measureable data while driving as well as the driver's physical and physiological data. Each of the three main factors are further subdivided into smaller details. For example, the vehicle's data is composed of the values obtained from the steering wheel's angle, yaw angle, the position on the lane, and the speed and acceleration of the vehicle while moving. Ontological knowledge and rules for driver fatigue detection are to be integrated into an intelligent system so that on the first sign of dangerous level of fatigue is detected, a warning notification is sent to the driver. This work is intended to contribute to safe road driving.
- Europe > Latvia > Riga Municipality > Riga (0.04)
- North America > United States > Virginia (0.04)
- North America > United States > Montana (0.04)
- (11 more...)
- Transportation > Ground > Road (1.00)
- Health & Medicine > Therapeutic Area (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.93)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.86)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)
Solar radiation forecasting using ad-hoc time series preprocessing and neural networks
Paoli, Christophe, Voyant, Cyril, Muselli, Marc, Nivet, Marie-Laure
In this paper, we present an application of neural networks in the renewable energy domain. We have developed a methodology for the daily prediction of global solar radiation on a horizontal surface. We use an ad-hoc time series preprocessing and a Multi-Layer Perceptron (MLP) in order to predict solar radiation at daily horizon. First results are promising with nRMSE < 21% and RMSE < 998 Wh/m2. Our optimized MLP presents prediction similar to or even better than conventional methods such as ARIMA techniques, Bayesian inference, Markov chains and k-Nearest-Neighbors approximators. Moreover we found that our data preprocessing approach can reduce significantly forecasting errors.
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.25)
- Europe > France > Corsica > Ajaccio (0.05)
- Europe > United Kingdom > England > Lancashire > Lancaster (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (0.68)
- Research Report > Experimental Study (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)