Drava
Days really are dragging! Length of days on Earth is increasing at an 'unprecedented' rate - and scientists say climate change is to blame
'Comatose' Mojtaba Khamenei'is UNAWARE there is a war on and has no idea he is supreme leader', report says - despite regime issuing his'first statement' FBI storms home of Lebanese-born restaurant worker who drove truck filled with explosives into synagogue and opened fire after his'family were killed in airstrike' Trump slammed after lifting oil sanctions on Russia as gas prices skyrocket: 'It's a betrayal' Alexander brothers' alleged HIGH SCHOOL rape video: Classmates speak out on sickening footage... as creepy unseen photos are exposed Kylie Jenner's total humiliation in Hollywood: Derogatory rumor leaves her boyfriend's peers'laughing at her' behind her back Billy Joel's daughter Alexa Ray gives health update amid his battle with rare brain disorder Concerning whispers inside Trump World that Operation Epic Fury is suddenly at risk... and the critical question that will determine how this ends: MARK HALPERIN Meghan Markle masks up to cheer young patients at Los Angeles children's hospital as she agrees deal to sign her latest documentary Beauty queen slams Trump as she's FIRED by White House: 'I stood by you for 20 years... now, I don't even recognize you' Wall Street issues stark warning that Iran oil attacks could wreck Trump's key election promises Truth behind the massacre of 110 school girls in Iran: How shameful episode sparked a deluge of conspiracy theories and lies... as JAKE WALLIS SIMONS explores what really happened Long hair over 45 is ageing and try-hard. I've finally cut mine off. NFL fans left divided as team replace historic logo with'boring' new design as part of franchise rebrand I worked with Carolyn Bessette. This is the'messy' truth about what she was REALLY like in secret. After she met JFK Jr she tried to hide it... but we all knew the nighttime gossip Trump says US is'totally destroying' Iran as he issues chilling threat of more action coming TODAY The 7 types of'hyperarousal' - so, do you get cold sweats or tingling fingers?
- Asia > Middle East > Iran (0.76)
- North America > United States > California > Los Angeles County > Los Angeles (0.24)
- North America > United States > New York > New York County > New York City (0.24)
- (14 more...)
- Media > Television (1.00)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- (4 more...)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- North America > Canada (0.04)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.14)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- (25 more...)
- Health & Medicine (0.46)
- Transportation (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis
Watanabe, Chihiro, Sun, Jingyu
Large language models (LLMs) are increasingly used as agents to solve complex tasks such as question answering (QA), scientific debate, and software development. A standard evaluation procedure aggregates multiple responses from LLM agents into a single final answer, often via majority voting, and compares it against reference answers. However, this process can obscure the quality and distributional characteristics of the original responses. In this paper, we propose a novel evaluation framework based on the empirical cumulative distribution function (ECDF) of cosine similarities between generated responses and reference answers. This enables a more nuanced assessment of response quality beyond exact match metrics. To analyze the response distributions across different agent configurations, we further introduce a clustering method for ECDFs using their distances and the $k$-medoids algorithm. Our experiments on a QA dataset demonstrate that ECDFs can distinguish between agent settings with similar final accuracies but different quality distributions. The clustering analysis also reveals interpretable group structures in the responses, offering insights into the impact of temperature, persona, and question topics.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.35)
- North America > United States > Michigan (0.04)
- Europe > Sweden > Uppsala County > Uppsala (0.04)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- (4 more...)
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Europe > Slovenia > Drava > Municipality of Maribor > Maribor (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- (3 more...)
- Education (0.92)
- Government > Regional Government > Asia Government > North Korea Government (0.46)
- Government > Regional Government > North America Government > United States Government (0.45)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Vision (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
- Asia > Vietnam (0.04)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- Europe > Switzerland (0.04)
- (2 more...)
- Research Report > Experimental Study (0.93)
- Instructional Material (0.87)
- Health & Medicine > Therapeutic Area (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
On the Convergence of Loss and Uncertainty-based Active Learning Algorithms
We investigate the convergence rates and data sample sizes required for training a machine learning model using a stochastic gradient descent (SGD) algorithm, where data points are sampled based on either their loss value or uncertainty value. These training methods are particularly relevant for active learning and data subset selection problems. For SGD with a constant step size update, we present convergence results for linear classifiers and linearly separable datasets using squared hinge loss and similar training loss functions.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (3 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Asia > Singapore (0.04)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- Europe > Netherlands > North Brabant > Eindhoven (0.04)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.92)
- Information Technology > Security & Privacy (1.00)
- Government > Military (0.68)