
Comparison of Unsupervised Metrics for Evaluating Judicial Decision Extraction

Litvak, Ivan Leonidovich, Kostin, Anton, Lashkin, Fedor, Maksiyan, Tatiana, Lagutin, Sergey

arXiv.org Artificial Intelligence

The integration of artificial intelligence (AI) into the legal domain has revolutionized judicial processes, enabling tasks such as legal judgment prediction (LJP), case summarization, precedent retrieval, and automated legal research. Text extraction, the process of isolating seven semantically meaningful segments--referred to as blocks--from unstructured judicial decisions, is a cornerstone of these applications. These blocks include plaintiff demands, plaintiff arguments, defendant arguments, court evaluation of evidence, judicial reasoning steps, applicable legal norms, and court decision. Accurate extraction is critical, as errors can lead to misinterpretations of case facts, biased predictions, or inefficiencies in judicial workflows, potentially undermining justice delivery in high-stakes contexts. Evaluation metrics are essential for quantifying extraction quality, enabling iterative model improvements and ensuring reliability. Traditional metrics rely on annotated ground truth, which is resource-intensive to produce, particularly for legal texts characterized by verbose narratives, domain-specific terminology, and jurisdiction-specific nuances. The scarcity of annotated legal corpora has driven the development of unsupervised metrics that leverage intrinsic document properties, such as term frequencies, semantic coherence, and structural patterns. These metrics must capture surface-level accuracy, semantic fidelity, logical structure, and legal-specific elements like citations and temporal consistency, while addressing ethical concerns such as fairness and neutrality in AI-driven legal systems [1, 2].
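One simple instance of the unsupervised, term-frequency-based metrics discussed above is cosine similarity between term-frequency vectors of an extracted block and its source decision. The sketch below is purely illustrative (it is not one of the paper's metrics, and real legal text would need tokenization beyond whitespace splitting):

```python
import math
from collections import Counter

def tf_cosine(extracted: str, source: str) -> float:
    """Cosine similarity between term-frequency vectors of two texts.

    A crude unsupervised signal: an extracted block whose vocabulary
    diverges sharply from its source document is suspect.
    """
    a, b = Counter(extracted.lower().split()), Counter(source.lower().split())
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0
```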



WENDy for Nonlinear-in-Parameter ODEs

Rummel, Nic, Messenger, Daniel A., Becker, Stephen, Dukic, Vanja, Bortz, David M.

arXiv.org Machine Learning

The Weak-form Estimation of Non-linear Dynamics (WENDy) algorithm is extended to accommodate systems of ordinary differential equations that are nonlinear-in-parameters (NiP). The extension rests on derived analytic expressions for a likelihood function together with its gradient and Hessian. WENDy uses these to approximate a maximum likelihood estimator via optimization routines suited to non-convex problems. The resulting parameter estimation algorithm has better accuracy, a substantially larger domain of convergence, and is often orders of magnitude faster than the conventional output-error least squares method (based on forward solvers). The algorithm is efficiently implemented in Julia as WENDy.jl. We demonstrate its ability to accommodate weak-form optimization for both additive normal and multiplicative log-normal noise, and present results on a suite of benchmark systems of ordinary differential equations. To demonstrate the practical benefits of our approach, we present extensive comparisons between our method and output-error methods in terms of accuracy, precision, bias, and coverage.
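The weak-form idea underlying WENDy can be illustrated as follows: for an ODE $\dot{u} = f(u;\theta)$, testing against a compactly supported function $\phi$ and integrating by parts moves the derivative off the (noisy) state $u$. This is a sketch of the general principle, not the paper's exact likelihood construction:

```latex
% Weak form of \dot{u}(t) = f(u(t);\theta) against a test function \phi
% with \phi(0) = \phi(T) = 0, so boundary terms vanish:
\int_{0}^{T} \phi(t)\,\dot{u}(t)\,dt
  \;=\; -\int_{0}^{T} \dot{\phi}(t)\,u(t)\,dt
  \;=\; \int_{0}^{T} \phi(t)\, f\bigl(u(t);\theta\bigr)\,dt .
```

Because the middle and right integrals involve $u$ but not $\dot{u}$, no derivatives of noisy data are required; parameters $\theta$ are estimated by minimizing the discrepancy between these two integrals evaluated on the data.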


A Language Model-Guided Framework for Mining Time Series with Distributional Shifts

Zhu, Haibei, El-Laham, Yousef, Fons, Elizabeth, Vyetrenko, Svitlana

arXiv.org Artificial Intelligence

Effective use of time series data is often constrained by the scarcity of data reflecting complex dynamics, especially under distributional shifts. Existing datasets may not encompass the full range of statistical properties required for robust and comprehensive analysis, and privacy concerns can further limit their accessibility in domains such as finance and healthcare. This paper presents an approach that utilizes large language models and data source interfaces to explore and collect time series datasets. Although obtained from external sources, the collected data share critical statistical properties with the primary time series datasets, making it possible to model and adapt to various scenarios. This method enlarges the available data when the original data are limited or lack essential properties, and suggests that collected datasets can effectively supplement existing ones, especially when the data distribution changes. We demonstrate the effectiveness of the collected datasets through practical examples and show that time series forecasting foundation models fine-tuned on these datasets achieve performance comparable to those models without fine-tuning.
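A minimal way to screen whether a collected series shares the distribution of a primary dataset is a two-sample Kolmogorov-Smirnov statistic. The sketch below is an illustrative assumption about how such screening might be done, not the paper's selection procedure:

```python
import bisect

def ks_statistic(x, y):
    """Two-sample Kolmogorov-Smirnov statistic: the maximum gap between
    the empirical CDFs of samples x and y. Small values suggest the two
    samples could plausibly come from the same distribution."""
    xs, ys = sorted(x), sorted(y)

    def ecdf(s, v):
        # Fraction of sample s with values <= v.
        return bisect.bisect_right(s, v) / len(s)

    return max(abs(ecdf(xs, v) - ecdf(ys, v)) for v in sorted(set(xs + ys)))
```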


Federated learning model for predicting major postoperative complications

Park, Yonggi, Ren, Yuanfang, Shickel, Benjamin, Guan, Ziyuan, Patela, Ayush, Ma, Yingbo, Hu, Zhenhong, Loftus, Tyler J., Rashidi, Parisa, Ozrazgat-Baslanti, Tezcan, Bihorac, Azra

arXiv.org Artificial Intelligence

Background: The accurate prediction of postoperative complication risk using Electronic Health Records (EHR) and artificial intelligence shows great potential. Training a robust artificial intelligence model typically requires large-scale and diverse datasets, yet in practice, collecting medical data often runs into privacy-protection challenges. Methods: This retrospective cohort study includes adult patients who were admitted to UFH Gainesville (GNV) (n = 79,850) and Jacksonville (JAX) (n = 28,636) for any type of inpatient surgical procedure. Using perioperative and intraoperative features, we developed federated learning models to predict nine major postoperative complications (e.g., prolonged intensive care unit stay and mechanical ventilation). We compared federated learning models with local learning models trained on a single site and central learning models trained on the pooled dataset from the two centers. Results: Our federated learning models achieved area under the receiver operating characteristic curve (AUROC) values ranging from 0.81 for wound complications to 0.92 for prolonged ICU stay at the UFH GNV center. At the UFH JAX center, these values ranged from 0.73-0.74 for wound complications to 0.92-0.93 for hospital mortality. Federated learning models achieved AUROC performance comparable to central learning models, except for prolonged ICU stay, where the performance of federated learning models was slightly higher than central learning models at the UFH GNV center, but slightly lower at the UFH JAX center. In addition, our federated learning model obtained performance comparable to the best local learning model at each center, demonstrating strong generalizability. Conclusion: Federated learning is shown to be a useful tool for training robust and generalizable models from large-scale data across multiple institutions where data protection barriers are high.
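The abstract does not state the aggregation rule used across sites. The standard FedAvg scheme, sketched below as one common choice (not necessarily the paper's), weights each site's model parameters by its sample count, so that no raw patient records leave the sites:

```python
def fed_avg(site_weights, site_sizes):
    """Federated averaging: combine per-site parameter vectors into a
    global model, weighting each site by its number of samples."""
    total = sum(site_sizes)
    n_params = len(site_weights[0])
    return [
        sum(w[i] * n for w, n in zip(site_weights, site_sizes)) / total
        for i in range(n_params)
    ]
```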


Balanced Off-Policy Evaluation for Personalized Pricing

Elmachtoub, Adam N., Gupta, Vishal, Zhao, Yunfan

arXiv.org Artificial Intelligence

We consider a personalized pricing problem in which we have data consisting of feature information, historical pricing decisions, and binary realized demand. The goal is to perform off-policy evaluation for a new personalized pricing policy that maps features to prices. Methods based on inverse propensity weighting (including doubly robust methods) for off-policy evaluation may perform poorly when the logging policy has little exploration or is deterministic, which is common in pricing applications. Building on the balanced policy evaluation framework of Kallus (2018), we propose a new approach tailored to pricing applications. The key idea is to compute an estimate that minimizes the worst-case mean squared error or maximizes a worst-case lower bound on policy performance, where in both cases the worst-case is taken with respect to a set of possible revenue functions. We establish theoretical convergence guarantees and empirically demonstrate the advantage of our approach using a real-world pricing dataset.
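For contrast with the balanced approach, the inverse-propensity-weighted baseline that the paper argues performs poorly under near-deterministic logging can be sketched as follows (the names and signature are illustrative, not from the paper):

```python
def ipw_value(rewards, logged_actions, target_actions, propensities):
    """Inverse propensity weighted estimate of a target policy's value.

    Only samples where the target action matches the logged action
    contribute, each up-weighted by 1 / propensity. When the logging
    policy is near-deterministic, matching samples are rare and their
    propensities tiny, so the estimate has very high variance."""
    n = len(rewards)
    return sum(
        r / p if a == t else 0.0
        for r, a, t, p in zip(rewards, logged_actions, target_actions, propensities)
    ) / n
```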


Improved Predictive Models for Acute Kidney Injury with IDEAs: Intraoperative Data Embedded Analytics

Adhikari, Lasith, Ozrazgat-Baslanti, Tezcan, Thottakkara, Paul, Ebadi, Ashkan, Motaei, Amir, Rashidi, Parisa, Li, Xiaolin, Bihorac, Azra

arXiv.org Machine Learning

Acute kidney injury (AKI) is a common and serious complication after surgery that is associated with morbidity and mortality. The majority of existing perioperative AKI risk score prediction models are limited in their generalizability and do not fully utilize physiological intraoperative time-series data. Thus, there is a need for intelligent, accurate, and robust systems able to leverage information from large-scale data to predict a patient's risk of developing postoperative AKI. A retrospective single-center cohort of 2,911 adult patients who underwent surgery at the University of Florida Health was used for this study. We used machine learning and statistical analysis techniques to develop perioperative models to predict the risk of AKI (risk during the first 3 days, 7 days, and until the discharge day) before and after surgery. In particular, we examined the improvement in risk prediction from incorporating three intraoperative physiologic time series, i.e., mean arterial blood pressure, minimum alveolar concentration, and heart rate. For an individual patient, the preoperative model produces a probabilistic AKI risk score, which is then enriched by integrating intraoperative statistical features through a machine learning stacking approach inside a random forest classifier. We compared the performance of our models based on the area under the receiver operating characteristic curve (AUROC), accuracy, and net reclassification improvement (NRI). The predictive performance of the proposed model is better than that of the preoperative-data-only model: for the AKI-7day outcome, the AUROC was 0.86 (accuracy 0.78) for the proposed model, versus 0.84 (accuracy 0.76) for the preoperative model. Furthermore, with the integration of intraoperative features, we were able to correctly classify patients who had been misclassified by the preoperative model.
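The stacking step can be pictured as concatenating the preoperative risk score with summary statistics of the intraoperative time series into one feature vector for the second-stage classifier. The particular statistics below are illustrative assumptions, not the paper's exact feature set:

```python
from statistics import mean, pvariance

def stacked_features(preop_risk, map_series, hr_series):
    """Build a second-stage feature vector from the preoperative risk
    score plus summary statistics of two intraoperative time series
    (mean arterial pressure and heart rate)."""
    feats = [preop_risk]
    for series in (map_series, hr_series):
        feats += [mean(series), min(series), max(series), pvariance(series)]
    return feats  # fed to a downstream classifier, e.g. a random forest
```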


Efficient coordinate-descent for orthogonal matrices through Givens rotations

Shalit, Uri, Chechik, Gal

arXiv.org Machine Learning

Optimizing over the set of orthogonal matrices is a central component in problems like sparse PCA or tensor decomposition. Unfortunately, such optimization is hard, since simple operations on orthogonal matrices easily break orthogonality, and restoring it usually costs a large amount of computation. Here we propose a framework for optimizing over orthogonal matrices that parallels coordinate descent in Euclidean spaces. It is based on Givens rotations, a fast-to-compute operation that affects only a small number of entries in the learned matrix and preserves orthogonality. We show two applications of this approach: an algorithm for tensor decomposition used in learning mixture models, and an algorithm for sparse PCA. We study the parameter regime in which a Givens rotation approach converges faster and achieves a superior model on a genome-wide, brain-wide mRNA expression dataset.
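A Givens rotation in the (i, j) plane touches only two columns and preserves orthogonality exactly, which is what makes it a natural coordinate-descent step. A minimal sketch (plain lists of rows, not an optimized implementation):

```python
import math

def apply_givens(Q, i, j, theta):
    """Right-multiply matrix Q (a list of rows) by a Givens rotation in
    the (i, j) coordinate plane. Only columns i and j change, and if Q
    was orthogonal it remains orthogonal after the update."""
    c, s = math.cos(theta), math.sin(theta)
    for row in Q:
        qi, qj = row[i], row[j]
        row[i] = c * qi - s * qj
        row[j] = s * qi + c * qj
    return Q
```

A coordinate-descent sweep would pick a plane (i, j), choose the angle theta minimizing the objective along that rotation, and apply the update above, keeping the iterate on the orthogonal group at all times.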


Laplacian Score for Feature Selection

He, Xiaofei, Cai, Deng, Niyogi, Partha

Neural Information Processing Systems

In supervised learning scenarios, feature selection has been studied widely in the literature. Selecting features in unsupervised learning scenarios is a much harder problem, due to the absence of class labels that would guide the search for relevant information. Moreover, almost all previous unsupervised feature selection methods are "wrapper" techniques that require a learning algorithm to evaluate candidate feature subsets. In this paper, we propose a "filter" method for feature selection that is independent of any learning algorithm. Our method can be performed in either a supervised or an unsupervised fashion. The proposed method is based on the observation that, in many real-world classification problems, data from the same class are often close to each other. The importance of a feature is evaluated by its power of locality preservation, measured by its Laplacian Score. We compare our method with data variance (unsupervised) and Fisher score (supervised) on two data sets. Experimental results demonstrate the effectiveness and efficiency of our algorithm.
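Given a precomputed similarity matrix S between samples, the Laplacian Score of a single feature can be sketched as below. This is a simplified pure-Python version; in the paper, S is typically built from a nearest-neighbor graph of the samples:

```python
def laplacian_score(f, S):
    """Laplacian Score of feature vector f given a symmetric similarity
    matrix S over the samples. Smaller scores indicate better locality
    preservation: similar samples get similar feature values."""
    n = len(f)
    d = [sum(S[i]) for i in range(n)]                    # sample degrees
    mu = sum(fi * di for fi, di in zip(f, d)) / sum(d)   # D-weighted mean
    ft = [fi - mu for fi in f]                           # centered feature
    # ft^T L ft = (1/2) * sum_ij S_ij (ft_i - ft_j)^2  for Laplacian L = D - S
    num = sum(S[i][j] * (ft[i] - ft[j]) ** 2
              for i in range(n) for j in range(n)) / 2
    den = sum(di * fi ** 2 for di, fi in zip(d, ft))     # ft^T D ft
    return num / den if den else 0.0
```

Ranking features by ascending score then selects those that vary smoothly over the similarity graph.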