AITopics

2211.1701

Country:

Asia (1.00)
North America > Canada (0.53)

Genre: Research Report > Promising Solution (0.48)

Industry:

Energy > Oil & Gas (1.00)
Energy > Coal (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Yoo, Jay J., Namdar, Khashayar, McIntosh, Chris, Khalvati, Farzad, Rogalla, Patrik

A Comprehensive Study of Radiomics-based Machine Learning for Fibrosis Detection

Objectives: Early detection of liver fibrosis can help cure the disease or prevent disease progression. We perform a comprehensive study of machine learning-based fibrosis detection in CT images using radiomic features to develop a non-invasive approach to fibrosis detection. Methods: Two sets of radiomic features were extracted from spherical ROIs in CT images of 182 patients who underwent simultaneous liver biopsy and CT examinations, one set corresponding to biopsy locations and another distant from biopsy locations. Combinations of contrast, normalization, machine learning model, feature selection method, bin width, and kernel radius were investigated, each of which were trained and evaluated 100 times with randomized development and test cohorts. The best settings were evaluated based on their mean test AUC and the best features were determined based on their frequency among the best settings. Results: Logistic regression models with NC images normalized using Gamma correction with $\gamma = 1.5$ performed best for fibrosis detection. Boruta was the best for radiomic feature selection method. Training a model using these optimal settings and features consisting of first order energy, first order kurtosis, and first order skewness, resulted in a model that achieved mean test AUCs of 0.7549 and 0.7166 on biopsy-based and non-biopsy ROIs respectively, outperforming a baseline and best models found during the initial study. Conclusions: Logistic regression models trained on radiomic features from NC images normalized using Gamma correction with $\gamma = 1.5$ that underwent Boruta feature selection are effective for liver fibrosis detection. Energy, kurtosis, and skewness are particularly effective features for fibrosis detection.

artificial intelligence, fibrosis detection, machine learning, (15 more...)

2211.14396

Country:

North America > Canada > Ontario > Toronto (0.16)
North America > United States (0.14)
Asia > Japan (0.04)
Asia > British Indian Ocean Territory > Diego Garcia (0.04)

Genre:

Research Report > Experimental Study (0.89)
Research Report > New Finding (0.70)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Hepatology (0.96)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Loog, Marco, Viering, Tom

A Survey of Learning Curves with Bad Behavior: or How More Data Need Not Lead to Better Performance

Plotting a learner's generalization performance against the training set size results in a so-called learning curve. This tool, providing insight in the behavior of the learner, is also practically valuable for model selection, predicting the effect of more training data, and reducing the computational complexity of training. We set out to make the (ideal) learning curve concept precise and briefly discuss the aforementioned usages of such curves. The larger part of this survey's focus, however, is on learning curves that show that more data does not necessarily leads to better generalization performance. A result that seems surprising to many researchers in the field of artificial intelligence. We point out the significance of these findings and conclude our survey with an overview and discussion of open problems in this area that warrant further theoretical and empirical investigation.

artificial intelligence, learner, machine learning, (17 more...)

2211.14061

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Netherlands > South Holland > Delft (0.04)
(4 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Houssiau, Florimond, Schellekens, Vincent, Chatalic, Antoine, Annamraju, Shreyas Kumar, de Montjoye, Yves-Alexandre

M$^2$M: A general method to perform various data analysis tasks from a differentially private sketch

Differential privacy is the standard privacy definition for performing analyses over sensitive data. Yet, its privacy budget bounds the number of tasks an analyst can perform with reasonable accuracy, which makes it challenging to deploy in practice. This can be alleviated by private sketching, where the dataset is compressed into a single noisy sketch vector which can be shared with the analysts and used to perform arbitrarily many analyses. However, the algorithms to perform specific tasks from sketches must be developed on a case-by-case basis, which is a major impediment to their use. In this paper, we introduce the generic moment-to-moment (M$^2$M) method to perform a wide range of data exploration tasks from a single private sketch. Among other things, this method can be used to estimate empirical moments of attributes, the covariance matrix, counting queries (including histograms), and regression models. Our method treats the sketching mechanism as a black-box operation, and can thus be applied to a wide variety of sketches from the literature, widening their ranges of applications without further engineering or privacy loss, and removing some of the technical barriers to the wider adoption of sketches for data exploration under differential privacy. We validate our method with data exploration tasks on artificial and real-world data, and show that it can be used to reliably estimate statistics and train classification models from private sketches.

artificial intelligence, machine learning, sketch, (17 more...)

2211.14062

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Liguria > Genoa (0.04)

Genre: Research Report > New Finding (0.47)

Industry: Information Technology > Security & Privacy (0.88)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)

Lee, Kyeongwon, Lee, Jaeyong

Asymptotic Properties for Bayesian Neural Network in Besov Space

Neural networks have shown great predictive power when dealing with various unstructured data such as images and natural languages. The Bayesian neural network captures the uncertainty of prediction by putting a prior distribution for the parameter of the model and computing the posterior distribution. In this paper, we show that the Bayesian neural network using spike-and-slab prior has consistency with nearly minimax convergence rate when the true regression function is in the Besov space. Even when the smoothness of the regression function is unknown the same posterior convergence rate holds and thus the spike-and-slab prior is adaptive to the smoothness of the regression function. We also consider the shrinkage prior, which is more feasible than other priors, and show that it has the same convergence rate. In other words, we propose a practical Bayesian neural network with guaranteed asymptotic properties.

artificial intelligence, bayesian inference, machine learning, (20 more...)

2206.00241

Country:

Asia > South Korea > Seoul > Seoul (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.75)

Exposure and Emergence in Usage-Based Grammar: Computational Experiments in 35 Languages

Dunn, Jonathan

This paper uses computational experiments to explore the role of exposure in the emergence of construction grammars. While usage-based grammars are hypothesized to depend on a learner's exposure to actual language use, the mechanisms of such exposure have only been studied in a few constructions in isolation. This paper experiments with (i) the growth rate of the constructicon, (ii) the convergence rate of grammars exposed to independent registers, and (iii) the rate at which constructions are forgotten when they have not been recently observed. These experiments show that the lexicon grows more quickly than the grammar and that the growth rate of the grammar is not dependent on the growth rate of the lexicon. At the same time, register-specific grammars converge onto more similar constructions as the amount of exposure increases. This means that the influence of specific registers becomes less important as exposure increases. Finally, the rate at which constructions are forgotten when they have not been recently observed mirrors the growth rate of the constructicon. This paper thus presents a computational model of usage-based grammar that includes both the emergence and the unentrenchment of constructions.

artificial intelligence, machine learning, natural language, (19 more...)

doi: 10.18710/CES0L8

2211.1416

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Pacific Ocean (0.04)
Oceania > New Zealand > South Island > Canterbury Region > Christchurch (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.92)
Research Report > Experimental Study (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(2 more...)

#artificialintelligenceNov-24-2022, 15:40:17 GMT

How to Choose the Best Machine Learning Technique: Comparison Table - DataScienceCentral.com

While the comparison table in this article applies to a specific problem in FinTech, the conclusions are consistent with findings in other frameworks. There is no single method that outperforms all the other ones, for obvious reasons. To be the global winner means winning on all potential datasets. The immense majority of datasets are pure noise, for which performance is meaningless, and the winner randomly changes from one dataset to another. Yet, in real-life applications, each dataset comes with patterns or meaningful signal.

best machine learning technique, logistic regression, regression, (14 more...)

#artificialintelligence

Country:

North America > United States > Washington (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)

Industry: Banking & Finance (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

#artificialintelligenceNov-24-2022, 11:30:53 GMT

Make Data Work for You with These Top Data Mining Tools and Techniques

With everything going computerized and digital, the amount of data generated by us is humongous. Organizations collectively spend billions of dollars to just store and analyze this data. They make efforts to drive valuable business insights from this data using data mining. Data Mining is the process of discovering hidden patterns in a pile of big data. Business executives use these emerging patterns to make informed business strategy decisions.

data science team, data scientist, platform, (6 more...)

#artificialintelligence

Industry: Information Technology > Services (0.48)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.31)

arXiv.org Artificial IntelligenceNov-24-2022

Statistical Optimality of Divide and Conquer Kernel-based Functional Linear Regression

Liu, Jiading, Shi, Lei

Previous analysis of regularized functional linear regression in a reproducing kernel Hilbert space (RKHS) typically requires the target function to be contained in this kernel space. This paper studies the convergence performance of divide-and-conquer estimators in the scenario that the target function does not necessarily reside in the underlying RKHS. As a decomposition-based scalable approach, the divide-and-conquer estimators of functional linear regression can substantially reduce the algorithmic complexities in time and memory. We develop an integral operator approach to establish sharp finite sample upper bounds for prediction with divide-and-conquer estimators under various regularity conditions of explanatory variables and target function. We also prove the asymptotic optimality of the derived rates by building the mini-max lower bounds. Finally, we consider the convergence of noiseless estimators and show that the rates can be arbitrarily fast under mild conditions.

artificial intelligence, inequality, machine learning, (17 more...)

2211.10968

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceNov-23-2022, 19:47:46 GMT

deeplearning

In this video, you will learn from basics to advanced machine learning concepts from Great Learning's top faculties, including professor Mukesh Rao, Bharani Akella & many other leading industry experts. If you are an enthusiast who wants to start with machine learning from scratch, this machine learning beginner video is the best to start with. Agenda: • Python for Machine Learning • Role of Statistics in Machine Learning • Introduction to Machine Learning and its types • How does a Machine learning model learn? Topics Covered: 00:01:09 – What Is Machine learning?

artificial intelligence, introduction, machine learning, (12 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)