AITopics | independent feature

Collaborating Authors

independent feature

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression

Neural Information Processing SystemsOct-9-2025, 21:56:56 GMT

Kernel ridge regression (KRR) is a central tool in machine learning due to its ability to provide a flexible and efficient framework for capturing intricate patterns within data.

assumption, def, probability, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > North Carolina > Mecklenburg County > Charlotte (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(3 more...)

Genre:

Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Industry:

Government (0.67)
Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)

Add feedback

Dependency-aware synthetic tabular data generation

Umesh, Chaithra, Schultz, Kristian, Mahendra, Manjunath, Bej, Saptarshi, Wolkenhauer, Olaf

arXiv.org Artificial IntelligenceJul-28-2025

Synthetic tabular data is increasingly used in privacy-sensitive domains such as health care, but existing generative models often fail to preserve inter-attribute relationships. In particular, functional dependencies (FDs) and logical dependencies (LDs), which capture deterministic and rule-based associations between features, are rarely or often poorly retained in synthetic datasets. To address this research gap, we propose the Hierarchical Feature Generation Framework (HFGF) for synthetic tabular data generation. We created benchmark datasets with known dependencies to evaluate our proposed HFGF. The framework first generates independent features using any standard generative model, and then reconstructs dependent features based on predefined FD and LD rules. Our experiments on four benchmark datasets with varying sizes, feature imbalance, and dependency complexity demonstrate that HFGF improves the preservation of FDs and LDs across six generative models, including CTGAN, TVAE, and GReaT. Our findings demonstrate that HFGF can significantly enhance the structural fidelity and downstream utility of synthetic tabular data.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.19211

Country: Europe > Germany (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression

Cheng, Tin Sum, Lucchi, Aurelien, Kratsios, Anastasis, Belius, David

arXiv.org Artificial IntelligenceOct-23-2024

This paper conducts a comprehensive study of the learning curves of kernel ridge regression (KRR) under minimal assumptions. Our contributions are three-fold: 1) we analyze the role of key properties of the kernel, such as its spectral eigen-decay, the characteristics of the eigenfunctions, and the smoothness of the kernel; 2) we demonstrate the validity of the Gaussian Equivalent Property (GEP), which states that the generalization performance of KRR remains the same when the whitened features are replaced by standard Gaussian vectors, thereby shedding light on the success of previous analyzes under the Gaussian Design Assumption; 3) we derive novel bounds that improve over existing bounds across a broad range of setting such as (in)dependent feature vectors and various combinations of eigen-decay rates in the over/underparameterized regimes.

artificial intelligence, assumption, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2410.17796

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > North Carolina > Mecklenburg County > Charlotte (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)

Add feedback

EigenNet: A Bayesian hybrid of generative and conditional models for sparse learning Yuan Qi

Neural Information Processing SystemsMar-15-2024, 10:00:18 GMT

For many real-world applications, we often need to select correlated variables-- such as genetic variations and imaging features associated with Alzheimer's disease--in a high dimensional space. The correlation between variables presents a challenge to classical variable selection methods. To address this challenge, the elastic net has been developed and successfully applied to many applications. Despite its great success, the elastic net does not exploit the correlation information embedded in the data to select correlated variables. To overcome this limitation, we present a novel hybrid model, EigenNet, that uses the eigenstructures of data to guide variable selection.

eigennet, eigenvector, elastic net, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)

Add feedback

How to Verify the Assumptions of Linear Regression

#artificialintelligenceAug-2-2022, 09:11:10 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Linear regression is a model that estimates the relationship between independent variables and a dependent variable using a straight line.

assumption, independent feature, multicollinearity, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.82)

Add feedback

All About Decision Tree

#artificialintelligenceApr-4-2022, 16:23:29 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. The decision tree is one of the most powerful and important algorithms present in supervised machine learning.

decision tree, information gain, node, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.79)

Add feedback

How to Use PySpark for Data Processing and Machine Learning

#artificialintelligenceFeb-17-2022, 06:20:47 GMT

PySpark is an interface for Apache Spark in Python. PySpark is often used for large-scale data processing and machine learning. We just released a PySpark crash course on the freeCodeCamp.org Krish is a lead data scientist and he runs a popular YouTube channel. Apache Spark is written in the Scala programming language. To support Python with Spark, the Apache Spark community released a tool called PySpark. PySpark allows people to work with Resilient Distributed Datasets (RDDs) in Python through a library called Py4j. PiSpark is an interface for Apache Spark in Python is often used for large scale data processing and machine learning. Krish knack teaches this course. So we are going to start Apache Spark series. And specifically, if I talk about Spark, we will be focusing on how we can use spark with Python. So we are going to discuss about the library called pi Spark, we will try to understand everything why spark is actually required. And probably will also try to cover a lot of ...

data frame, opération, salary, (14 more...)

#artificialintelligence

Country:

Africa > Sudan (0.04)
North America > Cuba (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.47)

Industry: Information Technology > Software (0.80)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Multiple Linear Regression Using Python and Scikit-learn

#artificialintelligenceAug-14-2021, 09:55:42 GMT

This article was published as a part of the Data Science Blogathon. If you are on the path of learning data science, then you definitely have an understanding of what machine learning is. In today's digital world everyone knows what Machine Learning is because it was a trending digital technology across the world. Every step towards adaptation of the future world leads by this current technology, and this current technology is led by data scientists like you and me . Here we only discuss machine learning, If you don't know what it is, then we take a brief introduction to it: Machine learning is the study of the algorithms of computers, that improve automatically through experience and by the use of data. This is the simple definition of machine learning, and when we go into deep then we find that there are huge numbers of algorithms that are used in model building.

linear regression, multiple linear regression, regression, (12 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.84)

Add feedback

Fully Explained K-Nearest Neighbors with Python

#artificialintelligenceFeb-26-2021, 23:45:16 GMT

Hello Everyone, another article in the series fully explained machine learning algorithms. In this article, we will discuss the k nearest neighbor classification problem. A good article is like a flow of the story and readers get as much information in a small amount of time. So, we will discuss the supervised classification problem learning technique. The main goal is to predict the new data point based on samples near that data point.

algorithm, explained k-nearest neighbor, graph, (10 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (1.00)

Add feedback

Random Forests in Machine Learning

#artificialintelligenceDec-5-2020, 12:55:11 GMT

This article was published as a part of the Data Science Blogathon. Random Forests are always referred to as black-box models. Let's try to crack open it and see what is inside it. Oops!!! Our plane has crashed, but fortunately, we all are safe. We are Data scientists, so we want to open the black box and see what random things have been recorded inside it. Yes, let's come to our topic.

dataset, decision tree, random forest, (13 more...)

#artificialintelligence

Industry: Transportation > Air (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

Add feedback