AITopics

2211.10015

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > New York (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

#artificialintelligenceNov-16-2022, 05:15:06 GMT

Machine Learning Clustering Algorithms Explanation and Examples

In this Machine Learning article, let's learn about Clustering Algorithms in Machine Learning. Machine Learning problems deal with a great deal of data and depend heavily on the algorithms that are used to train the model. There are various approaches and algorithms to train a machine learning model based on the problem at hand. Supervised and unsupervised learning are the two most prominent of these approaches. An important real-life problem of marketing a product or service to a specific target audience can be easily resolved with the help of a form of unsupervised learning known as Clustering.

algorithm, clustering, machine learning, (11 more...)

#artificialintelligence

Industry: Education (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Dziwulska-Hunek, Agata, Niemczynowicz, Agnieszka, Kycia, Radosław A., Matwijczuk, Arkadiusz, Kornarzyński, Krzysztof, Stadnik, Joanna, Szymanek, Mariusz

Stimulation of soy seeds using environmentally friendly magnetic and electric fields

arXiv.org Artificial IntelligenceNov-16-2022

The study analyzes the impact of constant and alternating magnetic fields and alternating electric fields on various growth parameters of soy plants: the germination energy and capacity, plants emergence and number, the Yield(II) of the fresh mass of seedlings, protein content, and photosynthetic parameters. Four cultivars were used: MAVKA, MERLIN, VIOLETTA, and ANUSZKA. Moreover, the advanced Machine Learning processing pipeline was proposed to distinguish the impact of physical factors on photosynthetic parameters. It is possible to distinguish exposition on different physical factors for the first three cultivars; therefore, it indicates that the EM factors have some observable effect on soy plants. Moreover, some influence of physical factors on growth parameters was observed. The use of ELM (Electromagnetic) fields had a positive impact on the germination rate in Merlin plants. The highest values were recorded for the constant magnetic field (CMF) - Merlin, and the lowest for the alternating electric field (AEF) - Violetta. An increase in terms of emergence and number of plants after seed stimulation was observed for the Mavka cultivar, except for the AEF treatment (number of plants after 30 days) (...)

artificial intelligence, data mining, machine learning, (19 more...)

2211.0924

Country:

Europe > Poland > Lesser Poland Province > Kraków (0.14)
Europe > Poland > Lublin Province > Lublin (0.05)
South America > Brazil (0.04)
(7 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Materials > Chemicals (1.00)
Food & Agriculture > Agriculture (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

arXiv.org Artificial IntelligenceNov-16-2022

Auditing Algorithmic Fairness in Machine Learning for Health with Severity-Based LOGAN

Ovalle, Anaelia, Dev, Sunipa, Zhao, Jieyu, Sarrafzadeh, Majid, Chang, Kai-Wei

Auditing machine learning-based (ML) healthcare tools for bias is critical to preventing patient harm, especially in communities that disproportionately face health inequities. General frameworks are becoming increasingly available to measure ML fairness gaps between groups. However, ML for health (ML4H) auditing principles call for a contextual, patient-centered approach to model assessment. Therefore, ML auditing tools must be (1) better aligned with ML4H auditing principles and (2) able to illuminate and characterize communities vulnerable to the most harm. To address this gap, we propose supplementing ML4H auditing frameworks with SLOGAN (patient Severity-based LOcal Group biAs detectioN), an automatic tool for capturing local biases in a clinical prediction task. SLOGAN adapts an existing tool, LOGAN (LOcal Group biAs detectioN), by contextualizing group bias detection in patient illness severity and past medical history. We investigate and compare SLOGAN's bias detection capabilities to LOGAN and other clustering techniques across patient subgroups in the MIMIC-III dataset. On average, SLOGAN identifies larger fairness disparities in over 75% of patient groups than LOGAN while maintaining clustering quality. Furthermore, in a diabetes case study, health disparity literature corroborates the characterizations of the most biased clusters identified by SLOGAN. Our results contribute to the broader discussion of how machine learning biases may perpetuate existing healthcare disparities.

artificial intelligence, machine learning, slogan, (15 more...)

2211.08742

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England (0.04)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.48)

Industry:

Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.74)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.34)

Isufi, Elvin, Gama, Fernando, Shuman, David I., Segarra, Santiago

Graph Filters for Signal Processing and Machine Learning on Graphs

arXiv.org Artificial IntelligenceNov-16-2022

Filters are fundamental in extracting information from data. For time series and image data that reside on Euclidean domains, filters are the crux of many signal processing and machine learning techniques, including convolutional neural networks. Increasingly, modern data also reside on networks and other irregular domains whose structure is better captured by a graph. To process and learn from such data, graph filters account for the structure of the underlying data domain. In this article, we provide a comprehensive overview of graph filters, including the different filtering categories, design strategies for each type, and trade-offs between different types of graph filters. We discuss how to extend graph filters into filter banks and graph neural networks to enhance the representational power; that is, to model a broader variety of signal classes, data patterns, and relationships. We also showcase the fundamental role of graph filters in signal processing and machine learning applications. Our aim is that this article serves the dual purpose of providing a unifying framework for both beginner and experienced researchers, as well as a common understanding that promotes collaborations between signal processing, machine learning, and application domains.

artificial intelligence, graph, machine learning, (18 more...)

2211.08854

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Netherlands > South Holland > Delft (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(10 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Information Technology (0.92)
Health & Medicine > Health Care Technology (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.67)
Telecommunications (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.45)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

#artificialintelligenceNov-15-2022, 20:23:54 GMT

Transfer Learning

Machine Learning (ML) involves data analysis and enables the system to improve and learn from experience without explicit programming required constantly. There have been many ML approaches that came into existence constantly. Supervised learning was a game-changing approach that was adopted widely across many industries. However, a few limitations of supervised learning can be overcome with the onset of various other approaches. Transfer Learning is a method under research in Machine Learning that stores the knowledge obtained from solving one problem and uses it to solve problems that are different but related to the solved one. Since training a model takes more computational power, time, and data, Transfer Learning helps reduce the same while improving learning accuracy. The target learner learns from the model, which is already trained initially by using the stored knowledge.

knowledge, source domain, target domain, (15 more...)

#artificialintelligence

Genre: Overview (0.34)

Industry: Health & Medicine (0.95)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

Solving clustering as ill-posed problem: experiments with K-Means algorithm

Vergani, Alberto Arturo

In this contribution, the clustering procedure based on K-Means algorithm is studied as an inverse problem, which is a special case of the illposed problems. The attempts to improve the quality of the clustering inverse problem drive to reduce the input data via Principal Component Analysis (PCA). Since there exists a theorem by Ding and He that links the cardinality of the optimal clusters found with K-Means and the cardinality of the selected informative PCA components, the computational experiments tested the theorem between two quantitative features selection methods: Kaiser criteria (based on imperative decision) versus Wishart criteria (based on random matrix theory). The results suggested that PCA reduction with features selection by Wishart criteria leads to a low matrix condition number and satisfies the relation between clusters and components predicts by the theorem. The data used for the computations are from a neuroscientific repository: it regards healthy and young subjects that performed a task-oriented functional Magnetic Resonance Imaging (fMRI) paradigm.

artificial intelligence, criteria, machine learning, (19 more...)

2211.08302

Country:

North America > United States > California > Alameda County > Oakland (0.04)
Europe > Russia (0.04)
Europe > Italy (0.04)
Asia > Russia (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.87)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Kang, Woo Seok, Kim, Eunchan, Heo, Wookjae

The Association Between SOC and Land Prices Considering Spatial Heterogeneity Based on Finite Mixture Modeling

An understanding of how Social Overhead Capital (SOC) is associated with the land value of the local community is important for effective urban planning. However, even within a district, there are multiple sections used for different purposes; the term for this is spatial heterogeneity. The spatial heterogeneity issue has to be considered when attempting to comprehend land prices. If there is spatial heterogeneity within a district, land prices can be managed by adopting the spatial clustering method. In this study, spatial attributes including SOC, socio-demographic features, and spatial information in a specific district are analyzed with Finite Mixture Modeling (FMM) in order to find (a) the optimal number of clusters and (b) the association among SOCs, socio-demographic features, and land prices. FMM is a tool used to find clusters and the attributes' coefficients simultaneously. Using the FMM method, the results show that four clusters exist in one district and the four clusters have different associations among SOCs, demographic features, and land prices. Policymakers and managerial administration need to look for information to make policy about land prices. The current study finds the consideration of closeness to SOC to be a significant factor on land prices and suggests the potential policy direction related to SOC.

artificial intelligence, machine learning, spatial reasoning, (16 more...)

doi: 10.15793/kspr.2022.114..004

2211.08566

Country:

Asia > South Korea > Seoul > Seoul (0.05)
North America > United States > New York (0.04)
North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government (1.00)
Banking & Finance (0.93)
Education > Educational Setting > K-12 Education (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.89)

Silva, Miguel G., Henriques, Rui, Madeira, Sara C.

User-Specific Bicluster-based Collaborative Filtering: Handling Preference Locality, Sparsity and Subjectivity

As an attempt to cope with massive range of options, there has been large academic and industry interest in automatically recommending items to individuals since last century. Spotify, Amazon, Netflix, and Facebook are some popular platforms that actively use recommender systems [13]. From e-commerce to online advertisement, these systems are unavoidable in our daily online journeys to suggest items in a personalized way. Collaborative Filtering (CF) approaches, firstly proposed by [19], are currently seen as the widest implemented and most mature of the technologies to build recommender systems. Given a set of observed item ratings, CF aims at estimating unknown preferences based on the assumption that users with similar preferences in the past will yield similar preferences in the future. Despite the role of Collaborative Filtering, significant challenges limit its effectiveness, including the diversity and locality of user preferences, the structural sparsity of user-item ratings, the subjectivity of rating scales, and the increasingly large user and item bases [13, 49]. To address the diversity of user profiles, reduce the dimensionality and minimize rating sparsity, matrix factorization and clustering approaches have been combined within CF approaches for two decades [13]. However, traditional clustering techniques are typically applied to either group users or items separately. In real-world CF scenarios, the preferences of a subset of users is frequently only significantly correlated on a subset of the overall items, and vice versa [47].

artificial intelligence, machine learning, social media, (15 more...)

2211.08366

Country:

Europe > Portugal > Lisbon > Lisbon (0.14)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
South America > Brazil > Bahia > Salvador (0.04)
(14 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Information Technology > Services (0.88)
Media (0.74)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Chourasia, Prakash, Ali, Sarwan, Ciccolella, Simone, Della Vedova, Gianluca, Patterson, Murray

Reads2Vec: Efficient Embedding of Raw High-Throughput Sequencing Reads Data

The massive amount of genomic data appearing for SARS-CoV-2 since the beginning of the COVID-19 pandemic has challenged traditional methods for studying its dynamics. As a result, new methods such as Pangolin, which can scale to the millions of samples of SARS-CoV-2 currently available, have appeared. Such a tool is tailored to take as input assembled, aligned and curated full-length sequences, such as those found in the GISAID database. As high-throughput sequencing technologies continue to advance, such assembly, alignment and curation may become a bottleneck, creating a need for methods which can process raw sequencing reads directly. In this paper, we propose Reads2Vec, an alignment-free embedding approach that can generate a fixed-length feature vector representation directly from the raw sequencing reads without requiring assembly. Furthermore, since such an embedding is a numerical representation, it may be applied to highly optimized classification and clustering algorithms. Experiments on simulated data show that our proposed embedding obtains better classification results and better clustering properties contrary to existing alignment-free baselines. In a study on real data, we show that alignment-free embeddings have better clustering properties than the Pangolin tool and that the spike region of the SARS-CoV-2 genome heavily informs the alignment-free clusterings, which is consistent with current biological knowledge of SARS-CoV-2.

artificial intelligence, data mining, machine learning, (18 more...)

2211.08267

Country:

Asia > India (0.04)
South America > Brazil (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(7 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)