AITopics

2507.14189

Country: North America > Mexico (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

arXiv.org Artificial IntelligenceJul-31-2024

UnPaSt: unsupervised patient stratification by differentially expressed biclusters in omics data

Hartung, Michael, Maier, Andreas, Delgado-Chaves, Fernando, Burankova, Yuliya, Isaeva, Olga I., Patroni, Fábio Malta de Sá, He, Daniel, Shannon, Casey, Kaufmann, Katharina, Lohmann, Jens, Savchik, Alexey, Hartebrodt, Anne, Chervontseva, Zoe, Firoozbakht, Farzaneh, Probul, Niklas, Zotova, Evgenia, Tsoy, Olga, Blumenthal, David B., Ester, Martin, Laske, Tanja, Baumbach, Jan, Zolotareva, Olga

Most complex diseases, including cancer and non-malignant diseases like asthma, have distinct molecular subtypes that require distinct clinical approaches. However, existing computational patient stratification methods have been benchmarked almost exclusively on cancer omics data and only perform well when mutually exclusive subtypes can be characterized by many biomarkers. Here, we contribute with a massive evaluation attempt, quantitatively exploring the power of 22 unsupervised patient stratification methods using both, simulated and real transcriptome data. From this experience, we developed UnPaSt (https://apps.cosy.bio/unpast/) optimizing unsupervised patient stratification, working even with only a limited number of subtype-predictive biomarkers. We evaluated all 23 methods on real-world breast cancer and asthma transcriptomics data. Although many methods reliably detected major breast cancer subtypes, only few identified Th2-high asthma, and UnPaSt significantly outperformed its closest competitors in both test datasets. Essentially, we showed that UnPaSt can detect many biologically insightful and reproducible patterns in omic datasets.

bicluster, dataset, subtype, (15 more...)

2408.002

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(10 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.57)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Biomedical Informatics > Translational Bioinformatics (0.68)

arXiv.org Artificial IntelligenceApr-17-2024

Improvement in Semantic Address Matching using Natural Language Processing

Gupta, Vansh, Gupta, Mohit, Garg, Jai, Garg, Nitesh

Address matching is an important task for many businesses especially delivery and take out companies which help them to take out a certain address from their data warehouse. Existing solution uses similarity of strings, and edit distance algorithms to find out the similar addresses from the address database, but these algorithms could not work effectively with redundant, unstructured, or incomplete address data. This paper discuss semantic Address matching technique, by which we can find out a particular address from a list of possible addresses. We have also reviewed existing practices and their shortcoming. Semantic address matching is an essentially NLP task in the field of deep learning. Through this technique We have the ability to triumph the drawbacks of existing methods like redundant or abbreviated data problems. The solution uses the OCR on invoices to extract the address and create the data pool of addresses. Then this data is fed to the algorithm BM-25 for scoring the best matching entries. Then to observe the best result, this will pass through BERT for giving the best possible result from the similar queries. Our investigation exhibits that our methodology enormously improves both accuracy and review of cutting-edge technology existing techniques.

algorithm, belleville, category, (13 more...)

doi: 10.1109/INCET51464.2021.9456342

2404.11691

Country:

North America > United States > Nebraska > Douglas County > Omaha (0.05)
North America > United States > Ohio > Lake County > Mentor (0.05)
North America > United States > Mississippi > Hinds County > Jackson (0.05)
(13 more...)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

O'Sullivan, Fintan, Escott, Kirita-Rose, Shaw, Rachael C., Lensen, Andrew

Feature-based Image Matching for Identifying Individual K\=ak\=a

arXiv.org Artificial IntelligenceJan-23-2023

This report investigates an unsupervised, feature-based image matching pipeline for the novel application of identifying individual k\=ak\=a. Applied with a similarity network for clustering, this addresses a weakness of current supervised approaches to identifying individual birds which struggle to handle the introduction of new individuals to the population. Our approach uses object localisation to locate k\=ak\=a within images and then extracts local features that are invariant to rotation and scale. These features are matched between images with nearest neighbour matching techniques and mismatch removal to produce a similarity score for image match comparison. The results show that matches obtained via the image matching pipeline achieve high accuracy of true matches. We conclude that feature-based image matching could be used with a similarity network to provide a viable alternative to existing supervised approaches.

data mining, machine learning, pattern recognition, (22 more...)

2301.06678

Country:

Oceania > New Zealand > North Island > Hawke's Bay (0.04)
North America > Canada > British Columbia (0.04)
Africa > Zimbabwe (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
(5 more...)

#artificialintelligenceOct-8-2021, 06:50:08 GMT

How to create a chatbot in Python

Natural language processing (NLP) is one of the most promising fields of artificial intelligence that uses natural languages to enable human interactions with machines. There are two main approaches to NLP: – rule-based methods, – statistical methods, i.e., methods related to machine learning. There are several exciting Python libraries for NLP, such as Natural Language Toolkit (NLTK), spaCy, TextBlob, etc. A chatbot is a computer software able to interact with humans using a natural language. They usually rely on machine learning, especially on NLP. Apple's Siri, Amazon's Alexa, Google Assitant, and Microsoft's Cortana are some well-known examples of software able to process natural languages.

bot, chatbot, chatterbot, (12 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

#artificialintelligenceNov-23-2020, 11:15:20 GMT

How to create a chatbot in Python

Today we will talk about how to create a chatbot with Python. Natural language processing (NLP) is one of the most promising fields of artificial intelligence that uses natural languages to enable human interactions with machines. There are two main approaches to NLP: – rule-based methods, – statistical methods, i.e., methods related to machine learning. There are several exciting Python libraries for NLP, such as Natural Language Toolkit (NLTK), spaCy, TextBlob, etc. A chatbot is a computer software able to interact with humans using a natural language. They usually rely on machine learning, especially on NLP.

bot, chatbot, python, (12 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

#artificialintelligenceOct-9-2019, 17:41:21 GMT

Disrupting Healthcare with Artificial Intelligence

The healthcare industry is evolving with the exponential increase in the exploration of artificial intelligence (AI). These implications go far beyond technology, points out the Everest Group, with the majority of AI decisions impacting everything from customer experience to cost to business processes. While there are certainly huge cost impacts (think: reduced need for customer care executives and reduced cost of population health management) as well as significant business impacts (think: increased healthcare savings and enhanced patient experience), the operational impact is perhaps the most vital because it personalizes patient care. To that end, physicians can make more accurate diagnoses and more efficiently engage with patients on a daily basis. This is where today's blog will focus: preventing physician burnout in the healthcare industry with the help of AI.

artificial intelligence, disrupting healthcare, physician, (11 more...)

Industry:

Health & Medicine > Consumer Health (0.93)
Health & Medicine > Therapeutic Area > Oncology (0.50)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.32)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.40)

Senter, James K., Royalty, Taylor M., Steen, Andrew D., Sadovnik, Amir

Unaligned Sequence Similarity Search Using Deep Learning

arXiv.org Machine LearningSep-15-2019

--Gene annotation has traditionally required direct comparison of DNA sequences between an unknown gene and a database of known ones using string comparison methods. However, these methods do not provide useful information when a gene does not have a close match in the database. In addition, each comparison can be costly when the database is large since it requires alignments and a series of string comparisons. In this work we propose a novel approach: using recurrent neural networks to embed DNA or amino-acid sequences in a low-dimensional space in which distances correlate with functional similarity. This embedding space overcomes both shortcomings of the method of aligning sequences and comparing homology. First, it allows us to obtain information about genes which do not have exact matches by measuring their similarity to other ones in the database. If our database is labeled this can provide labels for a query gene as is done in traditional methods. However, even if the database is unlabeled it allows us to find clusters and infer some characteristics of the gene population. In addition, each comparison is much faster than traditional methods since the distance metric is reduced to the Euclidean distance, and thus efficient approximate nearest neighbor algorithms can be used to find the best match. More specifically we show how our embedding can be useful for both classification tasks when our labels are known, and clustering tasks where our sequences belong to classes which have not been seen before. The central dogma of biology states that all organisms contain DNA, which is transcribed into RNA and then translated into proteins, which catalyze the chemical reactions that define life.

artificial intelligence, machine learning, sequence, (18 more...)

arXiv.org Machine Learning

1909.06929

Country: North America > United States > Tennessee (0.28)

Genre: Research Report > Promising Solution (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

#artificialintelligenceJul-12-2018, 10:26:12 GMT

Hinge's newest feature claims to use machine learning to find your best match

Most Compatible -- attempts to use all your cumulative data to find the perfect match for you. The company's been testing this feature, which occasionally recommends a possible match to users, for at least month now. Those recommendations were only offered once a week during testing but will now come every day. Justin McLeod, Hinge's CEO, tells me the company spent the testing time honing its backend algorithm and getting Most Compatible to a point where the company feels confident putting it fully out there. Most Compatible, he says, uses machine learning to figure out each user's taste.

artificial intelligence, hinge, machine learning, (9 more...)

Country: North America > United States > New York (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.62)

Helfmann, Luzie, von Lindheim, Johannes, Mollenhauer, Mattes, Banisch, Ralf

On Hyperparameter Search in Cluster Ensembles

arXiv.org Machine LearningMar-29-2018

Quality assessments of models in unsupervised learning and clustering verification in particular have been a long-standing problem in the machine learning research. The lack of robust and universally applicable cluster validity scores often makes the algorithm selection and hyperparameter evaluation a tough guess. In this paper, we show that cluster ensemble aggregation techniques such as consensus clustering may be used to evaluate clusterings and their hyperparameter configurations. We use normalized mutual information to compare individual objects of a clustering ensemble to the constructed consensus of the whole ensemble and show, that the resulting score can serve as an overall quality measure for clustering problems. This method is capable of highlighting the standout clustering and hyperparameter configuration in the ensemble even in the case of a distorted consensus. We apply this very general framework to various data sets and give possible directions for future research.

artificial intelligence, consensus, machine learning, (15 more...)

arXiv.org Machine Learning

1803.11008

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)