AITopics

1711.02074

Country:

Asia (0.29)
North America > United States > Massachusetts (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Machine Learning, Nov-6-2017

Trimmed Density Ratio Estimation

Liu, Song, Takeda, Akiko, Suzuki, Taiji, Fukumizu, Kenji

Density ratio estimation (DRE) [18, 11, 27] is an important tool in various branches of machine learning and statistics. Because it directly models the differences between two probability density functions, DRE finds applications in change detection [13, 6], two-sample testing [32] and outlier detection [1, 26]. In recent years, a sampling framework called the Generative Adversarial Network (GAN) (see e.g., [9, 19]) has used the density ratio function to compare artificial samples from a generative distribution with real samples from an unknown distribution. DRE has also been widely discussed in the statistical literature for adjusting nonparametric density estimation [5], stabilizing the estimation of heavy-tailed distributions [7] and fitting multiple distributions at once [8]. However, because a density ratio function can grow without bound, DRE can suffer from robustness and stability issues: a few corrupted points may completely mislead the estimator (see Figure 2 in Section 6 for an example).
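As a brief illustration of why the ratio can be unbounded, the standard definition can be written out. The symbols p, q and r below are notation assumed here for illustration and are not taken from the excerpt.

```latex
% Density ratio between two densities p and q (assumed notation):
\[
  r(x) = \frac{p(x)}{q(x)}, \qquad q(x) > 0 .
\]
% If q(x) \to 0 in a region where p(x) stays bounded away from zero,
% then r(x) \to \infty, so a single sample falling in such a region
% can dominate an estimate of r.
```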

artificial intelligence, data mining, machine learning, (16 more...)

1703.03216

Country: Asia (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.48)
(4 more...)

#artificialintelligence, Nov-5-2017, 16:20:12 GMT

Artificial intelligence helps detect ovarian cancer early and accurately

Ovarian cancer is difficult to diagnose, particularly in its early stages, when survival rates are much higher. Because there is no consistently reliable screening test to detect ovarian cancer, most women are diagnosed with the disease when it's in an advanced stage. However, researchers at Brigham and Women's Hospital and Dana-Farber Cancer Institute have developed a non-invasive diagnostic test using artificial intelligence for the accurate detection of true cases of early-stage disease. Results of their study were published online this week in the journal eLife. By combining next generation sequencing with artificial intelligence, researchers have created a novel blood test based on serum microRNAs--small, non-coding pieces of genetic material that help control where and when genes are activated--for the early diagnosis of ovarian cancer.

cancer, detect ovarian cancer, ovarian cancer, (12 more...)

Country: Europe > Poland (0.05)

Industry:

Health & Medicine > Therapeutic Area > Oncology > Ovarian Cancer (1.00)
Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.57)

#artificialintelligence, Nov-4-2017, 23:15:23 GMT

How to assess quality and correctness of classification models? Part 4 - ROC Curve

We test the classifier at different alpha thresholds. Recall that alpha is the threshold on the estimated probability: an observation whose estimated probability is above alpha is assigned to the positive class, and one whose estimated probability is below alpha is assigned to the negative class.
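The threshold sweep described above can be sketched in a few lines of Python. This is a minimal illustration, not the article's own code: the function names `roc_point` and `roc_points`, the toy scores and the convention that a score strictly above alpha counts as positive are all assumptions.

```python
def roc_point(scores, labels, alpha):
    """Return (false positive rate, true positive rate) at threshold alpha.

    Assumed convention: an observation is predicted positive when its
    estimated probability is strictly above alpha.
    """
    tp = fp = fn = tn = 0
    for score, label in zip(scores, labels):
        predicted_positive = score > alpha
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive and label == 0:
            fp += 1
        elif label == 1:
            fn += 1
        else:
            tn += 1
    # Guard against a data set with no positives or no negatives.
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return fpr, tpr


def roc_points(scores, labels, alphas):
    """Evaluate the classifier at each threshold; each result is one ROC point."""
    return [roc_point(scores, labels, alpha) for alpha in alphas]


# Hypothetical toy data: estimated probabilities and true classes.
example_scores = [0.9, 0.8, 0.35, 0.1]
example_labels = [1, 0, 1, 0]
example_curve = roc_points(example_scores, example_labels, [0.0, 0.2, 0.5, 1.0])
```

Plotting the resulting (FPR, TPR) pairs, one per alpha, traces the ROC curve; lowering alpha moves the point toward the upper right corner.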

artificial intelligence, machine learning, quality and correctness, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.40)

Pleiss, Geoff, Raghavan, Manish, Wu, Felix, Kleinberg, Jon, Weinberger, Kilian Q.

On Fairness and Calibration

arXiv.org Machine Learning, Nov-3-2017

The machine learning community has become increasingly concerned with the potential for bias and discrimination in predictive models. This has motivated a growing line of work on what it means for a classification procedure to be "fair." In this paper, we investigate the tension between minimizing error disparity across different population groups and maintaining calibrated probability estimates. We show that calibration is compatible only with a single error constraint (i.e. equal false-negative rates across groups), and show that any algorithm that satisfies this relaxation is no better than randomizing a percentage of predictions for an existing classifier. These unsettling findings, which extend and generalize existing results, are empirically confirmed on several datasets.

artificial intelligence, classifier, machine learning, (16 more...)

1709.02012

Country: North America > United States (0.93)

Genre: Research Report > New Finding (0.46)

Industry: Law > Civil Rights & Constitutional Law (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

#artificialintelligence, Nov-2-2017, 00:20:04 GMT

New blood test developed to diagnose ovarian cancer

Investigators from Brigham and Women's Hospital and Dana-Farber Cancer Institute are leveraging the power of artificial intelligence to develop a new technique to detect ovarian cancer early and accurately. The team has identified a network of circulating microRNAs - small, non-coding pieces of genetic material - that are associated with risk of ovarian cancer and can be detected from a blood sample. Their findings are published online in eLife. Most women are diagnosed with ovarian cancer when the disease is at an advanced stage, at which point only about a quarter of patients will survive for at least five years. But for women whose cancer is serendipitously picked up at an early stage, survival rates are much higher.

artificial intelligence, machine learning, ovarian cancer, (17 more...)

Country:

North America > United States (0.30)
Europe > Poland > Łódź Province > Łódź (0.05)

Genre: Research Report > New Finding (0.69)

Industry:

Health & Medicine > Therapeutic Area > Oncology > Ovarian Cancer (1.00)
Government > Regional Government > North America Government > United States Government > FDA (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.52)

Bouchard, Kristofer E., Bujan, Alejandro F., Roosta-Khorasani, Farbod, Ubaru, Shashanka, Prabhat, Snijders, Antoine M., Mao, Jian-Hua, Chang, Edward F., Mahoney, Michael W., Bhattacharyya, Sharmodeep

Union of Intersections (UoI) for Interpretable Data Driven Discovery and Prediction

arXiv.org Machine Learning, Nov-2-2017

The increasing size and complexity of scientific data could dramatically enhance discovery and prediction for basic scientific applications. Realizing this potential, however, requires novel statistical analysis methods that are both interpretable and predictive. We introduce Union of Intersections (UoI), a flexible, modular, and scalable framework for enhanced model selection and estimation. Methods based on UoI perform model selection and model estimation through intersection and union operations, respectively. We show that UoI-based methods achieve low-variance and nearly unbiased estimation of a small number of interpretable features, while maintaining high-quality prediction accuracy. We perform extensive numerical investigation to evaluate a UoI algorithm ($UoI_{Lasso}$) on synthetic and real data. In doing so, we demonstrate the extraction of interpretable functional networks from human electrophysiology recordings as well as accurate prediction of phenotypes from genotype-phenotype data with reduced features. We also show (with the $UoI_{L1Logistic}$ and $UoI_{CUR}$ variants of the basic framework) improved prediction parsimony for classification and matrix factorization on several benchmark biomedical data sets. These results suggest that methods based on the UoI framework could improve interpretation and prediction in data-driven discovery across scientific fields.

algorithm, artificial intelligence, machine learning, (15 more...)

1705.07585

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)

#artificialintelligence, Nov-1-2017, 15:55:48 GMT

Join the disruptors of health science

Thomas Insel left Verily, a health-science spin-off formed by Google's parent company, to co-found a start-up called Mindstrong Health this year. In early 2015, I testified with several other National Institutes of Health (NIH) directors at an annual hearing held by the US Senate. It was my 13th and final year as director of the US National Institute of Mental Health (NIMH) in Bethesda, Maryland. What struck me most was how the harsh fiscal reality tempered the passionate bipartisan support for the NIH. As one senator noted, with a federal deficit of nearly US$500 billion, there was little hope of any significant increase in funding. Six months after that hearing, I left the NIH for Silicon Valley, first working at Verily in South San Francisco, California, a health-science spin-off formed by Google's parent company Alphabet.

artificial intelligence, machine learning, tech company, (16 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.55)
North America > United States > Maryland > Montgomery County > Bethesda (0.24)
North America > United States > California > San Mateo County > South San Francisco (0.24)
(4 more...)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Consumer Health (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

arXiv.org Machine Learning, Oct-31-2017

Calibration for Stratified Classification Models

Zuo, Chandler

In classification problems, sampling bias between training data and testing data is critical to the ranking performance of classification scores. Such bias can be both unintentionally introduced by data collection and intentionally introduced by the algorithm, such as under-sampling or weighting techniques applied to imbalanced data. When such sampling bias exists, using the raw classification score to rank observations in the testing data can lead to suboptimal results. In this paper, I investigate the optimal calibration strategy in general settings, and develop a practical solution for one specific sampling bias case, where the sampling bias is introduced by stratified sampling. The optimal solution is developed by analytically solving the problem of optimizing the ROC curve. For practical data, I propose a ranking algorithm for general classification models with stratified data. Numerical experiments demonstrate that the proposed algorithm effectively addresses the stratified sampling bias issue. Interestingly, the proposed method shows its potential applicability in two other machine learning areas: unsupervised learning and model ensembling, which can be future research topics.
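To make the ranking problem concrete, here is a small Python sketch of a well-known prior correction for under-sampled negatives, applied per stratum. It is not the algorithm proposed in the paper. The function names, the `keep_rate` parameter (the fraction of negatives retained in a stratum) and the toy data are hypothetical, and the sketch assumes each score is a well-calibrated probability on the sampled data.

```python
def correct_score(score, keep_rate):
    """Map a score learned on under-sampled data back to the original scale.

    Assumes negatives in the stratum were kept with probability keep_rate,
    so sampled odds = true odds / keep_rate. Inverting that relation gives
    p = keep_rate * s / (keep_rate * s + 1 - s).
    """
    if not 0.0 < keep_rate <= 1.0:
        raise ValueError("keep_rate must be in (0, 1]")
    return keep_rate * score / (keep_rate * score + (1.0 - score))


def rank_observations(observations, keep_rates):
    """Rank observation ids by corrected score, highest first.

    observations: list of (obs_id, stratum, raw_score) tuples.
    keep_rates: dict mapping stratum -> negative keep rate (hypothetical).
    """
    corrected = [
        (correct_score(raw, keep_rates[stratum]), obs_id)
        for obs_id, stratum, raw in observations
    ]
    return [obs_id for _, obs_id in sorted(corrected, reverse=True)]


# Hypothetical example: stratum "B" kept only 10% of its negatives.
toy_observations = [("a1", "A", 0.4), ("b1", "B", 0.6)]
toy_keep_rates = {"A": 1.0, "B": 0.1}
toy_ranking = rank_observations(toy_observations, toy_keep_rates)
```

In this toy case the raw scores would rank `b1` above `a1`, but after correction `a1` comes first, which illustrates how stratum-specific sampling rates can distort a ranking built from raw scores.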

artificial intelligence, machine learning, testing data, (16 more...)

1711.00064

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Aoshima, Makoto, Yata, Kazuyoshi

Distance-based classifier by data transformation for high-dimension, strongly spiked eigenvalue models

arXiv.org Machine Learning, Oct-30-2017

We consider classifiers for high-dimensional data under the strongly spiked eigenvalue (SSE) model. We first show that high-dimensional data often follow the SSE model. We consider a distance-based classifier using eigenstructures for the SSE model. We apply the noise reduction methodology to the estimation of the eigenvalues and eigenvectors in the SSE model. We create a new distance-based classifier by transforming data from the SSE model to the non-SSE model. We give simulation studies and discuss the performance of the new classifier. Finally, we demonstrate the new classifier by using microarray data sets.

bioinformatics, classifier, machine learning, (18 more...)

1710.10768

Country: Asia > Japan (0.15)

Genre: Research Report (0.40)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.57)

Technology:

Information Technology > Data Science > Data Mining (0.71)
Information Technology > Biomedical Informatics > Translational Bioinformatics (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)