AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.32)

He, Fengxiang, Liu, Tongliang, Webb, Geoffrey I, Tao, Dacheng

Instance-Dependent PU Learning by Bayesian Optimal Relabeling

arXiv.org Machine LearningAug-6-2018

When learning from positive and unlabelled data, it is a strong assumption that the positive observations are randomly sampled from the distribution of $X$ conditional on $Y = 1$, where X stands for the feature and Y the label. Most existing algorithms are optimally designed under the assumption. However, for many real-world applications, the observed positive examples are dependent on the conditional probability $P(Y = 1|X)$ and should be sampled biasedly. In this paper, we assume that a positive example with a higher $P(Y = 1|X)$ is more likely to be labelled and propose a probabilistic-gap based PU learning algorithms. Specifically, by treating the unlabelled data as noisy negative examples, we could automatically label a group positive and negative examples whose labels are identical to the ones assigned by a Bayesian optimal classifier with a consistency guarantee. The relabelled examples have a biased domain, which is remedied by the kernel mean matching technique. The proposed algorithm is model-free and thus do not have any parameters to tune. Experimental results demonstrate that our method works well on both generated and real-world datasets.

artificial intelligence, classifier, machine learning, (19 more...)

1808.0218

Country:

Oceania > Australia (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.66)

arXiv.org Machine LearningAug-6-2018

Hashing with Binary Matrix Pursuit

Cakir, Fatih, He, Kun, Sclaroff, Stan

We propose theoretical and empirical improvements for two-stage hashing methods. We first provide a theoretical analysis on the quality of the binary codes and show that, under mild assumptions, a residual learning scheme can construct binary codes that fit any neighborhood structure with arbitrary accuracy. Secondly, we show that with high-capacity hash functions such as CNNs, binary code inference can be greatly simplified for many standard neighborhood definitions, yielding smaller optimization problems and more robust codes. Incorporating our findings, we propose a novel two-stage hashing method that significantly outperforms previous hashing studies on widely used image retrieval benchmarks.

artificial intelligence, binary code, machine learning, (18 more...)

1808.0199

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.94)
(2 more...)

#artificialintelligenceAug-3-2018, 02:35:49 GMT

Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning Amazon Web Services

Supervised learning needs labels, or annotations, that tell the algorithm what the right answers are in the training phases of your project. In fact, many of the examples of using MXNet, TensorFlow, and PyTorch start with annotated data sets you can use to explore the various features of those frameworks. Unfortunately, when you move from the examples to application, it's much less common to have a fully annotated set of data at your fingertips. This tutorial will show you how you can use Amazon Mechanical Turk (MTurk) from within your Amazon SageMaker notebook to get annotations for your data set and use them for training. TensorFlow provides an example of using an Estimator to classify irises using a neural network classifier.

annotation, artificial intelligence, machine learning, (17 more...)

Country: North America > United States > California > Orange County > Irvine (0.04)

Genre: Instructional Material (0.54)

Industry:

Retail > Online (0.40)
Information Technology > Services (0.40)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.62)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.55)

U.S. NewsAug-2-2018, 20:51:36 GMT

Rainfall Records Set Across North Carolina During Soggy July

The weather service reported Cape Hatteras got 20.31 inches (50 centimeters) of rain last month, well above the normal of 4.99 inches (12.66 centimeters), based on a 30-year average. It's the wettest July on record and the second wettest month ever, trailing only the 21.40 inches (54 centimeters) that fell on Cape Hatteras in September 1999 due to Hurricane Floyd.

artificial intelligence, machine learning, rainfall record set, (3 more...)

U.S. News

Country: North America > United States > North Carolina (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)

arXiv.org Machine LearningAug-2-2018

Mobile big data analysis with machine learning

Xie, Jiyang, Song, Zeyu, Li, Yupeng, Ma, Zhanyu

Wi-Fi) and the second/third/fourth generation (2/3/4G) mobile network, the number of mobile phones, which is 7.74 billion, 103.5 per 100 inhabitants all over the world in 2017, is rising dramatically [1]. Nowadays, mobile phone can not only send voice and text messages, but also easily and conveniently access the Internet which has been recognized as the most revolutionary development of Mobile Internet (M-Internet). Meanwhile, worldwide active mobile-broadband subscriptions in 2017 have increased to 4.22 billion, which is 9.21% higher than that in 2016 [1]. Figure 1 shows the numbers of mobile-cellular telephone and active mobile-broadband subscriptions of the world and main districts from 2010 to 2017. The numbers which are up to the bars are the mobile-cellular telephone or active mobile-broadband subscriptions (million) in the world of the year which increase each year. Under the M-Internet, various kinds of content (image, voice, video, etc.) can be sent and received everywhere and the related applications emerge to satisfy people's requirements, including working, study, daily life, entertainment, education, healthcare, etc. In China, mobile applications giants, i.e., Baidu, Alibaba and Tencent, held 78% of M-Internet online time per day in App which was about 2,412 minutes in 2017 [2]. This figure indicates that M-Internet has entered a rapidly growth stage.

artificial intelligence, data mining, machine learning, (15 more...)

1808.00803

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > Middle East > Jordan (0.04)
Africa > Senegal > Kolda Region > Kolda (0.04)
(3 more...)

Genre:

Overview (0.93)
Research Report > New Finding (0.46)
Research Report > Promising Solution (0.46)

Industry:

Telecommunications (1.00)
Health & Medicine (1.00)
Information Technology > Services (0.68)
Energy > Power Industry (0.68)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
(7 more...)

U.S. NewsAug-1-2018, 08:51:32 GMT

Tulane University: Fundraising Record Set With $150M Raised

Among the major donations: $25 million from the family of Dr. John Winton Deming to name the John W. Deming Department of Medicine; and a $10 million gift from Tulane alumni Steven and Jann Paul to build the Steven and Jann Paul Hall for Science and Engineering. There also was an anonymous lead gift and other donations to begin construction on a $55 million building to be called The Commons, which will include a new dining hall and meeting spaces.

artificial intelligence, fundraising record set, inductive learning, (3 more...)

U.S. News

Country: North America > United States > Louisiana (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)

Liu, Bin, Tsoumakas, Grigorios

Making Classifier Chains Resilient to Class Imbalance

arXiv.org Machine LearningJul-31-2018

Class imbalance is an intrinsic characteristic of multi-label data. Most of the labels in multi-label data sets are associated with a small number of training examples, much smaller compared to the size of the data set. Class imbalance poses a key challenge that plagues most multi-label learning methods. Ensemble of Classifier Chains (ECC), one of the most prominent multi-label learning methods, is no exception to this rule, as each of the binary models it builds is trained from all positive and negative examples of a label. To make ECC resilient to class imbalance, we first couple it with random undersampling. We then present two extensions of this basic approach, where we build a varying number of binary models per label and construct chains of different sizes, in order to improve the exploitation of majority examples with approximately the same computational budget. Experimental results on 16 multi-label datasets demonstrate the effectiveness of the proposed approaches in a variety of evaluation metrics.

artificial intelligence, machine learning, majority example, (16 more...)

1807.11393

Country:

Europe > Greece > Central Macedonia > Thessaloniki (0.04)
Europe > Greece > Attica > Athens (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

#artificialintelligenceJul-28-2018, 23:10:03 GMT

Train a model on fashion dataset

Fashion MNIST is a direct drop-in replacement for the original MNIST dataset. The dataset is made up of 60,000 training examples and 10,000 testing examples, where each example is a 28 28 grayscaled picture of various articles of clothing. The Fashion MNIST dataset is more difficult than the original MNIST, and thus serves as a more complete benchmarking tool. The model being trained is a CNN with three convolutional layers followed by two dense layers. The job will run for 30 epochs, with a batch size of 128.

artificial intelligence, dataset, inductive learning, (3 more...)

Industry: Information Technology (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.65)

#artificialintelligenceJul-19-2018, 03:01:29 GMT

Machine Learning : What is Machine Learning ?

Machine learning is a method used to make complex models and algorithms by analysing huge amount of data, that lend themselves to prediction, making use of computers. It has strong relation with mathematics. Which optimizes and delivers methods, theory and application domains to this field. It is sometimes conflated with data mining, whereas Data Mining is process where intelligent methods are applied to extract data patterns. Tom M. Mitchell provided a widely quoted, more formal definition of the algorithms studied in the machine learning field: "A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E. This definition of the tasks in which machine learning is concerned offers a fundamentally operational definition rather than defining the field in cognitive terms.

artificial intelligence, inductive learning, machine learning, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.34)