Unsupervised or Indirectly Supervised Learning
SupplementaryMaterialsVIME: ExtendingtheSuccessofSelf-and Semi-supervisedLearningtoTabularDomain
Semisupervised learning uses the trained encoder in learning a predictive model on both labeled and unlabeleddata. Figure 3: The proposed data corruption procedure. Original feature matrix(X) consists of four samples xi,i = 1...,4, where each row/column represents a sample/feature, and the features in each sample are represented by the same color. In the experiment section of the main manuscript, we evaluate VIME and its benchmarks on 11 datasets(6genomics,2clinical,and3publicdatasets). The selected SNPs and the corresponding blood cell trait together form an independent labeled dataset.
3953630da28e5181cffca1278517e3cf-Supplemental.pdf
However, ifฯ is too high, most of the unlabeled data points would not be used for consistency regularization. Based on these insights, we setฯ as 0.95 in our experiments. We describe further details of the experimental setup. To train the ReMixMatch, we gradually increased the coefficient of the loss associated with the unlabeled data points, following [4]. We found that without this gradual increase, the validation loss of the ReMixMatch did not converge.