3d779cae2d46cf6a8a99a35ba4167977-AuthorFeedback.pdf

Neural Information Processing Systems

Our approach is purely based on 2D convolutions. Nevertheless, it outperforms or performs comparably to many more costly 3D models. We thank the reviewers for pointing out some related (or missing) references. The Timeception layers involve group convolutions at different time scales while our TAM layers only use depthwise convolution. As a result, the Timeception has significantly more parameters than the TAM (10% vs. 0.1% of the total model parameters).
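The parameter gap between grouped and depthwise convolution follows from the usual weight-count formula. The sketch below is only an illustration of that formula for a 1D (temporal) convolution. The function name, channel count, kernel size, and group count are assumed values chosen for the example; they are not taken from the TAM or Timeception papers and do not reproduce the 10% vs. 0.1% figures.

```python
def conv1d_params(in_channels, out_channels, kernel_size, groups=1):
    """Number of weights in a 1D convolution (bias ignored).

    Each output channel only sees in_channels // groups input channels,
    so more groups means fewer weights.
    """
    if in_channels % groups or out_channels % groups:
        raise ValueError("channel counts must be divisible by groups")
    return (in_channels // groups) * out_channels * kernel_size


# Illustrative sizes only (assumptions, not values from either paper).
channels, kernel = 256, 3

dense = conv1d_params(channels, channels, kernel)                       # groups = 1
grouped = conv1d_params(channels, channels, kernel, groups=8)           # group convolution
depthwise = conv1d_params(channels, channels, kernel, groups=channels)  # one filter per channel

print(dense, grouped, depthwise)
```

With one group per channel, the count grows linearly in the number of channels instead of quadratically, which is why a depthwise temporal layer adds so few parameters.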


6 Supplementary Material

Neural Information Processing Systems

The original CLUTRR data generation framework made sure that each test proof is not in the training set in order to test whether a model is able to generalize to unseen proofs. Initial results on the original CLUTRR test sets resulted in strong model performance ( 99%) on levels seen during training (2, 4, 6) but no generalization at all ( 0%) to other levels. The models are given as input " [story] [query] " and asked to generate the proof and answer. Models are trained on levels 2, 4, 6 only. In our case, the entity names are important to evaluate systematic generalization.



6f5216f8d89b086c18298e043bfe48ed-Paper.pdf

Neural Information Processing Systems

Without requiring repeatable trials, it can flexibly capture covariate-dependent joint SCDs, and provide interpretable latent causes underlying the statistical dependencies between neurons.





Fast Sparse Group Lasso

Yasutoshi Ida, Yasuhiro Fujiwara, Hisashi Kashima

Neural Information Processing Systems

However, as an update of only one parameter group depends on all the parameter groups or data points, the computation cost is high when the number of the parameters or data points is large. This paper proposes a fast Block Coordinate Descent for Sparse Group Lasso.
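To make the cost argument concrete, here is a minimal sketch of a plain block coordinate descent loop for Sparse Group Lasso. It is not the fast method the paper proposes. It assumes a common form of the objective, 1/(2n)||y - Xb||^2 + lam*((1-alpha)*sum_g sqrt(p_g)||b_g||_2 + alpha*||b||_1), and uses a proximal-gradient step per block with a conservative step size. All names (sgl_block_cd, lam, alpha) are hypothetical.

```python
import math


def soft_threshold(v, t):
    """Elementwise l1 shrinkage."""
    return math.copysign(max(abs(v) - t, 0.0), v)


def sgl_block_cd(X, y, groups, lam, alpha, n_iter=200):
    """Plain block coordinate proximal-gradient descent (baseline sketch)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # y - X @ beta, with beta = 0 initially
    # Squared Frobenius norm / n upper-bounds each block's Lipschitz constant.
    lips = [sum(X[i][j] ** 2 for i in range(n) for j in g) / n for g in groups]
    for _ in range(n_iter):
        for g, lip in zip(groups, lips):
            if lip == 0.0:
                continue
            step = 1.0 / lip
            # Gradient step: needs the full residual, i.e. all data points
            # and (through resid) every other parameter group.
            z = [beta[j] + step * sum(X[i][j] * resid[i] for i in range(n)) / n
                 for j in g]
            # Prox of the penalty: l1 soft-threshold, then group shrinkage.
            z = [soft_threshold(v, step * lam * alpha) for v in z]
            norm = math.sqrt(sum(v * v for v in z))
            thresh = step * lam * (1.0 - alpha) * math.sqrt(len(g))
            scale = max(0.0, 1.0 - thresh / norm) if norm > 0 else 0.0
            for j, v in zip(g, z):
                delta = scale * v - beta[j]
                if delta != 0.0:
                    for i in range(n):
                        resid[i] -= X[i][j] * delta
                    beta[j] += delta
    return beta


# Toy data (made up for illustration): identity design, two groups of two.
X_toy = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
y_toy = [1.0, 2.0, 3.0, 4.0]
groups_toy = [[0, 1], [2, 3]]

beta_unpenalized = sgl_block_cd(X_toy, y_toy, groups_toy, lam=0.0, alpha=0.5)
beta_heavy = sgl_block_cd(X_toy, y_toy, groups_toy, lam=100.0, alpha=0.5)
```

Every block update in this baseline scans all n rows, and the residual it reads depends on all other groups. That is the per-update cost the passage describes.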


Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition

Neural Information Processing Systems

Facial action unit (AU) recognition is essential for emotion analysis and has been widely applied in mental state analysis. Existing work on AU recognition usually requires a large face dataset with accurate AU labels. However, manual AU annotation requires expertise and can be time-consuming. In this work, we propose a semi-supervised approach for AU recognition, inspired by co-training methods, that utilizes a large number of web face images without AU labels and a small face dataset with AU labels.