Reviews: Graph Agreement Models for Semi-Supervised Learning

Neural Information Processing Systems 

Is there any comparison with the baselines in terms of the number of network parameters? What is the performance of baselines with the same number of parameters as GAM? This seems to be a fairer comparison.