Appendix T able of Contents

Neural Information Processing Systems 

For all baseline methods, we use the MinMaxScaler from sklearn. The likelihood of generating the validation conditioned on the remaining training series is used to select the hyperparameters. We compare the performance of our GPT -3 predictor against popular time series models. GPT -3 continues to be competitive with or outperforms the baselines on all of the tasks, from in-context learning alone. GPT -3's performance is not due to memorization of the test data. Even if our evaluation datasets are present in the GPT -3 training data, it's unlikely that GPT -3's good performance is the result of memorization for at least two reasons a priori.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found