How Good Is Your NLP Model Really?
SageMaker Processing allows us to provision a GPU machine on demand, and only for the time needed to evaluate the model. To do so, we use a slightly modified evaluation script that can interact with the Processing job. And this time we will run the evaluation on the entire test dataset, i.e. 15K records. Once the run is complete, we can find the evaluation results in a JSON file on the specified output folder in S3 (in our case the file will be called evaluation.json): In fact, the evaluation results tell us that the Processing job managed to run 177 samples per second.
Aug-24-2021, 12:35:15 GMT
- Technology: