Execution-Based Evaluation for Open-Domain Code Generation