GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks