Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios