A new dataset for multilingual keyphrase generation