Duplicate Question Detection with Deep Learning on Quora Dataset - A Blog From Human-engineer-being
Quora recently released its first public dataset. It includes 404351 question pairs with a label column indicating whether each pair is a duplicate or not. In this post, I would like to investigate this dataset and propose at least a baseline method with deep learning. Besides the proposed method, the post includes some examples showing how to use Pandas, Gensim, Spacy and Keras. For the full code, check Github.
Apr-6-2017, 08:08:05 GMT