A sparse negative binomial mixture model for clustering RNA-seq count data