AITopics | dcnt

The Doubly Correlated Nonparametric Topic Model

Neural Information Processing Systems, Apr-6-2023, 12:46:43 GMT

Topic models are learned via a statistical model of variation within document collections, but designed to extract meaningful semantic structure. Desirable traits include the ability to incorporate annotations or metadata associated with documents; the discovery of correlated patterns of topic usage; and the avoidance of parametric assumptions, such as manual specification of the number of topics. We propose a doubly correlated nonparametric topic (DCNT) model, the first model to simultaneously capture all three of these properties.

dcnt, doubly correlated nonparametric topic model, metadata

Neural Information Processing Systems



The Doubly Correlated Nonparametric Topic Model

Kim, Dae I., Sudderth, Erik B.

Neural Information Processing Systems, Feb-14-2020, 23:27:12 GMT


dcnt, doubly correlated nonparametric topic model, metadata

Neural Information Processing Systems



The Doubly Correlated Nonparametric Topic Model

Kim, Dae I., Sudderth, Erik B.

Neural Information Processing Systems, Dec-31-2011

Topic models are learned via a statistical model of variation within document collections, but designed to extract meaningful semantic structure. Desirable traits include the ability to incorporate annotations or metadata associated with documents; the discovery of correlated patterns of topic usage; and the avoidance of parametric assumptions, such as manual specification of the number of topics. We propose a doubly correlated nonparametric topic (DCNT) model, the first model to simultaneously capture all three of these properties. The DCNT models metadata via a flexible, Gaussian regression on arbitrary input features; correlations via a scalable square-root covariance representation; and nonparametric selection from an unbounded series of potential topics via a stick-breaking construction. We validate the semantic structure and predictive performance of the DCNT using a corpus of NIPS documents annotated by various metadata.
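The stick-breaking construction named in the abstract can be illustrated with a minimal sketch. It assumes a logistic link and a finite truncation level, neither of which is specified in the abstract above, and the function names are hypothetical; it is an illustration of the general technique, not the authors' implementation.

```python
import math
import random


def logistic(x: float) -> float:
    """Map a real number to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def stick_breaking_weights(scores: list[float]) -> list[float]:
    """Turn real-valued scores into topic weights by breaking a unit stick.

    Topic k takes the fraction logistic(scores[k]) of whatever stick length
    the earlier topics left over. The final entry is the mass remaining for
    all topics beyond the truncation level.
    """
    weights = []
    remaining = 1.0
    for score in scores:
        fraction = logistic(score)
        weights.append(remaining * fraction)
        remaining *= 1.0 - fraction
    weights.append(remaining)
    return weights


rng = random.Random(0)
# Hypothetical per-document scores (assumption): in a metadata-aware model
# their means could come from a regression on document features.
scores = [rng.gauss(0.0, 1.0) for _ in range(10)]
weights = stick_breaking_weights(scores)
print(round(sum(weights), 6))
```

Because each topic only takes a share of the stick that is still left, the weights stay positive and sum to one for any number of scores, which is what lets the series of potential topics be extended without fixing their number in advance.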

correlation, dataset, metadata

Neural Information Processing Systems
