Subject-specific Deep Neural Networks for Count Data with High-cardinality Categorical Features

Lee, Hangbin, Ha, Il Do, Hwang, Changha, Lee, Youngjo

Oct-17-2023–arXiv.org Machine Learning

Deep neural networks (DNNs), which have been proposed to capture the nonlinear relationship between input and output variables (LeCun et al., 2015; Goodfellow et al., 2016), provide outstanding marginal predictions for independent outputs. However, in practical applications, it is common to encounter correlated data with high-cardinality categorical features, which can pose challenges for DNNs. While the traditional DNN framework overlooks such correlation, random effect models have emerged in statistics to make subject-specific predictions for correlated data. Lee and Nelder (1996) proposed hierarchical generalized linear models (HGLMs), which allow the incorporation of random effects from an arbitrary conjugate distribution of generalized linear model (GLM) family. Both DNNs and random effect models have been successful in improving prediction accuracy of linear models but in different ways. Recently, there has been a rising interest in combining these two extensions. Simchoni and Rosset (2021, 2023) proposed the linear mixed model neural network for continuous (Gaussian) outputs with Gaussian random effects, which allow explicit expressions for likelihoods. Lee and Lee (2023) introduced the hierarchical likelihood (h-likelihood) approach, as an extension of classical likelihood for Gaussian outputs, which provides an efficient likelihood-based procedure. For non-Gaussian (discrete) outputs, Tran et al. (2020) proposed a Bayesian approach for DNNs with normal random effects using the variational approximation method (Bishop and Nasrabadi, 2006; Blei

artificial intelligence, machine learning, random effect, (19 more...)

arXiv.org Machine Learning

Oct-17-2023

arXiv.org PDF

Add feedback

Country:
- Asia > South Korea > Seoul > Seoul (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Health & Medicine > Therapeutic Area (0.95)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Learning Graphical Models > Directed Networks
    - Bayesian Learning (0.34)
  - Neural Networks > Deep Learning (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found