Does a Neural Network Really Encode Symbolic Concepts?
arXiv.org Artificial Intelligence
Recently, a series of studies have tried to extract interactions between input variables modeled by a DNN and to define such interactions as concepts encoded by the DNN. However, strictly speaking, there is still no solid guarantee that such interactions indeed represent meaningful concepts. Therefore, in this paper, we examine the trustworthiness of interaction concepts from four perspectives. Extensive empirical studies have verified that a well-trained DNN usually encodes sparse, transferable, and discriminative concepts, which is partially aligned with human intuition.
Dec-1-2023
- Country:
  - Asia
  - Europe
    - Austria (0.04)
    - Italy > Marche
      - Ancona Province > Ancona (0.04)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - United States > Hawaii
      - Honolulu County > Honolulu (0.04)
- Genre:
  - Research Report (1.00)
- Industry:
  - Leisure & Entertainment > Games (0.47)
- Technology: