Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning
–Neural Information Processing Systems
Human reasoning can be understood as a cooperation between the intuitive, associative System-1'' and the deliberative, logical System-2''. For existing System-1-like methods in visual activity understanding, it is crucial to integrate System-2 processing to improve explainability, generalization, and data efficiency. One possible path of activity reasoning is building a symbolic system composed of symbols and rules, where one rule connects multiple symbols, implying human knowledge and reasoning abilities.Previous methods have made progress, but are defective with limited symbols from handcraft and limited rules from visual-based annotations, failing to cover the complex patterns of activities and lacking compositional generalization. To overcome the defects, we propose a new symbolic system with two ideal important properties: broad-coverage symbols and rational rules. Collecting massive human knowledge via manual annotations is expensive to instantiate this symbolic system.
Neural Information Processing Systems
Dec-25-2025, 14:05:35 GMT
- Technology: