|
Chair:
|
|
Introduction: This session focuses on multi-modal data analysis, exploring how to break single-modal data limitations and boost intelligent system performance. Topics of interest include, but are not limited to: novel architectures for multi-modal learning, advanced approaches for multi-modal tasks, visual-inertial positioning, visual-acoustic event detection, visual-temperature industrial inspection, multi-modal data registration & alignment, advanced large language models for multi-modal learning, and real-world applications of multi-modal learning. We invite papers on model theory and technology, complex-environment robustness, and industry applications. Discussions will center on technological innovation and practical implementation, welcoming submissions of innovative frameworks, case studies, and challenge solutions. |
||
Topics: Novel architectures for multi-modal learning |
||