Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >
Multi-modal shared module that enables the bottom-up formation of map representation and top-down map reading
Title: | Multi-modal shared module that enables the bottom-up formation of map representation and top-down map reading |
Authors: | Noguchi, Wataru Browse this author →KAKEN DB | Iizuka, Hiroyuki Browse this author →KAKEN DB | Yamamoto, Masahito Browse this author →KAKEN DB |
Keywords: | Cognitive map | multimodal learning | predictive learning | deep neural networks | symbol grounding |
Issue Date: | 2-Nov-2021 |
Publisher: | Taylor & Francis |
Journal Title: | Advanced Robotics |
Volume: | 36 |
Issue: | 1-2 |
Start Page: | 85 |
End Page: | 99 |
Publisher DOI: | 10.1080/01691864.2021.1993334 |
Abstract: | Humans create internal models of an environment (i.e. cognitive maps) through subjective sensorimotor experiences and can also understand spatial locations by looking at an external map as a symbol of an environment. We simulate the development of the cognitive map from sensorimotor experiences and grounding of the external map in a single deep neural network model. Our proposed network has a shared module that processes the features of multiple modalities (i.e. vision, hearing, and touch) and even external maps in the same manner. The multiple modalities are encoded into feature vectors by modality-specific encoders, and the encoded features are processed by the same shared module. The proposed network was trained to predict the sensory inputs of a simulated mobile robot. After the predictive learning, the spatial representation was developed in the internal states of the shared module, and the same spatial representation was used for predicting multiple modalities, including the external map. The network can also perform spatial navigation by associating the external map with the cognitive map. This implies that the external maps are grounded in subjective sensorimotor experiences, being bridged through the developed internal spatial representation in the shared module. |
Rights: | This is an Accepted Manuscript of an article published by Taylor & Francis in Advanced robotics on 02 Nov 2021, available online: https://www.tandfonline.com/doi/10.1080/01691864.2021.1993334 |
Type: | article (author version) |
URI: | http://hdl.handle.net/2115/87060 |
Appears in Collections: | 情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)
|
Submitter: 野口 渉
|