用户名: 密码: 验证码:
Symbolic Association Using Parallel Multilayer Perceptron
详细信息    查看全文
  • 关键词:Symbol grounding ; Neural network ; Cognitive model
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9887
  • 期:1
  • 页码:347-354
  • 全文大小:683 KB
  • 参考文献:1.Andersen, E.S., Dunlea, A., Kekelis, L.: The impact of input: language acquisition in the visually impaired. First Lang. 13(37), 23–49 (1993)CrossRef
    2.Asano, M., Imai, M., Kita, S., Kitajo, K., Okada, H., Thierry, G.: Sound symbolism scaffolds language development in preverbal infants. Cortex 63, 196–205 (2015)CrossRef
    3.Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)MATH
    4.Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. 39(1), 1–38 (1977)MathSciNet MATH
    5.Harnad, S.: The symbol grounding problem. Phys. D Nonlinear Phenom. 42(1), 335–346 (1990)MathSciNet CrossRef
    6.Khan, I., Saffari, A., Bischof, H.: Tvgraz: multi-modal learning of object categories by combining textual and visual features. In: AAPR Workshop, pp. 213–224 (2009)
    7.Lecun, Y., Cortes, C.: The MNIST database of handwritten digits
    8.Lowe, D.: Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 2, pp. 1150–1157 (1999)
    9.Nakamura, T., Araki, T., Nagai, T., Iwahashi, N.: Grounding of word meanings in latent dirichlet allocation-based multimodal concepts. Adv. Robot. 25(17), 2189–2206 (2011)CrossRef
    10.Needham, C.J., Santos, P.E., Magee, D.R., Devin, V., Hogg, D.C., Cohn, A.G.: Protocols from perceptual observations. Artif. Intell. 167(1), 103–136 (2005)CrossRef
    11.Nene, S.A., Nayar, S.K., Murase, H.: Columbia Object Image Library (COIL-20). Technical report, February 1996
    12.Pereira, J.C., Vasconcelos, N.: Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems. Comput. Vis. Image Underst. 124, 123–135 (2014)CrossRef
    13.Plunkett, K., Sinha, C., Møller, M.F., Strandsby, O.: Symbol grounding or the emergence of symbols? vocabulary growth in children and a connectionist net. connection Sci. 4(3–4), 293–312 (1992)CrossRef
    14.Rasiwasia, N., Costa Pereira, J., Coviello, E., Doyle, G., Lanckriet, G., Levy, R., Vasconcelos, N.: A new approach to cross-modal multimedia retrieval. In: ACM International Conference on Multimedia, pp. 251–260 (2010)
    15.Raue, F., Byeon, W., Breuel, T., Liwicki, M.: Parallel sequence classification using recurrent neural networks and alignment. In: 13th International Conference on Document Analysis and Recognition (ICDAR) (2015)
    16.Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, ICCV 2003, vol. 2, p. 1470. IEEE Computer Society, Washington, DC (2003)
    17.Spencer, P.E.: Looking without listening: is audition a prerequisite for normal development of visual attention during infancy? J. Deaf Stud. Deaf Educ. 5(4), 291–302 (2000)CrossRef
    18.Yu, C., Ballard, D.H.: A multimodal learning interface for grounding spoken language in sensory perceptions. ACM Trans. Appl. Percept. (TAP) 1(1), 57–80 (2004)CrossRef
  • 作者单位:Federico Raue (16) (17)
    Sebastian Palacio (17)
    Thomas M. Breuel (16)
    Wonmin Byeon (16) (17)
    Andreas Dengel (16) (17)
    Marcus Liwicki (16)

    16. University of Kaiserslautern, Kaiserslautern, Germany
    17. German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany
  • 丛书名:Artificial Neural Networks and Machine Learning – ICANN 2016
  • ISBN:978-3-319-44781-0
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9887
文摘
The goal of our paper is to learn the association and the semantic grounding of two sensory input signals that represent the same semantic concept. The input signals can be or cannot be the same modality. This task is inspired by infants learning. We propose a novel framework that has two symbolic Multilayer Perceptron (MLP) in parallel. Furthermore, both networks learn to ground semantic concepts and the same coding scheme for all semantic concepts in both networks. In addition, the training rule follows EM-approach. In contrast, the traditional setup of association task pre-defined the coding scheme before training. We have tested our model in two cases: mono- and multi-modal. Our model achieves similar accuracy association to MLPs with pre-defined coding schemes.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700