We develop a novel spatio-temporal CNN architecture for feature learning from video frames.
CRF is coupled with CNN to achieve feature learning and structured prediction simultaneously.
We explore different combinations of feature functions for sequence labeling.
We validate our framework on segmented and unsegmented action datasets, respectively.
© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号 地址:北京市海淀区学院路29号 邮编:100083 电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700 |