Exploring CNN-Based Architectures for Multimodal Salient Event Detection in Videos | IEEE Conference Publication | IEEE Xplore