ICMR 2016: New York, NY, USA
- John R. Kender, John R. Smith, Jiebo Luo, Susanne Boll, Winston H. Hsu:
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, ICMR 2016, New York, New York, USA, June 6-9, 2016. ACM 2016, ISBN 978-1-4503-4359-6
Tutorials
- Vivek K. Singh, Siripen Pongpaichet, Ramesh C. Jain:
Situation Recognition from Multimodal Data. 1-2
- Ranran Feng, Balakrishnan Prabhakaran:
On the "Face of Things". 3-4
Keynote
- Shih-Fu Chang:
New Frontiers of Large Scale Multimedia Information Retrieval. 5
Oral: Deep Learning and Applications
- Xi Wang, Zhenfeng Sun, Wenqiang Zhang, Yu Zhou, Yu-Gang Jiang:
Matching User Photos to Online Products with Robust Deep Features. 7-14
- Baohan Xu, Yanwei Fu, Yu-Gang Jiang, Boyang Li, Leonid Sigal:
Video Emotion Recognition with Transferred Deep Feature Encodings. 15-22
- Lorenzo Baraldi, Costantino Grana, Rita Cucchiara:
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features. 23-29
- Jiyang Gao, Chen Sun, Ram Nevatia:
ACD: Action Concept Discovery from Image-Sentence Corpora. 31-38
Oral: Image and Video Content Analysis
- Wenjing Ma, Liangliang Cao, Lei Yu, Guoping Long, Yucheng Li:
GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring. 39-46
- Gloria Zen, Paloma de Juan, Yale Song, Alejandro Jaimes:
Mouse Activity as an Indicator of Interestingness in Video. 47-54
- Prithwi Raj Chakraborty, Dian Tjondronegoro, Ligang Zhang, Vinod Chandran:
Automatic Identification of Sports Video Highlights using Viewer Interest Features. 55-62
- Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot:
Diverse Concept-Level Features for Multi-Object Classification. 63-70
Oral: Brave New Ideas
- Eleftherios Spyromitros Xioufis, Symeon Papadopoulos, Adrian Popescu, Yiannis Kompatsiaris:
Personalized Privacy-aware Image Classification. 71-78
- Xingjie Wei, Jussi Palomäki, Jeff Yan, Peter Robinson:
The Science and Detection of Tilting. 79-86
- Siripen Pongpaichet, Mengfan Tang, Laleh Jalali, Ramesh C. Jain:
Using Photos as Micro-Reports of Events. 87-94
- Peter Knees, Kristina Andersen:
Searching for Audio by Sketching Mental Images of Sound: A Brave New Idea for Audio Retrieval in Creative Music Production. 95-102
Oral: Multimedia Datasets and Applications
- Markus Schedl:
The LFM-1b Dataset for Music Retrieval and Recommendation. 103-110
- Hengliang Zhu, Bin Sheng, Xiao Lin, Yangyang Hao, Lizhuang Ma:
Foreground Object Sensing for Saliency Detection. 111-118
- Youssef Tamaazousti, Hervé Le Borgne, Adrian Popescu:
Constrained Local Enhancement of Semantic Features by Content-Based Sparsity. 119-126
- Yi-Jie Lu, Hao Zhang, Maaike de Boer, Chong-Wah Ngo:
Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts. 127-134
Oral: Best Paper Candidates
- Shilun Lin, Zhicheng Zhao, Fei Su:
Homemade TS-Net for Automatic Face Recognition. 135-142
- Svetlana Kordumova, Thomas Mensink, Cees G. M. Snoek:
Pooling Objects for Recognizing Scenes without Examples. 143-150
- Nikolaos Pappas, Miriam Redi, Mercan Topkara, Brendan Jou, Hongyi Liu, Tao Chen, Shih-Fu Chang:
Multilingual Visual Sentiment Concept Matching. 151-158
- Qing Li, Zhaofan Qiu, Ting Yao, Tao Mei, Yong Rui, Jiebo Luo:
Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation. 159-166
Special: Learning with Semantic Information for Large Scale Multimedia Understanding
- Junchi Yan, Xu-Cheng Yin, Weiyao Lin, Cheng Deng, Hongyuan Zha, Xiaokang Yang:
A Short Survey of Recent Advances in Graph Matching. 167-174
- Pascal Mettes, Dennis C. Koelma, Cees G. M. Snoek:
The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection. 175-182
- Yiyang Yao, Yingjie Xia, Zhenyu Shan, Zhengguang Liu:
Learning for Traffic State Estimation on Large Scale of Incomplete Data. 183-187
Oral: Image and Video Search
- Vidyadhar Rao, Prateek Jain, C. V. Jawahar:
Diverse Yet Efficient Retrieval using Locality Sensitive Hashing. 189-196
- Yue Cao, Mingsheng Long, Jianmin Wang, Han Zhu:
Correlation Autoencoder Hashing for Supervised Cross-Modal Search. 197-204
- Mingmin Zhen, Wenmin Wang, Ronggang Wang:
Regional Subspace Projection Coding for Image Retrieval. 205-212
- Ahmet Iscen, Laurent Amsaleg, Teddy Furon:
Scaling Group Testing Similarity Search. 213-220
Posters
- Edward Kim, Shruthika Vangala:
Vinereactor: Crowdsourced Spontaneous Facial Expression Data. 221-224
- Yuchi Huang, Saad M. Khan:
Mirroring Facial Expressions: Evidence from Visual Analysis of Dyadic Interactions. 225-228
- Jianfei Xue, Koji Eguchi:
Sequential Correspondence Hierarchical Dirichlet Processes for Video Data Analysis. 229-233
- Zi-Yi Ke, Mei-Chen Yeh:
A Computational Approach to Finding Facial Patterns of a Babyface. 235-238
- Qin Jin, Junwei Liang:
Video Description Generation using Audio and Visual Cues. 239-242
- Sreyasi Nag Chowdhury, Mateusz Malinowski, Andreas Bulling, Mario Fritz:
Xplore-M-Ego: Contextual Media Retrieval Using Natural Language Queries. 243-247
- Dongjing Wang, ShuiGuang Deng, Xin Zhang, Guandong Xu:
Learning Music Embedding with Metadata for Context Aware Recommendation. 249-253
- Yuancheng Ye, Xuejian Rong, Xiaodong Yang, Yingli Tian:
Region Trajectories for Video Semantic Concept Detection. 255-259
- Chidansh Amitkumar Bhatt, Andrei Popescu-Belis, Matthew Cooper:
Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph. 261-264
- Yun Wang, Florian Metze:
Recurrent Support Vector Machines for Audio-Based Multimedia Event Detection. 265-269
- Xirong Li, Weiyu Lan, Jianfeng Dong, Hailong Liu:
Adding Chinese Captions to Images. 271-275
- Tanfang Chen, Shangfei Wang, Zhen Gao, Chongliang Wu:
Emotion Recognition from EEG Signals Enhanced by User's Profile. 277-280
- Shiqing Zhang, Shiliang Zhang, Tiejun Huang, Wen Gao:
Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition. 281-284
- Shichao Zhao, Youjiang Xu, Yahong Han:
Large-Scale E-Commerce Image Retrieval with Top-Weighted Convolutional Neural Networks. 285-288
- Giulia Fontanini, Marco Bertini, Alberto Del Bimbo:
Web Video Popularity Prediction using Sentiment and Content Visual Features. 289-292
- Takahiko Furuya, Ryutarou Ohbuchi:
Accurate Aggregation of Local Features by using K-sparse Autoencoder for 3D Model Retrieval. 293-297
- Venkatesh N. Murthy, Avinash Sharma, Visesh Chari, R. Manmatha:
Image Annotation using Multi-scale Hypergraph Heat Diffusion Framework. 299-303
- Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen:
Discriminant Cross-modal Hashing. 305-308
- Shin Matsuo, Keiji Yanai:
CNN-based Style Vector for Style Image Retrieval. 309-312
- Kuan-Hsien Liu, Ting-Yen Chen, Chu-Song Chen:
MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction. 313-316
- Rishabh Gupta, Mojtaba Khomami Abadi, Jesús Alejandro Cárdenes Cabré, Fabio Morreale, Tiago H. Falk, Nicu Sebe:
A Quality Adaptive Multimodal Affect Recognition System for User-Centric Multimedia Indexing. 317-320
- Daniel Carlos Guimarães Pedronette, Ricardo da Silva Torres:
Rank Diffusion for Context-Based Image Retrieval. 321-325
- Eva Mohedano, Kevin McGuinness, Noel E. O'Connor, Amaia Salvador, Ferran Marqués, Xavier Giró-i-Nieto:
Bags of Local Convolutional Features for Scalable Instance Search. 327-331
- Jan Zahálka, Stevan Rudinac, Björn Þór Jónsson, Dennis C. Koelma, Marcel Worring:
Interactive Multimodal Learning on 100 Million Images. 333-337
- Rao Muhammad Anwer, Fahad Shahbaz Khan, Joost van de Weijer, Jorma Laaksonen:
Combining Holistic and Part-based Deep Representations for Computational Painting Categorization. 339-342
- Vedran Vukotic, Christian Raymond, Guillaume Gravier:
Bidirectional Joint Representation Learning with Symmetrical Deep Neural Networks for Multimodal and Crossmodal Applications. 343-346
- Björn Þór Jónsson, Laurent Amsaleg, Herwig Lejsek:
SSD Technology Enables Dynamic Maintenance of Persistent High-Dimensional Indexes. 347-350
- Andrea Ferracani, Daniele Pezzatini, Marco Bertini, Alberto Del Bimbo:
Item-Based Video Recommendation: An Hybrid Approach considering Human Factors. 351-354
- Yuxiang Ye, Yijuan Lu, Hao Jiang:
Human's Scene Sketch Understanding. 355-358
- Ilias Gialampoukidis, Anastasia Moumtzidou, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris:
Retrieval of Multimedia Objects by Fusing Multiple Modalities. 359-362
- Liangliang Cao, Jenhao Hsiao, Paloma de Juan, Yuncheng Li, Bart Thomee:
Incremental Learning for Fine-Grained Image Recognition. 363-366
- Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez:
Spatially Localized Visual Dictionary Learning. 367-370
- Sravanthi Bondugula, Larry S. Davis:
Semantic Binary Codes. 371-375
- Matthias Springstein, Ralph Ewerth:
On the Effects of Spam Filtering and Incremental Learning for Web-Supervised Visual Concept Classification. 377-380
- Eric Müller, Christian Otto, Ralph Ewerth:
Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels. 381-384
- Philipp Blandfort, Tushar Karayil, Damian Borth, Andreas Dengel:
Introducing Concept And Syntax Transition Networks for Image Captioning. 385-388
Demos
- Brendan Jou, Margaret Yuying Qian, Shih-Fu Chang:
SentiCart: Cartography and Geo-contextualization for Multilingual Visual Sentiment. 389-392
- Marko Tkalcic, Markus Schedl, Cynthia C. S. Liem, Mark S. Melenhorst:
Personalized Retrieval and Browsing of Classical Music and Supporting Multimedia Material. 393-396
- Sebastiano Battiato, Giovanni Maria Farinella, Filippo Luigi Maria Milotta, Alessandro Ortis, Luca Addesso, Antonino Casella, Valeria D'Amico, Giovanni Torrisi:
The Social Picture. 397-400
- Emily Song, Joseph G. Ellis, Hongzhi Li, Shih-Fu Chang:
Watching What and How Politicians Discuss Various Topics: A Large-Scale Video Analytics UI. 401-404
- Zhiwei Fang, Jing Liu, Yuhang Wang, Yong Li, Hang Song, Jinhui Tang, Hanqing Lu:
Object-aware Deep Network for Commodity Image Retrieval. 405-408
- Baptist Vandersmissen, Lucas Sterckx, Thomas Demeester, Azarakhsh Jalalvand, Wesley De Neve, Rik Van de Walle:
An Automated End-To-End Pipeline for Fine-Grained Video Annotation using Deep Neural Networks. 409-412
- Shujun Yang, Lei Pang, Chong-Wah Ngo, Benoit Huet:
Serendipity-driven Celebrity Video Hyperlinking. 413-416
- Hongyi Liu, Brendan Jou, Tao Chen, Mercan Topkara, Nikolaos Pappas, Miriam Redi, Shih-Fu Chang:
Complura: Exploring and Leveraging a Large-scale Multilingual Visual Sentiment Ontology. 417-420
- Manos Schinas, Symeon Papadopoulos, Georgios Petkos, Yiannis Kompatsiaris, Pericles A. Mitkas:
Multimodal Event Detection and Summarization in Large Scale Image Collections. 421-422
Oral: Student Symposium
- Rajiv Ratn Shah:
Multimodal Analysis of User-Generated Content in Support of Social Media Applications. 423-426
- Hongzhi Li:
Multimodal Visual Pattern Mining with Convolutional Neural Networks. 427-430
- Yue Wu:
Facial Landmark Detection and Tracking for Facial Behavior Analysis. 431-434