default search action
14th ACM Multimedia 2006: Santa Barbara, CA, USA
- Klara Nahrstedt, Matthew A. Turk, Yong Rui, Wolfgang Klas, Ketan Mayer-Patel:
Proceedings of the 14th ACM International Conference on Multimedia, Santa Barbara, CA, USA, October 23-27, 2006. ACM 2006, ISBN 1-59593-447-2
Tutorials
- Jin Li:
Peer-to-peer multimedia applications. 3-6 - Pablo César, Konstantinos Chorianopoulos:
Interactive digital television and multimedia systems. 7 - Samarjit Chakraborty:
Flexible modelling and performance debugging of real-time embedded multimedia systems. 8 - Shlomo Dubnov:
Computer audition: an introduction and research survey. 9 - Eamonn J. Keogh:
Data mining and information retrieval in time series/multimedia databases. 10 - Iole Moccagatta:
Recent developments in video compression standards and their impact on embedded platforms: from scalable to multi-view video coding. 11 - Dulce B. Ponceleon, Julian A. Cerruti:
Multimedia content protection. 12 - Marcel Worring, Cees Snoek:
Semantic indexing and retrieval of video. 13
Keynote
- Kenneth Y. Goldberg:
Sensitivity analysis: unexpected outcomes in art and engineering. 15
Best papers session
- Peter Knees, Markus Schedl, Tim Pohle, Gerhard Widmer:
An innovative three-dimensional user interface for exploring music collections enriched. 17-24 - Jun-Cheng Chen, Wei-Ta Chu, Jin-Hau Kuo, Chung-Yi Weng, Ja-Ling Wu:
Tiling slideshow. 25-34 - Winston H. Hsu, Lyndon S. Kennedy, Shih-Fu Chang:
Video search reranking via information bottleneck principle. 35-44
Short papers session 1
- Huan Wang, Shuicheng Yan, Thomas S. Huang, Xiaoou Tang:
Maximum unfolded embedding: formulation, solution, and application for image clustering. 45-48 - Joseph Thomas-Kerr, Ian S. Burnett, Christian H. Ritz:
An efficient approach to generic multimedia adaptation. 49-52 - Ying Li, Youngja Park, Chitra Dorai:
Atomic topical segments detection for instructional videos. 53-56 - Hangzai Luo, Jianping Fan:
Building concept ontology for medical video annotation. 57-60 - Lee M. Seversky, Lijun Yin:
Real-time automatic 3D scene generation from natural language voice and text descriptions. 61-64 - Zhihong Zeng, Yuxiao Hu, Ming Liu, Yun Fu, Thomas S. Huang:
Training combination strategy of multi-stream fused hidden Markov model for audio-visual affect recognition. 65-68 - Andrew S. Gordon:
Fourth frame forums: interactive comics for collaborative learning. 69-72 - Na Li, Neema Moraveji, Hiroaki Kimura, Eyal Ofek:
Improving the experience of controlling avatars in camera-based games using physical input. 73-76 - Qing-Fang Zheng, Wei-Qiang Wang, Wen Gao:
Effective and efficient object-based image retrieval using visual phrases. 77-80 - Yi-Hsuan Yang, Chia Chu Liu, Homer H. Chen:
Music emotion classification: a fuzzy approach. 81-84 - Zhenguo Li, Jianzhuang Liu, Xiaoou Tang:
Shape from regularities for interactive 3D reconstruction of piecewise planar objects from single images. 85-88 - Jinhui Tang, Yan Song, Xian-Sheng Hua, Tao Mei, Xiuqing Wu:
To construct optimal training set for video annotation. 89-92 - Ho-Jae Lee, Jeho Nam:
Low complexity controllable scrambler/descrambler for H.264/AVC in compressed domain. 93-96 - Yuli Gao, Jianping Fan:
Automatic function selection for large scale salient object detection. 97-100 - Ismo Rakkolainen:
Tracking users through a projection screen. 101-104 - Liangliang Cao, Jianzhuang Liu, Xiaoou Tang:
3D object retrieval using 2D line drawing and graph based relevance reedback. 105-108 - Huan Wang, Song Liu, Liang-Tien Chia:
Does ontology help in image retrieval?: a comparison between keyword, text ontology and multi-modality ontology approaches. 109-112 - Shijian Lu, Chew Lim Tan:
Automatic document orientation detection and categorization through document vectorization. 113-116 - Mario Baldi, Juan Carlos De Martin, Enrico Masala, Andrea Vesco:
Distortion-aware video communication with pipeline forwarding. 117-120 - Jiqi Zhang, Hau-San Wong, Zhiwen Yu:
3D model metrieval based on volumetric extended gaussian image and hierarchical self organizing map. 121-124 - Mika Rautiainen, Tapio Seppänen, Timo Ojala:
On the significance of cluster-temporal browsing for generic video retrieval: a statistical analysis. 125-128 - Qifeng Liu, Cheolkon Jung, Youngsu Moon:
Text segmentation based on stroke filter. 129-132 - Timothy K. Shih, Nick C. Tang, Wei-Sung Yeh, Ta-Jen Chen, Wonjun Lee:
Video inpainting and implant via diversified temporal continuations. 133-136 - Prarthana Shrestha, Hans Weda, Mauro Barbieri, Dragan Sekulovski:
Synchronization of multiple video recordings based on still camera flashes. 137-140 - Jay Summet, Matthew Flagg, James M. Rehg, Gregory D. Abowd, Neil Weston:
GVU-PROCAMS: enabling novel projected interfaces. 141-144 - Gustavo B. Borba, Humberto R. Gamba, Oge Marques, Liam M. Mayron:
An unsupervised method for clustering images based on their salient regions of interest. 145-148 - Tsz Kin Tsui, Xiao-Ping (Steven) Zhang, Dimitrios Androutsos:
Quaternion image watermarking using the spatio-chromatic fourier coefficients analysis. 149-152 - Min Qin, Roger Zimmermann:
Supporting guaranteed continuous media streaming in mobile ad-hoc networks with link availability prediction. 153-156 - Yelizaveta Marchenko, Tat-Seng Chua, Ramesh C. Jain:
Transductive inference using multiple experts for brushwork annotation in paintings domain. 157-160 - Won-gyum Kim, Yong-seok Seo, Young-Ho Suh:
Hybrid watermarking for improving detector performance. 161-164 - Joonyoung Jung, Ohyung Kwon, Sooin Lee:
Design and implementation of a multi-stream cableCARD with a high-speed DVB-common descrambler. 165-168 - Shi-Yong Neo, Yantao Zheng, Tat-Seng Chua, Qi Tian:
News video search with fuzzy event clustering using high-level features. 169-172 - En Cheng, Feng Jing, Lei Zhang, Hai Jin:
Scalable relevance feedback using click-through data for web image retrieval. 173-176
Arts short papers poster session 1
- Steve Mann, James Fung, Raymond Chun Hing Lo:
Cyborglogging with camera phones: steps toward equiveillance. 177-180 - Steve Mann:
The andantephone: a musical instrument that you play by simply walking. 181-184 - Ismo Rakkolainen, A. Tanju Erdem, Çigdem Eroglu Erdem, Mehmet K. Özkan, Markku Laitinen:
Interactive "immaterial" screen for performing arts. 185-188 - Paolo Bottoni, Anna Labella, Stefano Faralli, Mario Pierro, Claudio Scozzafava:
Interactive composition, performance and music generation through iterative structures. 189-192 - Yu-Chuan Tseng, Chia-Hsiang Lee:
Flow: an interactive AJAX-based internet information requesting system. 193-196 - Ann Judith Morrison, Peta Mitchell, Ralf Mühlberger:
Talk2Me: the art of augmenting conversations. 197-200
Content session 1: multi-modal analysis
- Ling-Yu Duan, Jinqiao Wang, Yantao Zheng, Jesse S. Jin, Hanqing Lu, Changsheng Xu:
Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis. 201-210 - Qiang Zhu, Mei-Chen Yeh, Kwang-Ting Cheng:
Multimodal fusion using learned text concepts for image categorization. 211-220 - Changsheng Xu, Jinjun Wang, Kongwah Wan, Yiqun Li, Lingyu Duan:
Live sports event detection based on broadcast video and web-casting text. 221-230
Applications session 1: media presentation
- Berna Erol, Kathrin Berkner, Siddharth Joshi:
Multimedia thumbnails for documents. 231-240 - Feng Liu, Michael Gleicher:
Video retargeting: automating pan and scan. 241-250 - Chao Wang, Qiong Yang, Mo Chen, Xiaoou Tang, Zhongfu Ye:
Progressive cut. 251-260
Arts session 1: installations and media archaeology
- Sara Owsley, Kristian J. Hammond, David A. Shamma, Sanjay Sood:
Buzz: telling compelling stories. 261-268 - Vincenzo Lombardo, Andrea Valle, Fabrizio Nunnari, Francesco Giordana, Andrea Arghinenti:
Archeology of multimedia. 269-278 - Petra Gemeinboeck, Atau Tanaka, Andy Dong:
Instant archaeologies: digital lenses to probe and to perforate the urban fabric. 279-286
Content session 2: machine learning in multimedia
- Hui Zhang, Rouhollah Rahmani, Sharath R. Cholleti, Sally A. Goldman:
Local image representations using pruned salient points with applications to CBIR. 287-296 - Jie Yu, Qi Tian:
Learning image manifolds by semantic subspace projection. 297-306 - Xin Geng, Zhi-Hua Zhou, Yu Zhang, Gang Li, Honghua Dai:
Learning from facial aging patterns for automatic age estimation. 307-316 - Navneet Panda, Edward Y. Chang:
Efficient top-k hyperplane query processing for multimedia information retrieval. 317-326
Systems session 1: streaming
- Asfandyar Qureshi, Jennifer N. Carlisle, John V. Guttag:
Tavarua: video streaming with WWAN striping. 327-336 - Liqi Shi, Phillipa Sessini, Anirban Mahanti, Zongpeng Li, Derek L. Eager:
Scalable streaming for heterogeneous clients. 337-346 - Bashar Qudah, Nabil J. Sarhan:
Towards scalable delivery of video streams to heterogeneous receivers. 347-356 - David Gotz:
Scalable and adaptive streaming for non-linear media. 357-366
Applications session 2: searching media I
- Lei Zhang, Le Chen, Feng Jing, Kefeng Deng, Wei-Ying Ma:
EnjoyPhoto: a vertical image search engine for enjoying high-quality photos. 367-376 - Feng Jing, Changhu Wang, Yuhuan Yao, Kefeng Deng, Lei Zhang, Wei-Ying Ma:
IGroup: web image search results clustering. 377-384 - Alexander G. Hauptmann, Wei-Hao Lin, Rong Yan, Jun Yang, Ming-Yu Chen:
Extreme video retrieval: joint maximization of human and computer performance. 385-394
Applications session 3: entertainment & home environments CWI
- David J. Chatting, Josie S. Galpin, Judith S. Donath:
Presence and portrayal: video for casual home dialogues. 395-401 - Beomjoo Seo, Roger Zimmermann:
Edge indexing in a grid for highly dynamic virtual environments. 402-411 - Bin Cui, Jialie Shen, Gao Cong, Heng Tao Shen, Cui Yu:
Exploring composite acoustic features for efficient music similarity query. 412-420
Content session 3: semantic concepts
- Cees Snoek, Marcel Worring, Jan C. van Gemert, Jan-Mark Geusebroek, Arnold W. M. Smeulders:
The challenge problem for automated detection of 101 semantic concepts in multimedia. 421-430 - Guangyu Zhu, Changsheng Xu, Qingming Huang, Wen Gao, Liyuan Xing:
Player action recognition in broadcast tennis video with applications to semantic analysis of sports game. 431-440 - Jinhui Yuan, Jianmin Li, Bo Zhang:
Learning concepts from large scale imbalanced data sets using support cluster machines. 441-450
Arts session 2: interactive spaces and performance
- Andrew M. Webb, Andruid Kerne, Eunyee Koh, Pranesh Joshi, YoungJoo Park, Ross Graeber:
Choreographic buttons: promoting social interaction through human movement and clear affordances. 451-460 - Quoc Nguyen, Scott Novakowski, Jeffrey E. Boyd, Christian Jacob, Gerald Hushlak:
Motion swarms: video interaction for art in complex environments. 461-469 - Jodi James, Todd Ingalls, Gang Qian, Loren Olson, Daniel Whiteley, Siew Wong, Thanassis Rikakis:
Movement-based interactive dance performance. 470-480
Demo session 1
- Chabane Djeraba, Stanislas Lew, Dan A. Simovici, Sylvain Mongy, Nacim Ihaddadene:
Eye/gaze tracking in web, image and video documents. 481-482 - Ole-Ivar Holthe, Leif Arne Rønningen:
Geelix.com: sharing gaming experiences. 483-484 - Benjamin N. Lee, WenYen Chen, Edward Y. Chang:
Fotofiti: web service for photo management. 485-486 - Berna Erol, Kathrin Berkner, Siddharth Joshi:
Multimedia thumbnails for documents: implementation and demonstration. 487-488 - Herwig Lejsek, Friðrik Heiðar Ásmundsson, Björn Þór Jónsson, Laurent Amsaleg:
Blazingly fast image copyright enforcement. 489-490 - Chitra Dorai, Robert G. Farrell, Amy Katriel, Galina Kofman, Ying Li, Youngja Park:
MAGICAL demonstration: system for automated metadata generation for instructional content. 491-492 - Daniel Heesch, Alexei Yavlinsky, Stefan M. Rüger:
NNk networks and automated annotation for browsing large image collections from the world wide web. 493-494 - Dulce B. Ponceleon, Stefan Nusser, Vladimir Zbarsky, Julian A. Cerruti, Sigfredo I. Nin:
Enabling secure distribution of digital media to SD-cards. 495-496 - Feng Jing, Changhu Wang, Yuhuan Yao, Kefeng Deng, Lei Zhang, Wei-Ying Ma:
IGroup: a web image search engine with semantic clustering of search results. 497-498 - Utz Westermann, Srikanth Agaram, Bo Gong, Ramesh C. Jain:
Event-centric multimedia data management for reconnaissance mission analysis and reporting. 499-500 - Yinpeng Chen, Weiwei Xu, Richard Isaac Wallis, Hari Sundaram, Thanassis Rikakis, Todd Ingalls, Loren Olson, Jiping He:
A real-time, multimodal biofeedback system for stroke patient rehabilitation. 501-502 - Jun Yang, Alexander G. Hauptmann:
3WNews: who, where, and when in news video. 503-504 - Lakis Christodoulou, Liam M. Mayron, Hari Kalva, Oge Marques, Borko Furht:
3D TV using MPEG-2 and H.264 view coding and autostereoscopic displays. 505-506 - Leslie S. Liu, Roger Zimmermann, Baoxuan Xiao, Jon Christen:
PartyPeer: a P2P massively multiplayer online game. 507-508
Arts session 3: tools for creativity and art analysis
- Corey Manders, Steve Mann:
Handheld electronic camera flash lamp as a tangible user-interface for creating expressive visual art works. 509-518 - Steve Mann, Ryan E. Janzen, Mark A. Post:
Hydraulophone design considerations: absement, displacement, and velocity-sensitive music keyboard in which each key is a water jet. 519-528 - Yelizaveta Marchenko, Tat-Seng Chua, Ramesh C. Jain:
Semi-supervised annotation of brushwork in paintings domain using serial combinations of multiple experts. 529-538
Systems session 2: distributed systems
- Tara Small, Ben Liang, Baochun Li:
Scaling laws and tradeoffs in peer-to-peer live multimedia streaming. 539-548 - Gisik Kwon, K. Selçuk Candan:
DANS: decentralized, autonomous, and networkwide service delivery and multimedia workflow processing. 549-558 - Xiaohui Gu, Zhen Wen, Ching-Yung Lin, Philip S. Yu:
ViCo: an adaptive distributed video correlation system. 559-568
Applications session 4: searching media II
- Jialie Shen, John Shepherd:
Efficient benchmarking of content-based image retrieval via resampling. 569-578 - Stewart Greenhill, Svetha Venkatesh:
Virtual observers in a mobile surveillance system. 579-588 - Herwig Lejsek, Friðrik Heiðar Ásmundsson, Björn Þór Jónsson, Laurent Amsaleg:
Scalability of local image descriptors: a comparative study. 589-598
Short papers poster session 2
- Feng Jing, Lei Zhang, Wei-Ying Ma:
VirtualTour: an online travel assistant based on high quality images. 599-602 - Yuxin Peng, Chong-Wah Ngo, Cuihua Fang, Xiaoou Chen, Jianguo Xiao:
Audio similarity measure by graph modeling and matching. 603-606 - Xirong Li, Le Chen, Lei Zhang, Fuzong Lin, Wei-Ying Ma:
Image annotation by large-scale content-based image retrieval. 607-610 - Jeannie Su Ann Lee, Nikil Jayant:
Mixed-initiative multimedia for mobile devices: a voting-based user interface for news videos. 611-614 - Morgan Ames, Lilia Manguy:
PhotoArcs: Ludic tools for sharing photographs. 615-618 - Xinguo Yu, Xin Yan, Tran Thi Phuong Chi, Loong Fah Cheong:
Inserting 3D projected virtual content into broadcast tennis video. 619-622 - Xun Yuan, Xian-Sheng Hua, Meng Wang, Xiuqing Wu:
Manifold-ranking based video concept detection on large database and feature pool. 623-626 - Wei Lai, Xian-Sheng Hua, Wei-Ying Ma:
Towards content-based relevance ranking for video search. 627-630 - Bogdan Ionescu, Patrick Lambert, Didier Coquin, Laurent Ott, Vasile Buzuloiu:
Animation movies trailer computation. 631-634 - Junil Kim, Yeonjeong Jeong, Kisong Yoon, Jaecheol Ryou:
A trustworthy end-to-end key management scheme for digital rights management. 635-638 - Yun Li, Chunjing Xu, Jianzhuang Liu, Xiaoou Tang:
Detecting irregularity in videos using kernel estimation and KD trees. 639-642 - Pengpeng Ni, Damir Isovic, Gerhard Fohler:
User-friendly H.264/AVC for remote browsing. 643-646 - Changhu Wang, Feng Jing, Lei Zhang, HongJiang Zhang:
Image annotation refinement using random walk with restarts. 647-650 - Dick C. A. Bulterman, Pablo César, A. J. Jansen:
An architecture for viewer-side enrichment of TV content. 651-654 - Kihwan Kim, Irfan A. Essa, Gregory D. Abowd:
Interactive mosaic generation for video navigation. 655-658 - Denny Iskandar, Ye Wang, Min-Yen Kan, Haizhou Li:
Syllabic level automatic synchronization of music signals and text lyrics. 659-662 - Marco Bertini, Alberto Del Bimbo, Walter Nunziati:
Automatic detection of player's identity in soccer videos using faces and text cues. 663-666 - Jinwuk Seok, Jeong-Woo Lee, Chang-Sik Cho:
The differential structure of sub pixels interpolated from integer pixels using n-tab FIR filters for high definition H.264 video encoding. 667-670 - Patrick Schmitz:
Leveraging community annotations for image adaptation to small presentation formats. 671-674 - Wolfgang Hürst:
Interactive audio-visual video browsing. 675-678 - Marco Bertini, Alberto Del Bimbo, Carlo Torniai:
Automatic annotation and semantic retrieval of video sequences using multimedia ontologies. 679-682 - Junfa Liu, Yiqiang Chen, Wen Gao:
Mapping learning in eigenspace for harmonious caricature generation. 683-686 - J. J. Nixdorf, David Gerhard:
RITZ: a RealTime interactive tool for spatialization. 687-690 - Azzedine Boukerche, Richard Werner Nelem Pazzi:
Remote rendering and streaming of progressive panoramas for mobile devices. 691-694 - Jan C. van Gemert, Cees Snoek, Cor J. Veenman, Arnold W. M. Smeulders:
The influence of cross-validation on video classification performance. 695-698 - Benjamin N. Lee, WenYen Chen, Edward Y. Chang:
A scalable service for photo annotation, sharing, and search. 699-702 - Amin Shah-Hosseini, Gerald M. Knapp:
Semantic image retrieval based on probabilistic latent semantic analysis. 703-706 - Kai Song, Yonghong Tian, Wen Gao, Tiejun Huang:
Diversifying the image retrieval results. 707-710 - Brett Adams, Stewart Greenhill, Svetha Venkatesh:
Browsing personal media archives with spatial context using panoramas. 711-714 - Humera Noor, Shahid H. Mirza, Yaser Sheikh, Amit Jain, Mubarak Shah:
Model generation for video-based object recognition. 715-718 - Kanav Kahol, Narayanan Chatapuram Krishnan, Vineeth Nallure Balasubramanian, Sethuraman Panchanathan, Marshall L. Smith, John Ferrara:
Measuring movement expertise in surgical tasks. 719-722 - Zhenyu Yang, Bin Yu, Wanmin Wu, Ross Diankov, Ruzena Bajcsy:
Collaborative dancing in tele-immersive environment. 723-726 - Hendrik Knoche, John D. McCarthy, Martina Angela Sasse:
Reading the fine print: the effect of text legibility on perceived video quality in mobile tv. 727-730
Arts short poster session 2
- Karl D. D. Willis:
User authorship and creativity within interactivity. 731-735 - Nicholas A. Knouf:
Variations 10b: a digital realization of cage's variations II. 736-739 - Noriyuki Fujimura, Satoshi Fujiyoshi, Tom Hope, Takuichi Nishimura:
Tabletop community: artwork for visualization of social interactions using a bipartite network. 740-743 - Jason Lewis, Yannick Assogba:
Taking sides: dynamic text and hip-hop performance. 744-747 - Vidyarani M. Dyaberi, Hari Sundaram, Thanassis Rikakis, Jodi James:
The computational extraction of temporal formal structures in the interactive dance work '22'. 748-751
Multimedia and web 2.0 - hype, challenge, synergy
Applications session 5: multimedia applications potpourri
- Koichi Kamijo, Noboru Kamijo, Masaharu Sakamoto:
Electronic clipping system with invisible barcodes. 753-762 - Yinpeng Chen, He Huang, Weiwei Xu, Richard Isaac Wallis, Hari Sundaram, Thanassis Rikakis, Todd Ingalls, Loren Olson, Jiping He:
The design of a real-time, multimodal biofeedback system for stroke patient rehabilitation. 763-772 - Mei Han, Wei Xu, Yihong Gong:
Video object segmentation by motion-based sequential feature clustering. 773-782
Demo session 2
- Hangzai Luo, Jianping Fan, Yuli Gao, William Ribarsky, Shin'ichi Satoh:
Large-scale news video retrieval via visualization. 783-784 - Marcel Worring, Cees G. M. Snoek, Bouke Huurnink, Jan C. van Gemert, Dennis C. Koelma, Ork de Rooij:
The mediamill large.lexicon concept suggestion engine. 785-786 - Marco Bertini, Alberto Del Bimbo, Carlo Torniai, Rita Cucchiara, Costantino Grana:
MOM: multimedia ontology manager. A framework for automatic annotation and semantic retrieval of video sequences. 787-788 - Masanori Sano, Yoshihiko Kawai, Hideki Sumiyoshi, Nobuyuki Yagi:
Metadata production framework and metadata editor. 789-790 - Qiong Liu, Paul McEvoy, Cheng-Jia Lai:
Mobile camera supported document redirection. 791-792 - Costantino Grana, Roberto Vezzani, Daniele Bulgarelli, Giovanni Gualdi, Rita Cucchiara, Marco Bertini, Carlo Torniai, Alberto Del Bimbo:
PEANO: pictorial enriched annotation of video. 793-794 - Ross Graeber, Andruid Kerne, M. Kathryn Henderson:
ZooMICSS: a zoomable map image collection sensemaking system (the Katrina Rita context). 795-796 - Patrick Schmitz, Peter L. Shafton, Ryan Shaw, Samantha Tripodi, Brian Williams, Jeannie Yang:
International remix: video editing for the web. 797-798 - S. H. Srinivasan:
Speakr: auditory skimming and scrolling. 799-800 - Subhajit Sanyal, S. H. Srinivasan:
3dB: a system for geometric tagging. 801-802 - WenYen Chen, Benjamin N. Lee, Edward Y. Chang:
Fotowiki: distributed map enhancement service. 803-804 - Wladimir Palant, Carsten Griwodz, Pål Halvorsen:
GLS: simulator for online multi-player games. 805-806 - Wolfgang Hürst, Tobias Lauer, Robert Kaschuba:
Interfaces for interactive audio-visual media browsing. 807-808 - Xin Yan, Xinguo Yu, Tran Thi Phuong Chi:
A system for 3D projected virtual content insertion into broadcast tennis video. 809-810 - Yuli Gao, Hangzai Luo, Jianping Fan:
Searching and browsing large scale image database using keywords and ontology. 811-812 - Sama'a Al Hashimi, Gordon Davies:
Vocal telekinesis: physical control of inanimate objects with minimal paralinguistic voice input. 813-814
Content session 4: event and copy detection
- Yun Zhai, Mubarak Shah:
Visual attention detection in video sequences using spatiotemporal cues. 815-824 - Lie Lu, Alan Hanjalic:
Towards optimal audio "keywords" detection for audio content analysis and discovery. 825-834 - Julien Law-To, Olivier Buisson, Valérie Gouet-Brunet, Nozha Boujemaa:
Robust voting algorithm based on labels of behavior for video copy detection. 835-844 - Chong-Wah Ngo, Wanlei Zhao, Yu-Gang Jiang:
Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation. 845-854
Brave new topics session 1 - human-centered multimedia
- Alejandro Jaimes, Nicu Sebe, Daniel Gatica-Perez:
Human-centered computing: a multimedia perspective. 855-864 - Alex Pentland, Jonathan Gips, Wen Dong, Will Stoltzman:
Human computing for interactive digital media. 865-870 - Sharon L. Oviatt:
Human-centered design meets cognitive load theory: designing interfaces that help people think. 871-880
Doctoral symposium session
- Zhenyu Yang:
A multi-stream adaptation framework for tele-immersion. 881-883 - Hangzai Luo, Jianping Fan:
Large-scale video retrieval via semantic classification. 884-886 - Pavel Korshunov:
Rate-accuracy tradeoff in automated, distributed video surveillance systems. 887-889
Keynote
- Bradley Horowitz:
Implicit participation. 890
Content session 5: image annotation
- Wen Wu, Jie Yang:
SmartLabel: an object labeling tool using iterated harmonic energy minimization. 891-900 - Yuli Gao, Jianping Fan, Xiangyang Xue, Ramesh C. Jain:
Automatic image annotation by incorporating feature hierarchy and boosting to scale up SVM classifiers. 901-910 - Jia Li, James Ze Wang:
Real-time computerized annotation of pictures. 911-920
Systems session 3: assorted topics
- Min Xu, Jiaming Li, Liang-Tien Chia, Yiqun Hu, Bu-Sung Lee, Deepu Rajan, Jesse S. Jin:
Event on demand with MPEG-21 video adaptation system. 921-930 - Gerardo Fernández, Pedro Cuenca, Luis Orozco-Barbosa, Hari Kalva:
Very low complexity MPEG-2 to H.264 transcoding using machine learning. 931-940 - Alexander Eichhorn:
Modelling dependency in multimedia streams. 941-950
Open source and video program session
- Xavier Amatriain, Pau Arumí, David García:
CLAM: a framework for efficient and rapid development of cross-platform audio applications. 951-954 - Jun-Cheng Chen, Wei-Ta Chu, Jin-Hau Kuo, Chung-Yi Weng, Ja-Ling Wu:
Audiovisual slideshow: present your journey by photos. 955-956 - Stephan Kopf, Fleming Lampi, Thomas King, Wolfgang Effelsberg:
Automatic scaling and cropping of videos for devices with limited screen resolution. 957-958 - Rick Companje, Nico M. van Dijk, Hanco Hogenbirk, Danica Mast:
Globe4D: time-traveling with an interactive four-dimensional globe. 959-960 - Rachel Heck, Michael N. Wallick, Michael Gleicher:
Virtual videography. 961-962 - Stephan Kopf, Thomas King, Fleming Lampi, Wolfgang Effelsberg:
Video color adaptation for mobile devices. 963-964 - Timothy K. Shih, Nick C. Tang, Wei-Sung Yeh, Ta-Jen Chen:
Video inpainting and implant via diversified temporal continuations (video demonstration). 965-966
Content session 6: multimedia exploration
- Meng Wang, Yan Song, Xun Yuan, HongJiang Zhang, Xian-Sheng Hua, Shipeng Li:
Automatic video annotation by semi-supervised learning with kernel density estimation. 967-976 - Ritendra Datta, Weina Ge, Jia Li, James Ze Wang:
Toward bridging the annotation-retrieval gap in image search by a generative modeling approach. 977-986 - Brett Adams, Dinh Q. Phung, Svetha Venkatesh:
Extraction of social context and application to personal multimedia exploration. 987-996
Brave new topics session 2 - multimedia signal processing and systems in healthcare and life science
- Shahram Ebadollahi, Anni Coden, Michael A. Tanenblatt, Shih-Fu Chang, Tanveer Fathima Syeda-Mahmood, Arnon Amir:
Concept-based electronic health records: opportunities and challenges. 997-1006 - Peter Andrews, Haibin Wang, Dan Valente, Jihène Serkhane, Partha P. Mitra, Sigal Saar, Ofer Tchernichovski, Ilan Golani:
Multimedia signal processing for behavioral quantification in neuroscience. 1007-1016 - Nevenka Dimitrova, Yee Him Cheung, Michael Q. Zhang:
Analysis and visualization of DNA spectrograms: open possibilities for the genome research. 1017-1024
Interactive arts program exhibition session
- Márton Fernezelyi, Zoltán Szegedy Maszák, Róbert Langh:
Smalltalk: interactive installation. 1025-1026 - Luc Courchesne, Guillaume Langlois, Luc Martinez:
Where are you?: an immersive experience in the panoscope 360degree. 1027-1028 - Jed Berk, Nikhil Mitter:
Autonomous light air vessels (ALAVs). 1029-1030 - Eunsu Kang:
Imago. 1031-1032 - Mark David Hosale, John Thompson:
DEFENDEX-ESPGX. 1033-1034 - Noriyuki Fujimura, Satoshi Fujiyoshi, Tom Hope, Takuichi Nishimura:
Tabletop community: visualization of real world oriented social network. 1035-1036 - Annie On Ni Wan, Hiroki Nishino, Pamela Pietro:
Tre marie. 1037-1038 - Jee Hyun Oh:
GORI.node garden. 1039-1040 - Sardón Mariano:
Books of sand. 1041-1042 - Jean-Marie Dallet, Christian Laroche, Frédéric Curien:
SLIDERS: a collective experience of interactive cinema. 1043-1044 - Takashi Kawashima, Togo Kida, Yoshimasa Niwa:
Takashi's seasons. 1045-1046 - Eitan Mendelowitz:
Drafting poems: inverted potentialities. 1047-1048
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.