TY - GEN
T1 - Semantic saliency driven camera control for personal remote collaboration
AU - Zhang, Cha
AU - Liu, Zicheng
AU - Zhang, Zhengyou
AU - Zhao, Qi
PY - 2008
Y1 - 2008
AB - This paper presents a camera combo system for personal remote collaboration applications. The system consists of two different cameras. One camera has a wide field of view, and the other can pan/tilt/zoom (PTZ) based on analysis of the images captured by the wide angle camera. Unlike traditional approaches which usually drive the PTZ camera to follow the person or his/her head, our system is capable of capturing general objects of interest in remote collaboration. For instance, when the user raises something trying to show it to the remote person, our system will automatically position the PTZ camera to zoom in at the object. At the core of our system is a semantic saliency map that overcomes many limitations of low-level saliency maps computed from preliminary image features. We demonstrate how such a semantic saliency map can be computed through contextual analysis, sign analysis and transitional analysis, and how it can be used for PTZ camera control with a novel information loss optimization based virtual director. The effectiveness of the proposed method is demonstrated with real-world sequences.
UR - http://www.scopus.com/inward/record.url?scp=58049133434&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=58049133434&partnerID=8YFLogxK
U2 - 10.1109/MMSP.2008.4665044
DO - 10.1109/MMSP.2008.4665044
M3 - Conference contribution
AN - SCOPUS:58049133434
SN - 9781424422951
T3 - Proceedings of the 2008 IEEE 10th Workshop on Multimedia Signal Processing, MMSP 2008
SP - 28
EP - 33
BT - Proceedings of the 2008 IEEE 10th Workshop on Multimedia Signal Processing, MMSP 2008
T2 - 2008 IEEE 10th Workshop on Multimedia Signal Processing, MMSP 2008
Y2 - 8 October 2008 through 10 October 2008
ER -