TY - GEN
T1 - Parsing and browsing tools for colonoscopy videos
AU - Cao, Yu
AU - Li, Dalei
AU - Tavanapong, Wallapak
AU - Oh, Junghwan
AU - Wong, Johnny
AU - De Groen, Piet C.
PY - 2004
Y1 - 2004
N2 - Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each of which is a segment of visual and audio data that corresponds to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs only a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses of the causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for developing a systematic approach to assess endoscopists' procedural skills.
KW - Browsing
KW - Image and audio analysis
KW - Video segmentation
UR - http://www.scopus.com/inward/record.url?scp=13444310520&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=13444310520&partnerID=8YFLogxK
U2 - 10.1145/1027527.1027723
DO - 10.1145/1027527.1027723
M3 - Conference contribution
AN - SCOPUS:13444310520
SN - 1581138938
SN - 9781581138931
T3 - ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
SP - 844
EP - 851
BT - ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
PB - Association for Computing Machinery (ACM)
T2 - ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
Y2 - 10 October 2004 through 16 October 2004
ER -