TY - JOUR
T1 - JumpCut
T2 - Non-successive mask transfer and interpolation for video cutout
AU - Fan, Qingnan
AU - Zhong, Fan
AU - Lischinski, Dani
AU - Cohen-Or, Daniel
AU - Chen, Baoquan
N1 - Publisher Copyright:
© Copyright 2015 ACM.
PY - 2015/11
Y1 - 2015/11
N2 - We introduce JumpCut, a new mask transfer and interpolation method for interactive video cutout. Given a source frame for which a foreground mask is already available, we compute an estimate of the foreground mask at another, typically non-successive, target frame. Observing that the background and foreground regions typically exhibit different motions, we leverage these differences by computing two separate nearest-neighbor fields (split-NNF) from the target to the source frame. These NNFs are then used to jointly predict a coherent labeling of the pixels in the target frame. The same split-NNF is also used to aid a novel edge classifier in detecting silhouette edges (S-edges) that separate the foreground from the background. A modified level set method is then applied to produce a clean mask, based on the pixel labels and the S-edges computed by the previous two steps. The resulting mask transfer method may also be used for coherently interpolating the foreground masks between two distant source frames. Our results demonstrate that the proposed method is significantly more accurate than the existing state-of-the-art on a wide variety of video sequences. Thus, it reduces the required amount of user effort, and provides a basis for an effective interactive video object cutout tool.
AB - We introduce JumpCut, a new mask transfer and interpolation method for interactive video cutout. Given a source frame for which a foreground mask is already available, we compute an estimate of the foreground mask at another, typically non-successive, target frame. Observing that the background and foreground regions typically exhibit different motions, we leverage these differences by computing two separate nearest-neighbor fields (split-NNF) from the target to the source frame. These NNFs are then used to jointly predict a coherent labeling of the pixels in the target frame. The same split-NNF is also used to aid a novel edge classifier in detecting silhouette edges (S-edges) that separate the foreground from the background. A modified level set method is then applied to produce a clean mask, based on the pixel labels and the S-edges computed by the previous two steps. The resulting mask transfer method may also be used for coherently interpolating the foreground masks between two distant source frames. Our results demonstrate that the proposed method is significantly more accurate than the existing state-of-the-art on a wide variety of video sequences. Thus, it reduces the required amount of user effort, and provides a basis for an effective interactive video object cutout tool.
KW - Foreground extraction
KW - Object cutout
KW - Video segmentation
UR - http://www.scopus.com/inward/record.url?scp=84996533970&partnerID=8YFLogxK
U2 - 10.1145/2816795.2818105
DO - 10.1145/2816795.2818105
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:84996533970
SN - 0730-0301
VL - 34
JO - ACM Transactions on Graphics
JF - ACM Transactions on Graphics
IS - 6
M1 - 195
ER -