© acts-momusys 1999. all rights reserved. vogue the video object generator with user environment...

15
© ACTS-MoMuSys 1999. All Rights Reserved. M oM uSys VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France Instituto Superior Técnico, Portugal Universitat Politècnica de Catalunya , Spain University of Hannover, Germany

Upload: quentin-chandler

Post on 18-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

VOGUE

The Video Object Generator with User Environment

Ecole Nationale Supérieure des Mines de Paris, France

Instituto Superior Técnico, Portugal

Universitat Politècnica de Catalunya , Spain

University of Hannover, Germany

Page 2: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Objective

To provide an interactive authoring tool to create video objects suitable for MPEG-4 encoding.

Characteristics: Ease of use No expert knowledge is needed High performance

Page 3: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Algorithms included

Spatial segmentation segmentation of still images

Temporal segmentation detection of moving

objects

Tracking following the object through the video sequence

The three algorithms are efficiently combined to provide the user with a performant tool

Page 4: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Spatial

segmentation

Temporalsegmentation

Initial mask

Fine segmentationInitial mask

Initial mask

Tracking

Correction of the resultsC

orre

ctio

n of

the

resu

lts

Page 5: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Graphical User Interface (GUI)

GUI provides interface to automatic segmentation + user interaction

GUI supports: interaction between the different algorithms general as well as algorithm specific user

interaction User interaction should be

minimal but effective

Page 6: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

The Graphical User Interface

Page 7: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

The spatial segmentation

It is based on a family of nested partitions of the input image.

A certain level of the family is reached through the fusion of two or more regions of a finer level.

The finest level of the family corresponds to the fine mosaic resulting from flooding a gradient image from all minima. The coarsest level corresponds to the partition that contains the whole image.

Page 8: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

The spatial segmentation

Original imageObject

Partition family

Region selection

Page 9: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

The spatial segmentation The whole family of partitions is created during a

single morphological flooding of the input gradient image very fast calculation.

The hierarchy of fusions is stored in the form of a tree (typically less than 1000 nodes for QCIF images) and most of the calculations issued from the interaction with the user are carried out on this tree the results of the interaction are perceived as immediate by the user.

Page 10: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Object Tracking: Approach

Projection of a homogeneous color partition.

Page 11: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Object Tracking: Partition Projection

Estimate the motion between the previous and current frames: Link motion vectors.

Motion compensate the previous texture partition: Backward block matching.

Fitting of the projected partition into a fine partition of the current image: Two steps.

Page 12: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Object Tracking: Partition Projection

In the fitting process, memory of the original label of the objects is kept (e.g.: the hat).

Small uncertainty areas usually remain.

First step: Geometry Second step: Geo. and color

Page 13: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Temporal Segmentation

Page 14: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Conclusions A coherent approach has been chosen that

combines morphological multiscale representations with motion analysis.

This allows for processing times which are compatible with user interaction on an ordinary PC.

The integration into a common user environment has lead us to analyze the degree of freedom to be left to the user.

Page 15: © ACTS-MoMuSys 1999. All Rights Reserved. VOGUE The Video Object Generator with User Environment Ecole Nationale Supérieure des Mines de Paris, France

© ACTS-MoMuSys 1999. All Rights Reserved.

MoMuSys

Future

VOGUE offers a solid ground to be developed in two directions: As part of a professional MPEG-4 authoring

tool As an editing tool targeting the emerging mass

market of numerical photography and video