Indexation et recherche de contenus par objet visuel

Abstract : With the ever increasing amount of available video content on video repositories the issue of content-based video objects retrieval is growing in difficulty and becomes a mandatory feature for video search engines.The present thesis advances a user defined video object retrieval framework and brings two major contributions. The first contribution is a methodological framework for user selected video object instances retrieval, entitled DOOR (Dynamic Object Oriented Retrieval), while the second one concerns the support offered for video retrieval, namely the video navigation and retrieval system and interface and its underlying architecture.Under the DOOR framework, the user defined video object comports a hybrid representation obtained by over-segmenting the frames, constructing region adjacency graphs and aggregating interest points. The identification of object instances across multiple videos is formulated as an energy optimization problem approximating an NP-hard problem. Object candidates are sub-graphs that yield an optimum energy towards the user defined query. In order to obtain the optimum energy four optimization strategies are proposed: Greedy, Relaxed Greedy, Simulated Annealing and GraphCut. The region-based object representation is further improved by the aggregation of interest points into a hybrid object representation. The similarity between an object and a frame is achieved with the help of a spectral matching technique integrating both colorimetric and interest points descriptors.The DOOR framework is suitable to large scale video archives through the use of a Bag-of-Words representation enriched with a query definition and expansion mechanism based on a multi-modal, text-image-video principle.The performances of the proposed techniques are evaluated on multiple TRECVID video datasets prooving their effectiveness.The second contribution is related to the user support for video retrieval - video navigation, video retrieval, graphical interface - and consists in the OVIDIUS (On-line VIDeo Indexing Universal System) on-line video browsing and retrieval platform. The OVIDIUS platform features hierarchical video navigation functionalities that exploit the MPEG-7 approach for structural description of video content. The DOOR framework is integrated in the OVIDIUS platform, ensuring the search functionalities of the system. The major advantage of the proposed system concerns its modular architecture which makes it possible to deploy the system on various terminals (both fixed and mobile), independently of the exploitation systems involved. The choice of the technologies employed for each composing module of the platform is argumented in comparison with other technological options. Finally different scenarios and use cases for the OVIDIUS platform are presented.
Document type :
Theses
Complete list of metadatas

Cited literature [222 references]  Display  Hide  Download

https://pastel.archives-ouvertes.fr/pastel-00873966
Contributor : Abes Star <>
Submitted on : Wednesday, October 16, 2013 - 6:03:58 PM
Last modification on : Monday, November 12, 2018 - 10:54:46 AM
Long-term archiving on : Friday, January 17, 2014 - 4:42:44 AM

File

2012ENMP0096.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : pastel-00873966, version 1

Citation

Andrei Bursuc. Indexation et recherche de contenus par objet visuel. Autre [cs.OH]. Ecole Nationale Supérieure des Mines de Paris, 2012. Français. ⟨NNT : 2012ENMP0096⟩. ⟨pastel-00873966⟩

Share

Metrics

Record views

929

Files downloads

1402