Q. Li, U. Niaz, and B. Merialdo, An improved algorithm on Viola-Jones object detector, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI), 2012.
DOI : 10.1109/CBMI.2012.6269796

U. Niaz and B. Merialdo, Entropy Based Supervised Merging for Visual Categorization
DOI : 10.1007/978-3-642-33140-4_37

U. Niaz and B. Merialdo, Fusion methods for multimodal indexing of web data, WIAMIS 2013, 2013.

U. Niaz and B. Merialdo, Exploring intra-BOW statistics for improving visual categorization, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), 2013.
DOI : 10.1109/WIAMIS.2013.6616130

U. Niaz and B. Merialdo, Improving video concept detection using uploader model, 2013 IEEE International Conference on Multimedia and Expo (ICME), 2013.
DOI : 10.1109/ICME.2013.6607538

U. Niaz and B. Merialdo, Leveraging from group classification for video concept detection, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), 2013.
DOI : 10.1109/CBMI.2013.6576577

U. Niaz and B. Merialdo, Selective Multi-cotraining for Video Concept Detection, Proceedings of International Conference on Multimedia Retrieval, ICMR '14, 2014.
DOI : 10.1145/2578726.2578789

B. Merialdo and U. , Niaz Uploader models for video concept detection, CBMI 2014, 2014.
DOI : 10.1109/cbmi.2014.6849847

U. Niaz and B. Merialdo, Improving video concept detection through label space partitioning, 2014 IEEE International Conference on Multimedia and Expo (ICME), 2014.
DOI : 10.1109/ICME.2014.6890258

B. Figure, 5: (a) Un classificateur discriminatif distinguant les exemples positifs et négatifs

Y. G. Jiang and C. W. Ngo, Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval, Computer Vision and Image Understanding, vol.113, issue.3, pp.405-414, 2009.
DOI : 10.1016/j.cviu.2008.10.002

C. Smith, 350m new photos each day, 2013.

A. Hanjalic, R. L. Lagendijk, and J. Biemond, A new method for key frame based video content representation. Image Databases and Multi-Media Search, 1998.
DOI : 10.1142/9789812797988_0009

M. S. Lew, N. Sebe, C. Djeraba, and R. Jain, Content-based multimedia information retrieval, ACM Transactions on Multimedia Computing, Communications, and Applications, vol.2, issue.1
DOI : 10.1145/1126004.1126005

L. Zhao, W. Qi, S. Li, S. Q. Yang, and H. J. Zhang, Key-frame extraction and shot retrieval using nearest feature line (NFL), Proceedings of the 2000 ACM workshops on Multimedia , MULTIMEDIA '00, pp.217-220, 2000.
DOI : 10.1145/357744.357942

Y. L. Boureau, F. Bach, Y. Lecun, and J. Ponce, Learning mid-level features for recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2559-2566, 2010.
DOI : 10.1109/CVPR.2010.5539963

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV, 2004.

Y. G. Jiang, C. W. Ngo, and J. Yang, Towards optimal bag-of-features for object categorization and semantic video retrieval, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, pp.494-501, 2007.
DOI : 10.1145/1282280.1282352

J. Philbin, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.71.7388

H. Jégou, F. Perronnin, M. Douze, J. S&#x00e1-;-nchez, P. Perez et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9
DOI : 10.1109/TPAMI.2011.235

A. Ng and M. Jordan, On Discriminative vs Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes, Advances in Neural Information Processing Systems (NIPS), 2001.

A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.12, pp.1349-1380, 2000.
DOI : 10.1109/34.895972

A. Hanjalic and L. Q. Xu, Affective video content representation and modeling, IEEE Transactions on Multimedia, vol.7, issue.1, pp.143-154, 2005.
DOI : 10.1109/TMM.2004.840618

C. Snoek and M. Worring, Concept-based video retrieval. Found. Trends Inf, Retr, vol.2, issue.4, pp.215-322, 2009.
DOI : 10.1561/1500000014

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.156.5031

L. Zhang and Y. Rui, Image search—from thousands to billions in 20 years

A. F. Smeaton, P. Over, and W. Kraaij, Evaluation campaigns and TRECVid, Proceedings of the 8th ACM international workshop on Multimedia information retrieval , MIR '06, pp.321-330, 2006.
DOI : 10.1145/1178677.1178722

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.329.3415

M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang et al., Query by image and video content: The qbic system, Computer, issue.9, pp.2823-2855, 1995.

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

A. Joly, C. Frélicot, and O. Buisson, Robust Content-Based Video Copy Identification in a Large Reference Database, CIVR, pp.414-424, 2003.
DOI : 10.1007/3-540-45113-7_41

T. Quack, U. Mönich, L. Thiele, and B. S. Manjunath, Cortina, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, pp.508-511, 2004.
DOI : 10.1145/1027527.1027650

A. Torralba, R. Fergus, and Y. Weiss, Small codes and large image databases for recognition, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587633

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.222.6991

M. Douze, H. Jégou, H. Sandhawalia, L. Amsaleg, and C. Schmid, Evaluation of GIST descriptors for web-scale image search, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, pp.1-19, 2009.
DOI : 10.1145/1646396.1646421

URL : https://hal.archives-ouvertes.fr/inria-00394212

H. Jegou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3304-3311, 2010.
DOI : 10.1109/CVPR.2010.5540039

URL : https://hal.archives-ouvertes.fr/inria-00548637

X. J. Wang, L. Zhang, M. Liu, Y. Li, and W. Y. Ma, ARISTA - image search to annotation on billions of web photos, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2987-2994, 2010.
DOI : 10.1109/CVPR.2010.5540046

Y. Wang, T. Mei, S. Gong, and X. S. Hua, Combining global, regional and contextual features for automatic image annotation, Pattern Recognition, vol.42, issue.2, pp.259-266, 2009.
DOI : 10.1016/j.patcog.2008.05.010

J. Han and K. K. Ma, Fuzzy color histogram and its use in color image retrieval. Trans

R. Chakravarti and X. Meng, A Study of Color Histogram Based Image Retrieval, 2009 Sixth International Conference on Information Technology: New Generations, pp.1323-1328, 2009.
DOI : 10.1109/ITNG.2009.126

T. Gevers, J. Weijer, and H. Stokman, Color image processing: methods and applications: color feature detection: an overview, chapter 9, pp.203-226, 2006.

K. Van-de-sande, T. Gevers, and C. Snoek, Evaluating Color Descriptors for Object and Scene Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, pp.1582-1596, 2010.
DOI : 10.1109/TPAMI.2009.154

J. Weijer, T. Gevers, and A. D. Bagdanov, Boosting color saliency in image feature detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.1, pp.150-156, 2005.
DOI : 10.1109/TPAMI.2006.3

URL : https://hal.archives-ouvertes.fr/inria-00548615

J. Hays and A. Efros, IM2GPS: estimating geographic information from a single image, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587784

E. Chang, K. Goh, G. Sychay, and G. Wu, CBSA: content-based soft annotation for multimodal image retrieval using bayes point machines, IEEE Transactions on Circuits and Systems for Video Technology, vol.13, issue.1, pp.26-38, 2003.
DOI : 10.1109/TCSVT.2002.808079

J. Li and J. Z. Wang, Automatic linguistic indexing of pictures by a statistical modeling approach. Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol.25, issue.9, pp.1075-1088, 2003.

M. Stricker and M. Orengo, Similarity of color images, pp.381-392, 1995.

J. Li and J. Z. Wang, Automatic linguistic indexing of pictures by a statistical modeling approach, IEEE Trans. Pattern Anal. Mach. Intell, vol.25, issue.9, pp.1075-1088, 2003.

F. Mindru, T. Tuytelaars, L. V. Gool, and T. Moons, Moment invariants for recognition under changing viewpoint and illumination, Computer Vision and Image Understanding, vol.94, issue.1-3, pp.3-27, 2004.
DOI : 10.1016/j.cviu.2003.10.011

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.5.3379

J. Wang and X. S. Hua, Interactive Image Search by Color Map, ACM Transactions on Intelligent Systems and Technology, vol.3, issue.1
DOI : 10.1145/2036264.2036276

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.690.227

A. Oliva and A. Torralba, Modeling the shape of the scene: A holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

Y. Weiss, A. Torralba, and R. Fergus, Spectral hashing, NIPS, pp.1753-1760

J. Hays and A. Efros, Scene completion using millions of photographs, ACM Transactions on Graphics, vol.26, issue.3, 2007.
DOI : 10.1145/1275808.1276382

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.220.2147

H. Tamura, S. Mori, and T. Yamawaki, Textural features corresponding to visual perception. Systems, Man and Cybernetics, IEEE Transactions on, vol.8, issue.6, pp.460-473, 1978.
DOI : 10.1109/tsmc.1978.4309999

D. Martin, C. Fowlkes, D. Tal, and J. Malik, A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001, pp.416-423, 2001.
DOI : 10.1109/ICCV.2001.937655

G. H. Liu, L. Zhang, Y. K. Hou, Z. Y. Li, and J. Y. Yang, Image retrieval based on multi-texton histogram, Pattern Recognition, vol.43, issue.7, pp.2380-2389, 2010.
DOI : 10.1016/j.patcog.2010.02.012

T. Ahonen, A. Hadid, and M. Pietikäinen, Face Recognition with Local Binary Patterns, Proc. of 9th Euro15 We, pp.469-481, 2011.
DOI : 10.1007/978-3-540-24670-1_36

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.214.6851

M. Subrahmanyam, R. P. Maheshwari, and R. Balasubramanian, Local maximum edge binary patterns: A new descriptor for image retrieval and object tracking, Signal Processing, vol.92, issue.6, pp.1467-1479, 2012.
DOI : 10.1016/j.sigpro.2011.12.005

A. Sinha, S. Banerji, and C. Liu, Novel color Gabor-LBP-PHOG (GLP) descriptors for object and scene image classification, Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP '12, pp.58-2012
DOI : 10.1145/2425333.2425391

C. S. Won, D. K. Park, and S. J. Park, Efficient Use of MPEG-7 Edge Histogram Descriptor, ETRI Journal, vol.24, issue.1, pp.23-30, 2002.
DOI : 10.4218/etrij.02.0102.0103

URL : http://doi.org/10.4218/etrij.02.0102.0103

M. Wang, X. H. Hua, Y. Song, X. Yuan, S. Li et al., Automatic video annotation by semi-supervised learning with kernel density estimation, Proceedings of the 14th annual ACM international conference on Multimedia , MULTIMEDIA '06, pp.967-976, 2006.
DOI : 10.1145/1180639.1180855

T. Tuytelaars and K. Mikolajczyk, Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision, pp.177-280, 2007.
DOI : 10.1561/0600000017

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.377.3635

F. F. Li and P. Perona, A bayesian hierarchical model for learning natural scene categories, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) -Volume CVPR '05, pp.524-531, 2005.

J. Winn, A. Criminisi, and T. Minka, Object categorization by learned universal visual dictionary, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1800-1807, 2005.
DOI : 10.1109/ICCV.2005.171

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

F. Moosmann, B. Triggs, and F. Jurie, Fast discriminative visual codebooks using randomized clustering forests, NIPS, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00203734

A. Gordoa, J. A. Rodriguez-serrano, F. Perronnin, and E. Valveny, Leveraging category-level labels for instance-level image retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp.3045-3052, 2012.
DOI : 10.1109/CVPR.2012.6248035

J. Delhumeau, P. H. Gosselin, H. Jégou, and P. Pérez, Revisiting the VLAD image representation, Proceedings of the 21st ACM international conference on Multimedia, MM '13, pp.653-656, 2013.
DOI : 10.1145/2502081.2502171

URL : https://hal.archives-ouvertes.fr/hal-00840653

K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.10, pp.1615-1630, 2005.
DOI : 10.1109/TPAMI.2005.188

URL : https://hal.archives-ouvertes.fr/inria-00548227

J. Matas, O. Chum, M. Urban, and T. Pajdla, Robust wide baseline stereo from maximally stable extremal regions, BMVC. British Machine Vision Association, 2002.
DOI : 10.5244/c.16.36

T. Tuytelaars and L. Van-gool, Matching Widely Separated Views Based on Affine Invariant Regions, International Journal of Computer Vision, vol.59, issue.1, pp.61-85, 2004.
DOI : 10.1023/B:VISI.0000020671.28016.e8

D. G. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, pp.1150-1157, 1999.
DOI : 10.1109/ICCV.1999.790410

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.121.4065

D. G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

Y. Ke and R. Sukthankar, Pca-sift: A more distinctive representation for local image descriptors, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'04, pp.506-513, 2004.

M. Jain, R. Benmokhtar, H. Jegou, and P. Gros, Hamming embedding similarity-based image classification, Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, ICMR '12, 2012.
DOI : 10.1145/2324796.2324820

URL : https://hal.archives-ouvertes.fr/hal-00688169

R. Arandjelovi´carandjelovi´c and A. Zisserman, Three things everyone should know to improve object retrieval, IEEE Conference on Computer Vision and Pattern Recognition, 2012.

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.886-893, 2005.
DOI : 10.1109/CVPR.2005.177

URL : https://hal.archives-ouvertes.fr/inria-00548512

Z. Qiang, M. C. Yeh, K. T. Cheng, and S. Avidan, Fast Human Detection Using a Cascade of Histograms of Oriented Gradients, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.1491-1498, 2006.
DOI : 10.1109/CVPR.2006.119

P. Ott and M. Everingham, Implicit color segmentation features for pedestrian and object detection, 2009 IEEE 12th International Conference on Computer Vision, pp.723-730, 2009.
DOI : 10.1109/ICCV.2009.5459238

P. F. Felzenszwalb, R. B. Girshick, D. Mcallester, and D. Ramanan, Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol.32, issue.9, pp.1627-1645, 2010.
DOI : 10.1109/tpami.2009.167

X. Wang, T. X. Han, and S. Yan, An HOG-LBP human detector with partial occlusion handling, 2009 IEEE 12th International Conference on Computer Vision, pp.32-39, 2009.
DOI : 10.1109/ICCV.2009.5459207

F. Jing, M. Li, L. Zhang, H. J. Zhang, and B. Zhang, Learning in Region-Based Image Retrieval, Image and Video Retrieval, pp.206-215, 2003.
DOI : 10.1007/3-540-45113-7_21

G. Schindler, M. Brown, and R. Szeliski, City-Scale Location Recognition, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-7, 2007.
DOI : 10.1109/CVPR.2007.383150

J. Wang, J. Yang, K. Yu, F. Lv, T. S. Huang et al., Locality-constrained Linear Coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3360-3367, 2010.
DOI : 10.1109/CVPR.2010.5540018

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.175.2312

K. Chatfield, V. S. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, pp.1-12, 2011.
DOI : 10.5244/C.25.76

J. C. Van-gemert, C. J. Veenman, A. W. Smeulders, and J. M. Geusebroek, Visual Word Ambiguity, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.7, p.32, 2010.
DOI : 10.1109/TPAMI.2009.132

S. A. Chatzichristofis, C. Iakovidou, Y. S. Boutalis, and O. Marques, Co.Vi.Wo.: Color Visual Words Based on Non-Predefined Size Codebooks, IEEE Transactions on Cybernetics, vol.43, issue.1, pp.192-205, 2013.
DOI : 10.1109/TSMCB.2012.2203300

K. Grauman and T. Darrell, The pyramid match kernel: discriminative classification with sets of image features, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1458-1465, 2005.
DOI : 10.1109/ICCV.2005.239

J. Yang, K. Yu, Y. Gong, and T. S. Huang, Linear spatial pyramid matching using sparse coding for image classification, CVPR, pp.1794-1801, 2009.

Y. L. Boureau, N. Roux, F. Bach, J. Ponce, and Y. Lecun, Ask the locals: Multi-way local pooling for image recognition, 2011 International Conference on Computer Vision, pp.2651-2658, 2011.
DOI : 10.1109/ICCV.2011.6126555

URL : https://hal.archives-ouvertes.fr/hal-00646816

J. Yuan, Y. Wu, and M. Yang, Discovery of Collocation Patterns: from Visual Words to Visual Phrases, 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.
DOI : 10.1109/CVPR.2007.383222

Y. T. Zheng, S. Y. Neo, T. S. Chua, and Q. Tian, Toward a higher-level visual representation for object-based image retrieval. The Visual Computer, pp.13-23, 2009.

D. Picard and P. H. Gosselin, Improving image similarity with vectors of locally aggregated tensors, 2011 18th IEEE International Conference on Image Processing, pp.669-672, 2011.
DOI : 10.1109/ICIP.2011.6116641

URL : https://hal.archives-ouvertes.fr/hal-00591993

X. Zhou, K. Yu, T. Zhang, T. S. Huang, and T. S. Huang, Image Classification Using Super-Vector Coding of Local Image Descriptors, ECCV (5), pp.141-154, 2010.
DOI : 10.1007/978-3-642-15555-0_11

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.175.1571

A. Vailaya, M. Figueiredo, A. K. Jain, and H. Zhang, Image classification for content-based indexing, IEEE Transactions on Image Processing, vol.10, issue.1, pp.117-130, 2001.
DOI : 10.1109/83.892448

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.137.7732

J. Luo and A. E. Savakis, Indoor vs outdoor classification of consumer photographs using low-level and semantic features, pp.745-748, 2001.

Y. Mori, H. Takahashi, and R. Oka, Image-to-word transformation based on dividing and vector quantizing images with words, MISRM'99 First International Workshop on Multimedia Intelligent Storage and Retrieval Management, 1999.

J. Jeon, V. Lavrenko, and R. Manmatha, Automatic image annotation and retrieval using cross-media relevance models, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , SIGIR '03, pp.119-126, 2003.
DOI : 10.1145/860435.860459

URL : http://ciir.cs.umass.edu/pubfiles/mm-41.pdf

P. Huang, J. Bu, C. Chen, K. Liu, and G. Qiu, Improve Image Annotation by Combining Multiple Models, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System, pp.3-9, 2007.
DOI : 10.1109/SITIS.2007.29

A. Bosch, A. Zisserman, and X. Muñoz, Scene Classification Using a Hybrid Generative/Discriminative Approach, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.30, issue.4, pp.712-727, 2008.
DOI : 10.1109/TPAMI.2007.70716

P. Quelhas, F. Monay, J. M. Odobez, D. Gatica-perez, and T. Tuytelaars, A Thousand Words in a Scene, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.9, pp.1575-1589, 2007.
DOI : 10.1109/TPAMI.2007.1155

H. H. Permuter, J. M. Francos, and I. Jermyn, A study of Gaussian mixture models of color and texture features for image classification and segmentation, Pattern Recognition, vol.39, issue.4, pp.695-706, 2006.
DOI : 10.1016/j.patcog.2005.10.028

R. Zhang, Z. Zhang, W. Y. Li, H. Ma, and . Zhang, A probabilistic semantic model for image annotation and multi-modal image retrieval, Multimedia Systems, vol.22, issue.1, pp.27-33, 2006.
DOI : 10.1007/s00530-006-0025-1

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.84.5996

D. M. Blei and M. I. Jordan, Modeling annotated data, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , SIGIR '03, pp.127-134, 2003.
DOI : 10.1145/860435.860460

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.140.6686

X. Zhang, Z. Li, L. Zhang, W. Y. Ma, and H. Y. Shum, Efficient indexing for large scale visual search, Computer Vision IEEE 12th International Conference on, pp.1103-1110, 2009.

N. K. Alham, L. Maozhen, S. Hammoud, and Q. Hao, Evaluating Machine Learning Techniques for Automatic Image Annotations, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery, pp.245-249, 2009.
DOI : 10.1109/FSKD.2009.531

U. Niaz, M. Redi, C. Tanase, G. Merialdo, B. Farinella et al., EURECOM at TRECVID 2011: The light semantic indexing task, TRECVID 2011, 15th International Workshop on Video Retrieval Evaluation National Institute of Standards and Technology, p.2011, 2011.

U. Niaz, M. Redi, C. Tanase, and B. Merialdo, EURECOM at TrecVid 2012: The light semantic indexing task, TRECVID 2012, 16th International Workshop on Video Retrieval Evaluation, 2012.

S. Maji, A. C. Berg, and J. Malik, Classification using intersection kernel support vector machines is efficient, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587630

P. Vincent and Y. Bengio, K-local hyperplane and convex distance nearest neighbor algorithms, NIPS, pp.985-992, 2001.

O. Boiman, E. Shechtman, and M. Irani, In defense of Nearest-Neighbor based image classification, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587598

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems 25, pp.1097-1105, 2012.

H. Zhang, A. C. Berg, M. Maire, and J. Malik, SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2126-2136, 2006.
DOI : 10.1109/CVPR.2006.301

T. Tuytelaars, M. Fritz, K. Saenko, and T. Darrell, The NBNN kernel, 2011 International Conference on Computer Vision, pp.1824-1831, 2011.
DOI : 10.1109/ICCV.2011.6126449

K. Nigam and R. Ghani, Analyzing the effectiveness and applicability of co-training, Proceedings of the ninth international conference on Information and knowledge management , CIKM '00
DOI : 10.1145/354756.354805

R. Yan and M. R. Naphade, Semi-supervised cross feature learning for semantic concept detection in videos, pp.657-663, 2005.

W. Li, L. Duan, I. Tsang, and D. Xu, Co-labeling: A New Multi-view Learning Approach for Ambiguous Problems, 2012 IEEE 12th International Conference on Data Mining, pp.419-428, 2012.
DOI : 10.1109/ICDM.2012.78

W. Du, R. Phlypo, and T. Adali, Adaptive feature split selection for co-training: Application to tire irregular wear classification, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
DOI : 10.1109/ICASSP.2013.6638308

U. Niaz and B. Merialdo, Selective Multi-cotraining for Video Concept Detection, Proceedings of International Conference on Multimedia Retrieval, ICMR '14, 2014.
DOI : 10.1145/2578726.2578789

P. K. Atrey, M. A. Hossain, A. Saddik, and M. S. Kankanhalli, Multimodal fusion for multimedia analysis: a survey, Multimedia Systems, vol.24, issue.11, pp.345-379, 2010.
DOI : 10.1007/s00530-010-0182-0

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.190.1065

C. Snoek and M. Worring, Multimodal Video Indexing: A Review of the State-of-the-art, Multimedia Tools and Applications, vol.25, issue.1
DOI : 10.1023/B:MTAP.0000046380.27575.a5

U. Niaz and B. Merialdo, Fusion methods for multimodal indexing of web data, WIAMIS 2013, 14th International Workshop on Image and Audio Analysis for Multimedia Interactive Sercices, pp.7-2013, 2013.

C. Snoek, M. Worring, and A. Smeulders, Early versus late fusion in semantic video analysis, Proceedings of the 13th annual ACM international conference on Multimedia , MULTIMEDIA '05, 2005.
DOI : 10.1145/1101149.1101236

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.78.5928

S. T. Strat, A. Benoit, H. Bredin, G. Quénot, and P. Lambert, Hierarchical late fusion for concept detection in videos, p.2012
DOI : 10.1007/978-3-642-33885-4_34

URL : https://hal.archives-ouvertes.fr/hal-00732740

M. Law, N. Thome, and M. Cord, Hybrid Pooling Fusion in the BoW Pipeline, p.2012
DOI : 10.1007/978-3-642-33885-4_36

URL : https://hal.archives-ouvertes.fr/hal-01282471

M. Guillaumin, J. Verbeek, and C. Schmid, Multimodal semi-supervised learning for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540120

URL : https://hal.archives-ouvertes.fr/inria-00548640

M. S. Kankanhalli, J. Wang, and R. Jain, Experiential Sampling in Multimedia Systems, IEEE Transactions on Multimedia, vol.8, issue.5, pp.937-946, 2006.
DOI : 10.1109/TMM.2006.879876

Y. G. Jiang, G. Ye, S. F. Chang, D. Ellis, and A. C. Loui, Consumer video understanding, Proceedings of the 1st ACM International Conference on Multimedia Retrieval, ICMR '11, 2011.
DOI : 10.1145/1991996.1992025

X. Zou and B. Bhanu, Tracking humans using multi-modal fusion, 2005.

K. Mcdonald and A. F. Smeaton, A comparison of score, rank and probability-based fusion methods for video shot retrieval, 2005.

X. S. Hua and H. J. Zhang, An Attention-Based Decision Fusion Scheme for Multimedia Information Retrieval, 2004.
DOI : 10.1007/978-3-540-30542-2_123

N. Liu, E. Dellandrea, C. Zhu, C. E. Bichot, and L. Chen, A selective weighted late fusion for visual concept recognition, p.2012
DOI : 10.1007/978-3-642-33885-4_43

URL : https://hal.archives-ouvertes.fr/hal-01353059

W. H. Adams, G. Iyengar, M. R. Naphade, C. Neti, H. J. Nock et al., Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues, EURASIP Journal on Advances in Signal Processing, vol.2003, issue.2, pp.170-185, 2003.
DOI : 10.1155/S1110865703211173

R. Yan, J. Yang, and A. G. Hauptmann, Learning query-class dependent weights in automatic video retrieval, Proceedings of the 12th annual ACM international conference on Multimedia , MULTIMEDIA '04, pp.548-555, 2004.
DOI : 10.1145/1027527.1027661

U. Niaz and B. Mérialdo, Entropy based supervised merging for visual categorization Advanced Concepts for Intelligent Vision Systems, ACIVS 2012 Also published in LNCS, 2012.
DOI : 10.1007/978-3-642-33140-4_37

U. Niaz and B. Merialdo, Exploring intra-BOW statistics for improving visual categorization, 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), pp.7-2013, 2013.
DOI : 10.1109/WIAMIS.2013.6616130

L. Wang, Toward a discriminative codebook: Codeword selection across multiresolution, CVPR, 2007.

F. Perronnin, C. R. Dance, G. Csurka, and M. Bressan, Adapted Vocabularies for Generic Visual Categorization, ECCV (4), pp.464-475, 2006.
DOI : 10.1007/11744085_36

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.95.8075

C. Lin, S. Li, and S. Su, Image classification using adapted codebook, 2009 IEEE International Symposium on IT in Medicine & Education, pp.1307-1312, 2009.
DOI : 10.1109/ITIME.2009.5236269

J. Hao and X. Jie, Improved bags-of-words algorithm for scene recognition, 2010 2nd International Conference on Signal Processing Systems, pp.2-279, 2010.
DOI : 10.1109/ICSPS.2010.5555494

Y. Su and F. Jurie, Visual word disambiguation by semantic contexts, 2011 International Conference on Computer Vision, 2011.
DOI : 10.1109/ICCV.2011.6126257

URL : https://hal.archives-ouvertes.fr/hal-00808655

F. Wang and B. Merialdo, Weighting informativeness of bag-of-visual-words by kernel optimization for video concept detection, Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval, VLS-MCMR '10, 2010.
DOI : 10.1145/1878137.1878150

H. Jegou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, ECCV, 2008.
DOI : 10.1007/978-3-540-88682-2_24

URL : https://hal.archives-ouvertes.fr/inria-00316866

D. Arthur and S. Vassilvitskii, k-means++: the advantages of careful seeding, Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, SODA '07, pp.1027-1035, 2007.

A. F. Smeaton, P. Over, and W. Kraaij, High-Level Feature Detection from Video in TRECVid: A 5-Year Retrospective of Achievements, Multimedia Content Analysis, Theory and Applications, pp.151-174, 2009.
DOI : 10.1007/978-0-387-76569-3_6

C. C. Chang and C. J. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.27-28, 2011.
DOI : 10.1145/1961189.1961199

A. Vedaldi and A. Zisserman, Efficient Additive Kernels via Explicit Feature Maps, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.3, pp.480-492, 2012.
DOI : 10.1109/TPAMI.2011.153

A. Vedaldi and B. Fulkerson, Vlfeat, Proceedings of the international conference on Multimedia, MM '10, 2008.
DOI : 10.1145/1873951.1874249

U. Niaz and B. Merialdo, Leveraging from group classification for video concept detection, 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.6-2013, 2013.
DOI : 10.1109/CBMI.2013.6576577

U. Niaz and B. Merialdo, Improving video concept detection through label space partitioning, 2014 IEEE International Conference on Multimedia and Expo (ICME), pp.7-2014, 2014.
DOI : 10.1109/ICME.2014.6890258

G. Tsoumakas and I. Katakis, Multi-label classification: An overview, Int J Data Warehousing and Mining, pp.1-13, 2007.
DOI : 10.4018/jdwm.2007070101

G. Tsoumakas, I. Katakis, and I. Vlahavas, A review of multi-label classification methods, Proceedings of the 2nd ADBIS Workshop on Data Mining and Knowledge Discovery, pp.99-109, 2006.

G. Tsoumakas, I. Katakis, and I. P. Vlahavas, Mining Multi-label Data, Data Mining and Knowledge Discovery Handbook, pp.667-685, 2010.
DOI : 10.1007/978-0-387-09823-4_34

G. Tsoumakas, I. Katakis, and I. P. Vlahavas, Random k-Labelsets for Multilabel Classification, IEEE Transactions on Knowledge and Data Engineering, vol.23, issue.7, pp.1079-1089, 2011.
DOI : 10.1109/TKDE.2010.164

G. Tsoumakas and I. Vlahavas, Random k-Labelsets: An Ensemble Method for Multilabel Classification, Proceedings of the 18th European Conference on Machine Learning, ECML '07, pp.406-417, 2007.
DOI : 10.1007/978-3-540-74958-5_38

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.97.5044

E. Bart and S. Ullman, Single-example learning of novel classes using representation by similarity, Procedings of the British Machine Vision Conference 2005, 2005.
DOI : 10.5244/C.19.82

E. Bart and S. Ullman, Cross-Generalization: Learning Novel Classes from a Single Example by Feature Replacement, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.672-679, 2005.
DOI : 10.1109/CVPR.2005.117

R. Fergus, H. Bernal, Y. Weiss, and A. Torralba, Semantic Label Sharing for Learning with Many Categories, Lecture Notes in Computer Science, vol.1, issue.6311, pp.762-775, 2010.
DOI : 10.1007/978-3-642-15549-9_55

G. A. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

M. Rohrbach, M. Stark, G. Szarvas, I. Gurevych, and B. Schiele, What helps where – and why? Semantic relatedness for knowledge transfer, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.910-917
DOI : 10.1109/CVPR.2010.5540121

V. Ferrari and A. Zisserman, Learning visual attributes, Advances in Neural Information Processing Systems, 2007.

A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth, Describing objects by their attributes, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206772

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.9539

G. Wang and D. A. Forsyth, Joint learning of visual attributes, object classes and visual saliency, 2009 IEEE 12th International Conference on Computer Vision, pp.537-544, 2009.
DOI : 10.1109/ICCV.2009.5459194

C. H. Lampert, H. Nickisch, and S. Harmeling, Learning to detect unseen object classes by betweenclass attribute transfer, CVPR, 2009.
DOI : 10.1109/cvpr.2009.5206594

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.165.9750

G. Wang, D. Hoiem, and D. Forsyth, Building text features for object image classification, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.1367-1374, 2009.
DOI : 10.1109/CVPR.2009.5206816

N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar, Attribute and simile classifiers for face verification, 2009 IEEE 12th International Conference on Computer Vision, pp.365-372, 2009.
DOI : 10.1109/ICCV.2009.5459250

T. L. Berg, A. C. Berg, and J. Shih, Automatic Attribute Discovery and Characterization from Noisy Web Data, Lecture Notes in Computer Science, vol.1, issue.6311, pp.663-676, 2010.
DOI : 10.1007/978-3-642-15549-9_48

D. Parikh and K. Grauman, Interactively building a discriminative vocabulary of nameable attributes, CVPR 2011, pp.1681-1688, 2011.
DOI : 10.1109/CVPR.2011.5995451

M. Rastegari, A. Farhadi, and D. Forsyth, Attribute Discovery via Predictable Discriminative Binary Codes, ECCV, pp.876-889, 2012.
DOI : 10.1007/978-3-642-33783-3_63

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.376.3998

A. Farhadi, I. Endres, and D. Hoiem, Attribute-centric recognition for cross-category generalization, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.2352-2359, 2010.
DOI : 10.1109/CVPR.2010.5539924

D. Parikh and K. Grauman, Relative attributes, 2011 International Conference on Computer Vision, pp.503-510, 2011.
DOI : 10.1109/ICCV.2011.6126281

Y. Wang and G. Mori, A Discriminative Latent Model of Object Classes and Attributes, ECCV (5), pp.155-168
DOI : 10.1007/978-3-642-15555-0_12

O. Russakovsky and F. F. Li, Attribute Learning in Large-Scale Datasets, ECCV Workshops, pp.1-14
DOI : 10.1007/978-3-642-35749-7_1

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., ImageNet: A Large-Scale Hierarchical Image Database, CVPR09, 2009.

J. Cai, Z. J. Zha, W. Zhou, and Q. Tian, Attribute-assisted reranking for web image retrieval, Proceedings of the 20th ACM international conference on Multimedia, MM '12, pp.873-876, 2012.
DOI : 10.1145/2393347.2396335

H. Zhang, Z. Zha, Y. Yang, S. Yan, Y. Gao et al., Attribute-augmented semantic hierarchy, Proceedings of the 21st ACM international conference on Multimedia, MM '13, pp.33-42, 2013.
DOI : 10.1145/2502081.2502093

F. Yu, R. Ji, M. H. Tsai, G. Ye, and F. F. Chang, Weak attributes for large-scale image retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248023

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.394.954

R. Salakhutdinov and G. E. Hinton, Learning a nonlinear embedding by preserving class neighbourhood structure, AISTATS, volume 2 of JMLR Proceedings, pp.412-419, 2007.

R. R. Salakhutdinov and G. E. Hinton, Semantic hashing, Proceedings of the SIGIR Workshop on Information Retrieval and Applications of Graphical Models, 2007.
DOI : 10.1016/j.ijar.2008.11.006

URL : http://doi.org/10.1016/j.ijar.2008.11.006

M. Jain, H. Jégou, and P. Gros, Asymmetric hamming embedding, Proceedings of the 19th ACM international conference on Multimedia, MM '11, pp.1441-1444, 2011.
DOI : 10.1145/2072298.2072035

URL : https://hal.archives-ouvertes.fr/inria-00607278

M. Raginsky and S. Lazebnik, Locality-sensitive binary codes from shift-invariant kernels, NIPS, pp.1509-1517, 2009.

T. Hastie and R. Tibshirani, Classification by pairwise coupling, Proceedings of the 1997 Conference on Advances in Neural Information Processing Systems 10, NIPS '97, pp.507-513, 1998.
DOI : 10.1214/aos/1028144844

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.309.4720

T. G. Dietterich and G. Bakiri, Solving multiclass learning problems via errorcorrecting output codes, Journal of Artificial Intelligence Research, vol.2, pp.263-286, 1995.

S. Escalera, O. Pujol, and P. Radeva, Loss-weighted decoding for error-correcting output coding, International Conference on Computer Vision Theory and Applications, pp.117-122, 2008.

G. Armano, C. Chira, and N. Hatami, Error-correcting output codes for multi-label text categorization, IIR, CEUR, 2012.

M. Cissé, T. Artì, and P. Gallinari, Learning Compact Class Codes for Fast Inference in Large Multi Class Classification, ECML/PKDD, pp.506-520
DOI : 10.1007/978-3-642-33460-3_38

E. L. Allwein, R. E. Schapire, and Y. Singer, Reducing multiclass to binary: A unifying approach for margin classifiers, J. Mach. Learn. Res, vol.1, pp.113-141, 2001.

S. Escalera, O. Pujol, and P. Radeva, Separability of ternary codes for sparse designs of error-correcting output codes, Pattern Recognition Letters, vol.30, issue.3, 2009.
DOI : 10.1016/j.patrec.2008.10.002

T. Kajdanowicz and P. Kazienko, Multi-label classification using error correcting output codes, International Journal of Applied Mathematics and Computer Science, vol.22, issue.4, 2012.
DOI : 10.2478/v10006-012-0061-2

M. Mirza-mohammadi, F. Ciompi, S. Escalera, O. Pujol, and P. Radeva, Ranking error-correcting output codes for class retrieval, CVC-IIR, 2009.

C. S. Ferng and H. T. Lin, Multi-label classification with error-correcting codes, ACML, 2011.

O. Pujol, S. Escalera, and P. Radeva, An incremental node embedding technique for error correcting output codes, Pattern Recognition, vol.41, issue.2, p.41, 2008.
DOI : 10.1016/j.patcog.2007.04.008

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.217.5758

Y. Wang and D. A. Forsyth, Large multi-class image categorization with ensembles of label trees, 2013 IEEE International Conference on Multimedia and Expo (ICME), 2013.
DOI : 10.1109/ICME.2013.6607437

A. Rocha and S. K. Goldenstein, Multiclass From Binary: Expanding One-Versus-All, One-Versus-One and ECOC-Based Approaches, IEEE Transactions on Neural Networks and Learning Systems, vol.25, issue.2, pp.289-302, 2014.
DOI : 10.1109/TNNLS.2013.2274735

B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman, LabelMe: A Database and Web-Based Tool for Image Annotation, International Journal of Computer Vision, vol.3, issue.1, pp.157-173, 2008.
DOI : 10.1007/s11263-007-0090-8

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.192.9478

P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders et al., Trecvid 2013 ? an overview of the goals, tasks, data, evaluation mechanisms and metrics, Proceedings of TRECVID 2013, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00953093

S. Shalev-shwartz, Y. Singer, and N. Srebro, Pegasos, Proceedings of the 24th international conference on Machine learning, ICML '07, 2007.
DOI : 10.1145/1273496.1273598

S. Bengio, J. Weston, and D. Grangier, Label embedding trees for large multi-class tasks, NIPS, pp.163-171, 2010.

M. D. Smucker, J. Allan, and B. Carterette, A comparison of statistical significance tests for information retrieval evaluation, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management , CIKM '07, 2007.
DOI : 10.1145/1321440.1321528

A. Blum and T. Mitchell, Combining labeled and unlabeled data with co-training, Proceedings of the eleventh annual conference on Computational learning theory , COLT' 98, 1998.
DOI : 10.1145/279943.279962

URL : http://axon.cs.byu.edu/~martinez/classes/678/Papers/Mitchell_cotraining.pdf

C. Xu, D. Tao, and C. Xu, A survey on multi-view learning. CoRR, abs/1304, 2013.

X. Zhang, J. Cheng, H. Lu, and S. Ma, Weighted Co-SVM for Image Retrieval with MVB Strategy, 2007 IEEE International Conference on Image Processing, pp.517-520, 2007.
DOI : 10.1109/ICIP.2007.4380068

R. Ewerth and B. Freisleben, Semi-supervised learning for semantic video retrieval, Proceedings of the 6th ACM international conference on Image and video retrieval, CIVR '07, pp.154-161, 2007.
DOI : 10.1145/1282280.1282308

X. Zhang, J. Cheng, H. Lu, and S. Ma, Weighted Co-SVM for Image Retrieval with MVB Strategy, 2007 IEEE International Conference on Image Processing, pp.517-520, 2007.
DOI : 10.1109/ICIP.2007.4380068

C. M. Christoudias, R. Urtasun, A. Kapoor, and T. Darrell, Co-training with noisy perceptual observations, 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.2844-2851, 2009.
DOI : 10.1109/CVPR.2009.5206572

R. Yan and M. R. Naphade, Co-training non-robust classifiers for video semantic concept detection, ICIP (1), pp.1205-1208, 2005.

P. Q. Gu, Q. Zhu, and C. Zhang, A multi-view approach to semi-supervised document classification with incremental Naive Bayes, Computers & Mathematics with Applications, vol.57, issue.6, pp.1030-1036, 2009.
DOI : 10.1016/j.camwa.2008.10.025

URL : http://doi.org/10.1016/j.camwa.2008.10.025

Y. Du, X. Guan, and Z. Cai, Enhancing Web Page Classification via Local Co-training, 2010 20th International Conference on Pattern Recognition, pp.2905-2908, 2010.
DOI : 10.1109/ICPR.2010.712

Y. Yaslan and Z. Cataltepe, Random Relevant and Non-redundant Feature Subspaces for Co-training, IDEAL, 2009.
DOI : 10.1109/TSA.2005.860352

G. Z. Li, D. Li, W. C. Lu, J. Y. Yang, and M. Q. Yang, Feature selection for co-training, Journal of Shanghai University (English Edition), vol.23, issue.11, 2007.
DOI : 10.1007/s11741-008-0110-2

J. Cheng and K. Wang, Active learning for image retrieval with Co-SVM, Pattern Recognition, vol.40, issue.1, pp.330-334, 2007.
DOI : 10.1016/j.patcog.2006.06.005

I. Muslea, C. Kloblock, and S. Minton, Adaptive view validation: A first step towards automatic view detection, pp.443-450, 2002.

U. Güz, S. Cuendet, D. Hakkani-tür, and G. Tür, Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.320-329, 2010.
DOI : 10.1109/TASL.2009.2028371

X. Li and C. Snoek, Classifying tag relevance with relevant positive and negative examples, Proceedings of the 21st ACM international conference on Multimedia, MM '13, 2013.
DOI : 10.1145/2502081.2502129

K. Van-de-sande, T. Gevers, and C. Snoek, Empowering Visual Categorization With the GPU, IEEE Transactions on Multimedia, vol.13, issue.1, 2011.
DOI : 10.1109/TMM.2010.2091400

J. Yangqing and . Caffe, An open source convolutional architecture for fast feature embedding, 2013.

M. Lux, Content based image retrieval with LIRe, Proceedings of the 19th ACM international conference on Multimedia, MM '11, pp.735-738, 2011.
DOI : 10.1145/2072298.2072432

I. Bartolini, M. Patella, and C. Romani, Shiatsu: tagging and retrieving videos without worries. Multimedia Tools and Applications, pp.357-385, 2013.
DOI : 10.1007/s11042-011-0948-1

J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee et al., Multimodal deep learning, International Conference on Machine Learning (ICML), 2011.

R. Socher, B. Huval, B. Bhat, C. D. Manning, and A. Y. Ng, Convolutional- Recursive Deep Learning for 3D Object Classification, Advances in Neural Information Processing Systems 25, 2012.

U. Niaz and B. Merialdo, Improving video concept detection using uploader model, 2013 IEEE International Conference on Multimedia and Expo (ICME)
DOI : 10.1109/ICME.2013.6607538

U. Merialdo and B. Niaz, Uploader models for video concept detection, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI), pp.6-2014, 2014.
DOI : 10.1109/CBMI.2014.6849847