A. Ghias, J. Logan, D. Chamberlin, B. C. Smith, G. Gillet et al., Query by humming: Musical information retrieval in au audio database Automatic transcription of drum loops, Proceedings of ACM Multime- dia'95 Proceedings of the IEEE ICASSP 2004 Conference, 1995.

O. Gillet and G. Richard, Drum Loops Retrieval from Spoken Queries, Journal of Intelligent Information Systems, 2005.
DOI : 10.1007/s10844-005-0321-9

URL : https://hal.archives-ouvertes.fr/hal-00477665

M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, Rwc music database: Popular, classical, and jazz music databases, Proceedings of the 3rd International Conference on Music Information Retrieval, pp.287-288, 2002.

T. Hastie and R. Tibshirani, Classification by pairwise coupling, Advances in Neural Information Processing Systems, 1998.
DOI : 10.1214/aos/1028144844

P. Herrera, X. Amatriain, E. Battle, and X. Serra, Towards instrument segmentation for music content description: a critical review of instrument classification techniques, Proceedings of ISMIR2000, 2000.

P. Herrera, A. Dehamel, and F. Gouyon, Automatic labeling of unpitched percussion sounds, Proceedings of the 114th AES convention, 2003.

A. Kapur, M. Benning, and G. Tzanetakis, Query by beatboxing: Music information retrieval for the dj, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

A. Klapuri, Sound onset detection by applying psychoacoustic knowledge, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), 1999.
DOI : 10.1109/ICASSP.1999.757494

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.76.5114

U. H. and -. Kressel, Pairwise classification and support vector machines, Advances in kernel methods: support vector learning, pp.255-268, 1999.

R. J. Mcnab, L. A. Smith, I. H. Bainbridge, and . Witten, The New Zealand Digital Library MELody inDEX, D-Lib Magazine, vol.3, issue.5, 1997.
DOI : 10.1045/may97-witten

T. Nakano, J. Ogata, M. Goto, and Y. Hiraga, A drum pattern retrieval method by voice percussion, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

J. Platt, Probabilistic outputs for support vector machines and comparison to regularized likelihood methods, Advances in Large Margin Classiers, pp.61-74, 2000.

V. Vapnik, The Nature of Statistical Learning Theory, 1995.

O. Gillet and G. Richard, Automatic transcription of drum loops, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.
DOI : 10.1109/ICASSP.2004.1326815

P. Smaragdis and M. Casey, Audio/visual independent components, Proceedings of International Symposium on ICA and Blind Source Separation, 2003.

J. W. Fisher and T. Darrell, Signal level fusion for multimodal perceptual user interface, Proceedings of the 2001 workshop on Percetive user interfaces , PUI '01, 2001.
DOI : 10.1145/971478.971482

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.16.7863

D. Murphy, Tracking a conductor's baton, Proceedings of 12th Danish Conference on Pattern Recognition and Image Analysis, 2003.

M. M. Wanderley and P. Depalle, Gesturally-controlled digital audio effects, Proceedings of the COST G-6 Conference on Digital Audio Effects (DAFX-01), 2001.

S. Dahl, The Playing of an Accent ? Preliminary Observations from Temporal and Kinematic Analysis of Percussionists*, Journal of New Music Research, vol.29, issue.3, pp.225-234, 2000.
DOI : 10.1076/jnmr.29.3.225.3090

A. Klapuri, Sound onset detection by applying psychoacoustic knowledge, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), 1999.
DOI : 10.1109/ICASSP.1999.757494

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.76.5114

J. Platt, Probabilistic outputs for support vector machines and comparison to regularized likelihood methods, Advances in Large Margin Classiers, pp.61-74, 2000.

T. Hastie and R. Tibshirani, Classification by pairwise coupling, Advances in Neural Information Processing Systems, 1998.
DOI : 10.1214/aos/1028144844

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.309.4720

C. C. Chang and C. J. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, 2001.
DOI : 10.1145/1961189.1961199

Y. Bengio and Y. Grandvalet, No unbiased estimator of the variance of k-fold cross-validation CIRANO Working Papers, 2003.

F. Opolko and J. Wapnick, McGill University Master Sam- ples, 1987.

G. Ballet, R. Borghesi, P. Hoffmann, and F. Levy, Studio online 3.0: An internet killer application for remote access to ircam sounds and processing tools, Proc. of Journes d'Informatique Musicale (JIM'99), 1999.

L. Fritts, University of Iowa Musical Instrument Samples

M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, RWC Music Database: Popular, Classical, and Jazz Music Databases, Proc. 3rd International Conference on Music Information Retrieval, pp.287-288, 2002.

K. Tanghe, M. Lesaffre, S. Degroeve, M. Leman, B. D. Baets et al., Collecting Ground Truth Annotations for Drum Detection in Polyphonic Music, Proc. 6th Int. Conf. on Music Information Retrieval (ISMIR 2005), pp.50-57, 2005.

E. Thiévon, Batterie mode d'emploi -Playbacks, 2004.

E. Thiévon and P. Argentier, Drums Training Session - Métier et variété, 1999.

M. Alonso, G. Richard, and B. David, Extracting Note Onsets from Musical Recordings, 2005 IEEE International Conference on Multimedia and Expo, 2005.
DOI : 10.1109/ICME.2005.1521568

URL : http://cecs.uci.edu/~papers/icme05/defevent/papers/cr1513.pdf

O. Gillet and G. Richard, Automatic transcription of drum loops, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.
DOI : 10.1109/ICASSP.2004.1326815

M. Alonso, R. Badeau, B. David, and E. G. Richard, Musical tempo estimation using noise subspace projections, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684), 2003.
DOI : 10.1109/ASPAA.2003.1285828

URL : https://hal.archives-ouvertes.fr/hal-00945267

N. [. Agnihotri, J. R. Dimitrova, and . Kender, Design and evaluation of a music video summarization system, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763), pp.1943-1946, 2004.
DOI : 10.1109/ICME.2004.1394641

L. Agnihotri, N. Dimitrova, J. Kender, and E. J. Zimmerman, Music videos miner, Proceedings of the eleventh ACM international conference on Multimedia , MULTIMEDIA '03, pp.442-443, 2003.
DOI : 10.1145/957013.957103

]. M. Alo06 and . Alonso, Extraction of Metrical Information from Acoustic Music Signals, 2006.

M. [. Abdallah and . Plumbey, Probability as metadata : event detection in music using ICA as a conditional density model, Proceedings of the 4th International Symposium on Independent Component Analysis and Blind Signal Separation (ICA'03), 2003.

M. [. Abdallah and . Plumbey, Polyphonic transcription by non-negative sparse coding of power spectra, Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR'04), pp.318-325, 2004.

M. App07-]-apple, G. Alonso, E. B. Richard, and . David, Final cut studio 2 ? motion 3 Extracting Note Onsets from Musical Recordings, Proceedings of the 2005 IEEE International Conference on Multimedia and Expo (ICME'05), 2005.

G. [. Alonso, E. B. Richard, and . David, Accurate tempo estimation based on harmonic + noise decomposition, EURASIP Journal on Advances in Signal Processing, vol.2007, issue.1, 2007.
DOI : 10.1016/0047-259X(86)90017-5

L. [. Albiol, E. E. Torres, and . Delp, Combining audio and video for video sequence indexing applications, Proceedings. IEEE International Conference on Multimedia and Expo, 2002.
DOI : 10.1109/ICME.2002.1035603

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.7281

P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, Audio-Visual Speech Recognition Using MPEG-4 Compliant Visual Features, EURASIP Journal on Applied Signal Processing, vol.11, pp.1213-1227, 2002.
DOI : 10.1155/s1110865702206162

URL : http://doi.org/10.1155/s1110865702206162

]. R. Bad05 and . Badeau, MéthodesMéthodes`Méthodesà haute résolution pour l'estimation et le suivi de sinuso¨?dessinuso¨?des modulées. Application aux signaux de musique, 2005.

E. [. Bennett and . Bredensteiner, Duality and Geometry in SVM Classifiers, Proceedings of the 17th International Conference on Machine Learning, pp.65-72, 2000.

R. [. Badeau, E. B. Boyer, and . David, EDS parametric modeling and tracking of audio signals, Proceedings of the 5th International Conference on Digital Audio Effects (DAFX'02), 2002.
URL : https://hal.archives-ouvertes.fr/hal-00945272

F. [. Benaroya, E. R. Bimbot, and . Gribonval, Audio source separation with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.191-199, 2006.
DOI : 10.1109/TSA.2005.854110

URL : https://hal.archives-ouvertes.fr/inria-00544949

G. Ballet, R. Borghesi, P. Hoffmann, and E. F. Levy, Studio Online 3.0 : An Internet Killer Application for Remote Access to IRCAM Sounds and Processing tools, Proceedings of Journées d'Informatique Musicale (JIM'99), 1999.

R. [. Bertin, E. G. Badeau, and . Richard, Blind Signal Decompositions for Automatic Transcription of Polyphonic Music: NMF and K-SVD on the Benchmark, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.366617

URL : https://hal.archives-ouvertes.fr/hal-00945282

G. [. Bredin and . Chollet, Audio-Visual Speech Synchrony Measure for Talking-Face Identity Verification, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.366215

L. Benaroya, L. Mc-donagh, F. Bimbot, and E. R. Gribonval, Non negative sparse representation for Wiener based source separation with a single sensor, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003.
DOI : 10.1109/ICASSP.2003.1201756

URL : https://hal.archives-ouvertes.fr/inria-00574784

J. P. Bello, C. Duxbury, M. Davies, and E. M. Sandler, On the Use of Phase and Energy for Musical Onset Detection in the Complex Domain, IEEE Signal Processing Letters, vol.11, issue.6, pp.553-556, 2004.
DOI : 10.1109/LSP.2004.827951

B. [. Badeau, E. G. David, and . Richard, Selecting the modeling order for the ESPRIT high resolution method: an alternative approach, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
DOI : 10.1109/ICASSP.2004.1326435

URL : https://hal.archives-ouvertes.fr/hal-00945275

]. L. Ben03 and . Benaroya, Séparation de plusieurs sources sonores avec un capteur, 2003.

D. Barry, D. Fitzgerald, E. Coyle, and E. B. Lawlor, Drum source separation using percussive feature detection and spectral modulation, IEE Irish Signals and Systems Conference 2005, 2005.
DOI : 10.1049/cp:20050280

M. Bosi and E. Goldberg, Introduction to Digital Audio Coding and Standards. Kluwer, 2002.

O. [. Bascoul, E. G. Gillet, and . Laurent, Marginal effects analysis : Identifying the most effective marginal levers in decision making, Marketing Science, 2007.

]. J. Bil93 and . Bilmes, Timing is the essence : Perceptual and computational techniques for representing , learning and reproducing expressive timing in percussive rhythm, 1993.

M. [. Bach and . Jordan, Learning spectral clustering with application to speech separation, Journal of Machine Learning Research, vol.7, pp.1963-2001, 2006.

M. [. Bencina, E. S. Kaltenbrunner, and . Jordà, Improved Topological Fiducial Tracking in the reacTIVision System, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Workshops, 2005.
DOI : 10.1109/CVPR.2005.475

B. [. Barry, E. E. Lawlor, and . Coyle, Sound source separation : Azimuth discrimination and resynthesis, Proceedings of the 7th International Conference on Digital Audio Effects, 2004.

]. I. Blo94 and . Bloch, Information Combination Operators for Data Fusion : A Comparative Review with Classification, SPIE/EUROPTO Conference on Image and Signal Processing for Remote Sensing, pp.148-159, 1994.

]. C. Bon02 and . Bond, A new algorithm for scan conversion of a general ellipse, 2002.

N. [. Brand, E. A. Olivier, and . Pentland, Coupled hidden Markov models for complex action recognition, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p.994, 1997.
DOI : 10.1109/CVPR.1997.609450

]. M. Bra97 and . Brand, Coupled hidden markov models for modeling interacting processes, 1997.

]. L. Bre01 and . Breiman, Statistical modeling : The two cultures, Statistical Science, vol.16, issue.3, pp.199-231, 2001.

M. [. Bello and . Sandler, Phase-based note onset detection for music signals, Proceedings of the 2003 IEEE Conference on Acoustics, Speech and Signal Processing, 2003.
DOI : 10.1109/icassp.2003.1200001

J. C. Christopher and . Burges, A tutorial on support vector machines for pattern recognition, Data Mining and Knowledge Discovery, vol.2, issue.2, pp.121-167, 1998.

G. [. Bartsch and . Wakefield, To catch a chorus: using chroma-based representations for audio thumbnailing, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575), pp.15-18, 2001.
DOI : 10.1109/ASPAA.2001.969531

D. [. Ben-yishai and . Burshtein, A Discriminative Training Algorithm for Hidden Markov Models, IEEE Transactions on Speech and Audio Processing, vol.12, issue.3, pp.204-217, 2004.
DOI : 10.1109/TSA.2003.822639

]. J. Can86 and . Canny, A computational approach to edge detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.8, issue.6, pp.679-698, 1986.

]. M. Cas01 and . Casey, MPEG-7 sound-recognition tools, IEEE Transactions on Circuits and Systems for Video Technology, vol.11, issue.6, pp.737-747, 2001.

C. [. Crisp and . Burges, A geometric interpretation of ?-SVM classifiers, Proceedings of the 12th Conference on Neural Information Processing Systems, 1999.

A. [. Chen and . Chen, Query by rhythm: an approach for song retrieval in music databases, Proceedings Eighth International Workshop on Research Issues in Data Engineering. Continuous-Media Databases and Applications, pp.139-146, 1998.
DOI : 10.1109/RIDE.1998.658288

M. Cooper and J. Foote, Automatic Music Summarization via Similarity Analysis, Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR'02), 2002.

P. [. Chen and . Gopalakrishnan, Speaker, environment and channel change detection and clustering via the bayesian information criterion, Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1998.

]. P. Cho05 and . Chordia, Segmentation and Recognition of Tabla Strokes, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), 2005.

C. [. Chang and . Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, 2001.
DOI : 10.1145/1961189.1961199

]. J. Cla and . Clark, Advanced Programming Techniques for Modular Synthesizers -Chapter 5. Percussions

C. [. Chen, E. B. Lin, and . Schölkopf, A tutorial on ?-support vector machines, Applied Stochastic Models in Business and Industry, pp.111-136, 2005.

B. A. Camurri, M. Mazzarino, R. Ricchetti, E. G. Timmers, and . Volpe, Multimodal Analysis of Expressive Gesture in Music and Dance Performances, Proceedings of the 5th International Gesture Workshop, pp.20-39, 2003.
DOI : 10.1007/978-3-540-24598-8_3

]. A. Con06 and . Cont, Realtime multiple pitch observation using sparse non-negative constraints, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR'06, 2006.

A. [. Canu and . Smola, Kernel methods and the exponential family, Proceedings of the 13th European Symposium on Artificial Neural Networks (ESANN'05), 2005.
DOI : 10.1016/j.neucom.2005.12.009

S. [. Costanza, E. J. Shelley, and . Robinson, Introducing audio d-touch : A tangible user interface for music composition and performance, Proceedings of the 6th International Conference on Digital Audio Effects, 2003.

C. [. Chaigne, E. O. Touzé, and . Thomas, Nonlinear vibrations and chaos in gongs and cymbals, Acoustical Science and Technology, vol.26, issue.5, pp.403-409, 2005.
DOI : 10.1250/ast.26.403

URL : https://hal.archives-ouvertes.fr/hal-01135295

P. [. Cilibrasi, R. Vitanyi, . De, and . Wolf, Algorithmic Clustering of Music Based on String Compression, Computer Music Journal, vol.4, issue.4, pp.49-67, 2004.
DOI : 10.1109/TSA.2002.800560

M. Casey and A. Westner, Separation of mixed audio sources by independent subspace analysis, Proceedings of the International Computer Music Conference, 2000.

]. S. Dah00 and . Dahl, The Playing of an Accent -Preliminary observations from temporal and kinematic analysis of percussionists, In Journal of New Music Research, vol.29, issue.3, pp.225-234, 2000.

]. S. Dah04 and . Dahl, Playing the Accent -Comparing Striking Velocity and Timing in an Ostinato Rhythm Performed by Four Drummers, Acta Acustica united with Acustica, vol.90, pp.762-776, 2004.

A. [. Davis and . Bobick, The Representation and Recognition of Action Using Temporal Templates, Proceedings of the 1997 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'97, 1997.

M. [. Desobry, C. Davy, and . Doncarli, An online kernel change detection algorithm, IEEE Transactions on Signal Processing, vol.53, issue.8, pp.2961-2974, 2005.
DOI : 10.1109/TSP.2005.851098

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.1469

M. [. Duxbury, E. M. Davies, and . Sandler, Extraction of transient content in musical audio using multiresolution analysis techniques, Proceedings of the 4th International Conference on Digital Audio Effects (DAFX'01), 2001.

M. Davy and S. Godsill, Detection of abrupt spectral changes using support vector machines : an application to audio signal segmentation, Proceedings of the 2002 IEEE Conference on Acoustics, Speech and Signal Processing, 2002.

S. [. Davy, E. J. Godsill, and . Idier, Bayesian analysis of polyphonic western tonal music, The Journal of the Acoustical Society of America, vol.119, issue.4, pp.119-42498, 2006.
DOI : 10.1121/1.2168548

URL : https://hal.archives-ouvertes.fr/inria-00120240

F. Pedro, D. P. Daniel, and . Huttenlocher, Cornell computing and information science, 2004.

]. S. Dix01 and . Dixon, Automatic extraction of tempo and beat from expressive performances, In Journal of New Music Research, 2001.

. Drumagog, Drum replacer 3.0, 2003.

]. S. Dtb-+-05, K. Degroeve, B. Tanghe, M. De-baets, J. P. Leman et al., A simulated annealing optimization of audio features for drum classification, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), 2005.

J. [. Ellis and . Arroyo, Eigenrhythms : Drum pattern basis sets for classification and generation, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

R. [. Earl and . Ladner, Enhanced Sequitur for finding structure in data, Data Compression Conference, 2003. Proceedings. DCC 2003, 2003.
DOI : 10.1109/DCC.2003.1194044

]. D. Ell96 and . Ellis, Prediction-driven computational auditory scene analysis, 1996.

]. S. Erd06a, G. Essid, E. B. Richard, and . David, Instrument Recognition in Polyphonic Music Based on Automatic Taxonomies, IEEE Transactions on Audio, Speech, and Language Processing, pp.68-80, 2006.

]. S. Erd06b, G. Essid, E. B. Richard, and . David, Musical instrument recognition by pairwise classification strategies, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1401-1412, 2006.

]. A. Ero01 and . Eronen, Automatic musical instrument recognition, 2001.

]. A. Ero03 and . Eronen, Musical Instrument Recognition using ICA-based transform of features and discriminatively trained HMMs, Proceedings of the 7th International Symposium on Signal Processing and its Applications, pp.133-136, 2003.

R. [. Ellis and . Weiss, Model-based monaural source separation using a vectorquantized phase-vocoder representation, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006.

M. [. Foote, E. A. Cooper, and . Girgensohn, Creating music videos using automatic media analysis, Proceedings of the tenth ACM international conference on Multimedia , MULTIMEDIA '02, pp.553-560, 2002.
DOI : 10.1145/641007.641119

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.85.514

E. [. Fitzgerald, E. B. Coyle, and . Lawlor, Sub-band independent subspace analysis for drum transcription, Proceedings of the 5th International Conference on Digital Audio Effects (DAFX'02), 2002.

T. [. Fisher and . Darrell, Signal level fusion for multimodal perceptual user interface, Proceedings of the 2001 workshop on Percetive user interfaces , PUI '01, pp.1-7, 2001.
DOI : 10.1145/971478.971482

J. W. Fisher, T. Darrell, W. Freeman, and P. A. Viola, Learning joint statistical models for audio-visual fusion and segregation, NIPS, pp.772-778, 2000.

I. [. Fiebrink and . Fujinaga, Feature selection pitfalls and music classification, Proceedings of the 7th International Conference on Music Information Retrieval (IS- MIR'06), 2006.

]. S. Fil06 and . Filippi, Transcription rythmique d'un signal audio de pianò a fortes variations de tempo. Master's thesis Automatic Drum Transcription and Source Separation, 2004.

B. [. Fitzgerald and . Lawlor, Independent subspace analysis using locally linear em- BIBLIOGRAPHIE bedding, Proceedings of the 6th International Conference on Digital Audio Effects, 2003.

]. D. Flc03a, B. Fitzgerald, E. E. Lawlor, and . Coyle, Drum transcription in the presence of pitched instruments using prior subspace analysis, Proceedings of the Irish Signals and Systems Conference, 2003.

]. D. Flc03b, B. Fitzgerald, E. E. Lawlor, and . Coyle, Prior subspace analysis for drum transcription, Proceedings of the 114th AES Convention, 2003.

K. [. Fujinaga and . Macmillian, Real-time recognition of orchestral instruments, Proceedings of the International Computer Music Conference, 2000.

]. J. Foo99 and . Foote, Visualizing music and audio using self-similarity, Proceedings of ACM Multimedia'99, pp.77-87, 1999.

]. G. For73 and . Forney, The Viterbi algorithm, Proceedings of the IEEE, pp.268-278, 1973.

M. [. Fitzgibbon, R. B. Pilu, and . Fisher, Direct least square fitting of ellipses, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.21, issue.5, pp.476-480, 1999.
DOI : 10.1109/34.765658

]. L. Fri and . Fritts, University of Iowa Musical Instrument Samples

L. [. Gribonval, E. Benaroya, C. Vincent, and . Févotte, Proposals for performance measurement in source separation, Proceedings of the 4th Conference on Independent Component Analysis and Blind Signal Separation (ICA'03), 2003.
URL : https://hal.archives-ouvertes.fr/inria-00570123

A. [. Guyon and . Elisseeff, An introduction to feature and variable selection, Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

S. [. Gillet, E. G. Essid, and . Richard, On the Correlation of Automatic Audio and Visual Segmentations of Music Videos, IEEE Transactions on Circuits and Systems for Video Technology, pp.347-355, 2007.
DOI : 10.1109/TCSVT.2007.890831

P. [. Gouyon and . Herrera, Exploration of techniques for automatic labeling of audio drum tracks, Proceedings of MOSART : Workshop on Current Directions in Computer Music, 2001.

P. [. Gouyon, E. P. Herrera, and . Cano, Pulse-dependent analyses of percussive music, IEEE International Conference on Acoustics Speech and Signal Processing, 2002.
DOI : 10.1109/ICASSP.2002.5745626

P. [. Gouyon, E. A. Herrera, and . Dehamel, Automatic labeling of unpitched percussion sounds, Proceedings of the 114th AES convention, 2003.

M. Goto, H. Hashiguchi, T. Nishimura, and E. R. Oka, Rwc music database : Popular, classical , and jazz music databases, Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR'02), pp.287-288, 2002.

]. O. Gil03 and . Gillet, Amélioration d'un système de transcription de phrases de Tabla, 2003.

M. [. Ghahramani and . Jordan, Factorial hidden markov models, Machine Learning, vol.29, issue.2/3, pp.245-273, 1997.
DOI : 10.1023/A:1007425814087

J. [. Ghias, D. Logan, B. C. Chamberlin, and . Smith, Query by humming, Proceedings of the third ACM international conference on Multimedia , MULTIMEDIA '95, pp.231-236, 1995.
DOI : 10.1145/217279.215273

M. Goto and Y. Muraoka, A sound source separation system for percussion instruments, Transactions of the Institute of Electronics, pp.901-911, 1994.

M. Goto and Y. Muraoka, A real-time beat tracking system for audio signals, Proceedings of the International Computer Music Conference (ICMC'95), pp.171-174, 1995.

]. M. Gon03 and . Gondry, The Work of Director Michel Gondry, 2003.

G. [. Gillet and . Richard, Automatic labelling of Tabla signals, Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR'03), 2003.

G. [. Gillet and . Richard, Automatic transcription of drum loops, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.
DOI : 10.1109/ICASSP.2004.1326815

]. O. Gr05a, G. Gillet, and . Richard, Automatic transcription of drum sequences using audiovisual features, Proceedings of the 2005 IEEE Conference on Acoustics, Speech and Signal Processing (ICASSP'05), 2005.

]. O. Gr05b, G. Gillet, and . Richard, Drum loops retrieval from spoken queries, Journal of Intelligent Information Systems, vol.24, issue.2, pp.159-177, 2005.

]. O. Gr05c, G. Gillet, and . Richard, Drum track transcription of polyphonic music using noise subspace projection, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), 2005.

]. O. Gr05d, G. Gillet, and . Richard, Extraction and remixing of drum tracks from polyphonic music signals, Proceedings of the 2005 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'05), 2005.

]. O. Gr05e, G. Gillet, and . Richard, Indexing and querying drum loops databases, Proceedings of the 4th International Workshop on Content-Based Multimedia Indexing, 2005.

]. O. Gr06a, G. Gillet, and . Richard, Comparing Audio and Video Segmentations for Music Videos Indexing, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006.

]. O. Gr06b, G. Gillet, and . Richard, ENST-drums : an extensive audio-visual database for drum signals processing, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR'06), 2006.

G. [. Gillet and . Richard, Transcription and Separation of Drum Signals From Polyphonic Music, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.3, 2007.
DOI : 10.1109/TASL.2007.914120

J. [. Guyon, S. Weston, E. V. Barnhill, and . Vapnik, Gene selection for cancer classification using support vector machines, Machine Learning, pp.389-422, 2002.

]. A. Haz05 and . Hazan, Towards automatic transcription of expressive oral percussive performances, Proceedings of the 10th international conference on Intelligent user interfaces (IUI'05), pp.296-298, 2005.

M. [. Hershey and . Casey, Audiovisual sound separation via hidden markov models, Proceedings of the 15th Conference on Neural Information Processing Systems, Advances in Neural Information Processing Systems, 2002.

P. L. Van-hove, M. H. Hayes, J. S. Lim, and A. V. Oppenheim, Signal reconstruction from signed Fourier transform magnitude, IEEE Transactions on Acoustics Speech and Signal Processing, pp.1286-1293, 1983.
DOI : 10.1109/TASSP.1983.1164178

Y. [. Huang and . Lee, Kernel fisher's discriminant analysis in gaussian reproducing kernel hilbert space ? theory, 2006.

J. [. Hershey and . Movellan, Audio-vision : Using audio-visual synchrony to locate sounds, Advances in Neural Information Processing Systems, pp.813-819, 2000.

M. [. Hainsworth and . Macleod, Beat tracking with particle filtering algorithms, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684), 2003.
DOI : 10.1109/ASPAA.2003.1285827

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.11.2965

E. [. Hyvärinen and . Oja, Independent component analysis: algorithms and applications, Neural Networks, vol.13, issue.4-5, pp.411-430, 2000.
DOI : 10.1016/S0893-6080(00)00026-5

M. Helén and T. Virtanen, Separation of drums from polyphonic music using nonnegative matrix factorization and support vector machine, Proceedings of the 13th European Signal Processing Conference, 2005.

P. [. Hermus and . Wambacq, Assessment of signal subspace based speech enhancement for noise robust speech recognition, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.945-948, 2004.
DOI : 10.1109/ICASSP.2004.1326143

A. [. Herrera, E. A. Yeterian, and . Gouyon, Automatic Classification of Drum Sounds: A Comparison of Feature Selection Methods and Classification Techniques, Proceedings of the Second International Conference on Music and Artificial Intelligence (ICMAI'02), pp.69-80, 2002.
DOI : 10.1007/3-540-45722-4_8

I. [. Huet, E. B. Yahiaoui, and . Mérialdo, Image similarity for automatic video summarization, Proceedings of the 11th European Signal Processing Conference, 2002.

]. A. Hyv99 and . Hyvärinen, Fast and robust fixed-point algorithms for independent component analysis, IEEE Transactions on Neural Networks, pp.626-634, 1999.

J. [. Ikizler, L. Vasanth, E. D. Wong, and . Forsyth, Finding celebrities in video, 2006.

A. [. Jeannin and . Divakaran, MPEG-7 visual motion descriptors, IEEE Transactions on Circuits and Systems for Video Technology, pp.720-724, 2001.
DOI : 10.1109/76.927428

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.323.7878

]. T. Joa98 and . Joachims, Making large-scale support vector machine learning practical, Advances in Kernel Methods ? Support Vector Learning, 1998.

]. S. Jon03 and . Jonze, The Work of Director Spike Jonze, 2003.

]. M. Jør02 and . Jørgensen, Drumfinder, DSP-project on recognition of drum sounds in drum tracks, 2002.

]. I. Kam00 and . Kaminskyj, Multi-feature musical instrument sound classifier, Proceedings of the Australasian Computer Music Conference, 2000.

M. [. Kapur, E. G. Benning, and . Tzanetakis, Query by beatboxing : Music information retrieval for the DJ, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

]. A. Kkvb-+-05, A. Kapur, N. Kapur, G. Virji-babul, P. F. Tzanetakis et al., Gesture-Based Affective Computing on Motion Capture Data, Proceedings of the International Conference on Affective Computing and Intelligent Interaction, ACII'05, 2005.

]. A. Kla99 and . Klapuri, Sound onset detection by applying psychoacoustic knowledge, IEEE International Conference on Acoustics, Speech and Signal Processing, 1999.

A. Klapuri, Multipitch estimation and sound separation by the spectral smoothness principle, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), 2001.
DOI : 10.1109/ICASSP.2001.940384

]. A. Kla03 and . Klapuri, Musical meter estimation and music transcription, Proceedings of the Cambridge Music Processing Colloquium, 2003.

]. A. Kla04 and . Klapuri, Signal processing methods for the automatic transcription of music, 2004.

S. [. Kim, S. Y. Park, and . Shin, Rhythmic-Motion Synthesis Based on Motion-Beat Analysis, Proceedings of the 30th International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH2003), 2003.

]. J. Kru83, An Overview of Sequence Comparison

. Kruskal, Time Warps, String Edits, and Macromolecules : The Theory and Practice of Sequence Comparison, pp.1-44, 1983.

]. H. Kuh55 and . Kuhn, The hungarian method for the assignment problem, Naval Research Logistics Quarterly, vol.2, pp.83-97, 1955.

]. J. Lar01 and . Laroche, Estimating tempo, swing and beat locations in audio recordings, Proceedings of the 2001 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'01), pp.131-135, 2001.

]. J. Lar04 and . Laroche, Efficient Tempo and Beat Tracking in Audio Recordings, Journal of the Audio Engineering Society, vol.51, issue.4, pp.226-233, 2004.

G. Loosli, S. Canu, S. V. Vishwanathan, A. J. Smola, M. [. Chattopadhyay et al., Bo??tèBo??tè a outils SVM simple et rapide. Revue d'Intelligence Artificielle A supervised classification algorithm for note onset detection, EURASIP Journal on Advances in Signal Processing, p.1043745, 1155.

]. S. Lip05 and . Lipscomb, The perception of audio-visual composites : accent structure alignment of simple stimuli, pp.37-67, 2005.

R. [. Lerdahl and . Jackendoff, A generative Theory of tonal Music, 1983.

]. B. Log00 and . Logan, Mel frequency cepstral coefficients for music modeling, Proceedings of the 1st International Conference on Music Information Retrieval (ISMIR'00, 2000.

H. [. Lee and . Seung, Algorithms for non-negative matrix factorization, Advances in Neural Information Processing Systems, pp.556-562, 2001.

M. Li and R. Sleep, Melody classification using a similarity metric based on kolmogorov complexity, Proceedings of the 2nd Conference on Sound and Music Computing, 2005.

T. [. Murphy, E. K. Andersen, and . Jensen, Conducting Audio Files via Computer Vision, Lecture notes in Computer science, 2004.
DOI : 10.1007/978-3-540-24598-8_49

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.3.730

]. B. Mér95 and . Mérialdo, Modèles probabilistes etétiquetageet´etétiquetage automatique. T.A.L, traitement automatique des langues, traitements probabilistes et corpus, pp.7-9, 1995.

K. Mcguinness, O. Gillet, N. O. Connor, and E. G. Richard, Visual analysis for drum sequence transcription, AcceptéAccepté`Acceptéà la 17th European Signal Processing Conference, 2007.

]. J. Min05 and . Min, Human Activity Recognition using Motion Trajectories, 2005.

. Mir and . Mirex, Results of the MIREX Audio Drum Detection Contest

]. M. Mit98 and . Mitchell, An Introduction to Genetic Algorithms, 1998.

P. Mulhem, M. S. Kankanhalli, J. Yi, and E. H. Hassan, Pivot vector space approach for audio-video mixing, IEEE Multimedia, vol.10, issue.2, pp.28-40, 2003.
DOI : 10.1109/MMUL.2003.1195159

P. [. Marques and . Moreno, A study of musical instrument classification using gaussian mixture models and support vector machines, 1999.

[. Mitra, C. A. Murthy, and S. K. , Unsupervised feature selection using feature similarity, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.3, pp.301-312, 2002.
DOI : 10.1109/34.990133

]. D. Mur03 and . Murphy, Tracking a conductor's baton, Proceedings of the 12th Danish Conference on Pattern Recognition and Image Analysis, 2003.

G. [. Madsen and . Widmer, Music complexity measures predicting the listening experience, Proceedings of the 9th International Conference on Music Perception and Cognition (ICMPC'06), 2006.

J. [. Nielsen, E. J. Carstensen, and . Smedsgaard, Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping, Journal of Chromatography A, vol.805, issue.1-2, pp.17-35, 1998.
DOI : 10.1016/S0021-9673(98)00021-1

I. [. Nevill-manning and . Witten, Identifying hierarchical structure in sequences : A linear-time algorithm, Journal of Artificial Intelligence Research, vol.7, pp.67-82, 1997.

I. [. Nevill-manning, D. L. Witten, and . Maulsby, Compression by induction of hierarchical grammars, Proceedings of IEEE Data Compression Conference (DCC'94), pp.244-253, 1994.
DOI : 10.1109/DCC.1994.305932

T. Nakano, J. Ogata, M. Goto, and Y. Hiraga, A drum pattern retrieval method by voice percussion, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

S. [. Nayak, M. S. Srinivasan, and . Kankanhalli, Music synthesis for home videos: an analogy based approach, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, 2003.
DOI : 10.1109/ICICS.2003.1292728

P. [. Ozerov, R. Philippe, E. F. Gribonval, and . Bimbot, One microphone singing voice separation using source-adapted models, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., 2005.
DOI : 10.1109/ASPAA.2005.1540176

URL : https://hal.archives-ouvertes.fr/inria-00564491

]. I. Ori01 and . Orife, Riddim : A rhythm analysis and decomposition tool based on independent subspace analysis Master's thesis [oW03] University of Waikato. WEKA 3 : Machine Learning Software in Java, 2001.

]. J. Pau06 and . Paulus, Acoustic modelling of drum sounds with hidden markov models for music transcription, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2006.

A. [. Peeters, E. X. Burthe, and . Rodet, Toward automatic music audio summary generation from signal analysis, Proceedings of the 2nd International Conference on Music Information Retrieval (ISMIR'01), 2002.
URL : https://hal.archives-ouvertes.fr/hal-01161322

A. [. Peker and . Divakaran, Framework for measurement of the intensity of motion activity of video segments, Journal of Visual Communication and Image Representation, vol.15, issue.3, 2003.
DOI : 10.1016/j.jvcir.2004.04.007

S. [. Pampalk, E. G. Dixon, and . Widmer, Exploring Music Collections by Browsing Different Views, Proceedings of the 4th International Conference on Music Information Retrieval, 2003.
DOI : 10.1109/TSA.2002.800560

]. G. Pee03 and . Peeters, Automatic classification of large musical instrument databases using hierarchical classifiers with inertia ratio maximization, Proceedings of the 115th AES Convention, 2003.

]. G. Pee04 and . Peeters, A large Set of Audio Features for Sound Description (Similarity and Classification) in the CUIDADO project, 2004.

A. [. Paulus and . Klapuri, Measuring the similarity of rhythmic patterns, Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR'02), 2002.

]. J. Pk03a, A. Paulus, and . Klapuri, Conventional and periodic n-grams in the transcription of drum sequences, Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003.

]. J. Pk03b, A. Paulus, and . Klapuri, Model-based event labeling in the transcription of percussive audio signals, Proceedings of the 6th International Conference on Digital Audio Effects, 2003.

A. [. Paulus and . Klapuri, Music structure analysis by finding repeated parts, Proceedings of the 1st ACM workshop on Audio and music computing multimedia , AMCMM '06, 2006.
DOI : 10.1145/1178723.1178733

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.163.5699

]. J. Pla98 and . Platt, Fast training of support vector machines using sequential minimal optimization, Advances in Kernel Methods ? Support Vector Learning, 1998.

]. J. Pla00 and . Platt, Probabilistic outputs for support vector machines and comparison to regularized likelihood methods, Advances in Large Margin Classiers, pp.61-74, 2000.

G. Potamianos, C. Neti, J. Luettin, and E. I. Matthews, Audio-visual automatic speech recognition : An overview, Issues in Visual and Audio-Visual Speech Processing, 2004.

R. [. Pavlovic, T. S. Sharma, and . Huang, Visual interpretation of hand gestures for human-computer interaction: a review, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, issue.7, pp.677-695, 1997.
DOI : 10.1109/34.598226

S. [. Press, W. T. Teukoslky, B. P. Vetterling, and . Flannery, Numerical Recipes in C, 1992.

T. [. Paulus and . Virtanen, Drum transcription with nonnegative spectrogram factorisation, Proceedings of the 15th European Signal Processing Conference (EUSIP- CO'2005), 2005.

]. R. Qui93 and . Quinlan, C4.5 : Programs for Machine Learning (Morgan Kaufmann Series in Machine Learning), 1993.

]. L. Rab89 and . Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

]. C. Rap01 and . Raphael, Automated rhythm transcription, Proceedings of the 2nd International Conference on Music Information Retrieval (ISMIR'01), 2001.

J. [. Ravelli, M. B. Bello, and . Sandler, Drum sound analysis for the manipulation of rhythm in drum loops, Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.233-236, 2006.

J. [. Ravelli, E. M. Bello, and . Sandler, Automatic Rhythm Modification of Drum Loops, IEEE Signal Processing Letters, vol.14, issue.4, 2007.
DOI : 10.1109/LSP.2006.887783

R. [. Lin, P. H. Fan, and . Chen, Working set selection using second order information for training support vector machines, Journal of Machine Learning Research, vol.6, pp.1889-1918, 2005.

]. E. Ris02, Drum Analysis, 2002.

B. [. Rabiner and . Juang, Fundamentals of speech recognition, 1993.

O. [. Ridder, E. H. Munkelt, and . Kirchner, Adaptive Background Estimation and Foreground Detection using Kalman Filtering, Proceedings of the International Conference on recent Advances in Mechatronics (ICRAM'95), pp.193-199, 1995.

]. T. Ros01 and . Rossing, Acoustics of percussion instruments : Recent progress, Journal of Acoustical Science and Technology, vol.22, issue.3, pp.177-188, 2001.

]. S. Row01 and . Roweis, One microphone source separation, Advances in Neural Information Processing Systems, pp.793-799, 2001.

[. Ramona, G. Richard, and E. S. Essid, Combined supervised and unsupervised segmentation of radiophonic audio streams, Proceedings of the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007.

M. [. Smaragdis and . Casey, Audio/visual independent components, Proceedings of the 3rd International Conference on ICA and Blind Source Separation, 2003.

]. E. Sch98 and . Scheirer, Tempo and beat analysis of acoustic musical signals, Journal of the Acoustical Society of America, vol.103, issue.1, pp.588-601, 1998.

]. J. Sep01 and . Seppänen, Tatum Grid Analysis of Musical Signals, Proceedings of the 2001 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2001.

W. [. Stauffer and . Grimson, Adaptive background mixture models for real-time tracking, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149), 1999.
DOI : 10.1109/CVPR.1999.784637

F. [. Sandvold, E. P. Gouyon, and . Herrera, Percussion classification in polyphonic audio recordings using localized sound models, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

L. [. Sodoyer, C. Girin, J. L. Jutten, and . Schwartz, Developing an audio-visual speech source separation algorithm, Speech Communication, vol.44, issue.1-4, pp.113-125, 2004.
DOI : 10.1016/j.specom.2004.10.002

URL : https://hal.archives-ouvertes.fr/hal-00186591

M. [. Sonoda, Y. A. Goto, and . Muraoka, A WWW-based melody retrieval system, Proceedings of the International Computer Music Conference, pp.349-352, 1998.
DOI : 10.1002/ecjb.10073

J. Sillanpää, A. Klapuri, J. Seppänen, and E. T. Virtanen, Recognition of acoustic noise mixtures by combined bottom-up and top-down approach, Proceedings of the 10th European Signal Processing Conference, 2000.

A. [. Saitoh, E. H. Kodata, and . Tominaga, Integrated data processing between image and audio - musical instrument (piano) playing information processing, 6th International Conference on Image Processing and its Applications, pp.432-442, 1997.
DOI : 10.1049/cp:19970932

T. Shiratori, A. Nakazawa, and E. K. Ikeuchi, Detecting dance motion structure through music analysis, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings., 2004.
DOI : 10.1109/AFGR.2004.1301641

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.61.2282

. Spst-+-99-]-b, J. Schölkopf, J. Platt, A. J. Shawe-taylor, R. C. Smola et al., Estimating the support of a high-dimensional distribution, 1999.

J. [. Serra and . Smith, Spectral Modeling Synthesis: A Sound Analysis/Synthesis System Based on a Deterministic Plus Stochastic Decomposition, Computer Music Journal, vol.14, issue.4, 1990.
DOI : 10.2307/3680788

A. [. Schölkopf and . Smola, Learning with kernels, 2002.

]. D. Ssg-+-02, J. L. Sodoyer, L. Schwartz, J. Girin, C. Klinkisch et al., Separation of audiovisual speech sources : A new approach exploiting the audio-visual coherence of speech stimuli, EURASIP Journal on Applied Signal Processing, vol.11, pp.1165-1173, 2002.

S. [. Petersen, T. Sigurdssson, and . Lehn-schiøler, Mel frequency cepstral coefficients : An evaluation of robustness of mp3 encoded music, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR'06), 2006.

]. D. Std-+-05, K. Van-steelant, S. Tanghe, B. Degroeve, M. De-baets et al., Support vector machines for bass and snare drum recognition, Studies in Classification , Data Analysis and Knowledge Organisation, 2005.

B. [. Scheirer and . Vercoe, SAOL: The MPEG-4 Structured Audio Orchestra Language, Computer Music Journal, vol.46, issue.3, pp.31-51, 1999.
DOI : 10.2307/3681361

C. [. Shao, M. S. Xu, and . Kankanhalli, Automatically generating summaries for musical video, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429), 2003.
DOI : 10.1109/ICIP.2003.1246738

C. [. Shao, M. S. Xu, and . Kankanhalli, A New Approach to Automatic Music Video Summarization, Proceedings of the International Conference on Image Processing, 2004.

]. K. Tan05 and . Tanghe, MAMI -software -drum detection console application, 2005.

]. G. Tau91 and . Taubin, Estimation of planar curves, surfaces, and nonplanar space curves defined by implicit equations with applications to edge and range image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.13, issue.11, pp.1115-1138, 1991.

P. [. Tzanetakis and . Cook, Musical genre classification of audio signals, IEEE Transactions on Speech and Audio Processing, vol.10, issue.5, pp.293-301, 2002.
DOI : 10.1109/TSA.2002.800560

S. [. Tanghe, E. B. Degroeve, and . De-baets, An algorithm for detecting and labeling drum events in polyphonic music, Proceedings of the 2005 MIREX evaluation campaign, 2005.

. Tld-+-05-]-k, M. Tanghe, S. Lesaffre, M. Degroeve, B. Leman et al., Collecting Ground Truth Annotations for Drum Detection in Polyphonic Music, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), pp.50-57, 2005.

R. [. Tomasi and . Manduchi, Bilateral filtering for gray and color images, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), 1998.
DOI : 10.1109/ICCV.1998.710815

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.126.2091

T. [. Takeda, E. S. Nishimoto, and . Sagayama, Maximum likelihood method for estimating BIBLIOGRAPHIE rhythm and tempo, Proceedings of the Internation Symposium on Musical Acoustics, 2004.

]. C. Ud04a, C. Uhle, and . Dittmar, Drum pattern based genre classification of popular music, Proceedings of the AES 25th Internation Conference, 2004.

]. C. Ud04b, C. Uhle, and . Dittmar, Further steps towards drum transcription of polyphonic music, Proceedings of the 116th AES convention, 2004.

C. [. Uhle, E. T. Dittmar, and . Sporer, Extraction of drum tracks from polyphonic music using independent subspace analysis, Proceedings of the 4th International Symposium on Independent Component Analysis and Blind Signal Separation (ICA'03), 2003.

J. [. Uhle and . Herre, Estimation of tempo, micro time and time signature from percussive music, Proceedings of the 6th International Conference on Digital Audio Effects, 2003.

]. P. Vai93 and . Vaidyanathan, Multirate Systems and Filter Banks, 1993.

]. T. Vir03 and . Virtanen, Sound source separation using sparse coding with temporal continuity objective, Proceedings of the 2003 International Computer Music Conference, 2003.

]. E. Vr04a, X. Vincent, and . Rodet, Instrument identification in solo and ensemble music using independent subspace analysis, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

]. E. Vr04b, X. Vincent, and . Rodet, Underdetermined source separation with structured source priors, Proceedings of the 5th Symposium on Independent Component Analysis and Blind Signal Separation, 2004.

T. [. Witten and . Bell, The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression, IEEE Transactions on Information Theory, vol.37, issue.4, pp.1085-1094, 1991.
DOI : 10.1109/18.87000

W. Wang, D. Cosker, Y. Hicks, S. Sanei, and E. J. Chambers, Video Assisted Speech Source Separation, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., pp.425-428, 2005.
DOI : 10.1109/ICASSP.2005.1416331

P. [. Wanderley and . Depalle, Gesturally-controlled digital audio effects, Proceedings of the 5th International Conference on Digital Audio Effects (DAFX'02), 2001.

P. [. Wanderley and . Depalle, Gestural Control of Sound Synthesis, Proceedings of the IEEE, pp.632-644, 2004.
DOI : 10.1109/JPROC.2004.825882

F. [. Witten and . Eibe, Data mining, ACM SIGMOD Record, vol.31, issue.1, 2005.
DOI : 10.1145/507338.507355

. J. Webs, A. Weston, G. Elisseef, E. F. Bakir, and . Sinz, The Spider Matlab toolbox

T. [. Wu and . Huang, View-independent recognition of hand postures, Proceedings of the 2000 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2000.

C. [. Wang, K. H. Yang, and . Chang, Subspace tracking for speech enhancement in car noise environments, Proceedings of the 2004 IEEE Conference on Acoustics, Speech and Signal Processing, pp.789-792, 2004.

L. Xie, L. Kennedy, S. Chang, A. Divakaran, H. Sun et al., Discovering meaningful multimedia patterns with audio-visual concepts and associated text, 2004 International Conference on Image Processing, 2004. ICIP '04., 2004.
DOI : 10.1109/ICIP.2004.1421580

M. [. Yang and . Brown, Music database query with video by synesthesia observation, Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME'04), pp.305-308, 2004.

K. Yoshii, M. Goto, K. Komatani, T. Ogata, and E. H. Okuno, An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, pp.237-240, 2006.
DOI : 10.1109/ICASSP.2006.1661256

]. K. Ygo04a, M. Yoshii, H. G. Goto, and . Okuno, Automatic drum sound description for real-world music using template adaptation and matching methods, Proceedings of the 5th International Conference on Music Information Retrieval, 2004.

]. K. Ygo04b, M. Yoshii, H. G. Goto, and . Okuno, Drum sound identification for polyphonic music using template adaptation and matching methods, Proceedings of the 2004 Workshop on Statistical and Perceptual Audio Processing, 2004.

M. [. Yoshii, H. G. Goto, and . Okuno, INTER :D : a drum sound equalizer for controlling volume and timbre of drums, Proceedings of the 2nd European Workshop on the Integration of Knowledge, Semantics and Digital Media Technology (EWIMT'05), 2005.

B. [. Yahiaoui, E. B. Mérialdo, and . Huet, Generating summaries of multi-episode video, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001., 2001.
DOI : 10.1109/ICME.2001.1237794

J. [. Yamato, E. K. Ohya, and . Ishii, Recognizing human action in time-sequential images using hidden Markov model, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.379-385, 1992.
DOI : 10.1109/CVPR.1992.223161

R. [. Zhou and . Chellappa, From sample similarity to ensemble similarity: probabilistic distance measures in reproducing kernel Hilbert space, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.6, pp.917-929, 2006.
DOI : 10.1109/TPAMI.2006.120

]. H. Zet98 and . Zettl, Sight, Sound, Motion : Applied Media Aesthetics, 1998.

J. [. Zhou and . Hansen, Unsupervised Audio Stream Segmentation and Clustering via the Bayesian Information Criterion, Proceedings of the International Conference on Spoken Language Processing, 2000.

T. [. Zhu and . Hastie, Kernel Logistic Regression and the Import Vector Machine, Journal of Computational and Graphical Statistics, vol.14, issue.1, pp.185-205, 2005.
DOI : 10.1198/106186005X25619

A. [. Ziv and . Lempel, Compression of individual sequences via variable-rate coding, IEEE Transactions on Information Theory, vol.24, issue.5, pp.530-536, 1978.
DOI : 10.1109/TIT.1978.1055934

F. [. Zils, O. Pachet, E. F. Delerue, and . Gouyon, Automatic extraction of drum tracks from polyphonic music signals, Second International Conference on Web Delivering of Music, 2002. WEDELMUSIC 2002. Proceedings., 2002.
DOI : 10.1109/WDM.2002.1176209

]. E. Zwi77 and . Zwicker, Procedure for calculating loudness of temporally variable sounds, Journal of the Acoustical Society of America, p.277, 1977.

?. O. Bibliographie-de-l-'auteur-revues-internationales, G. Gillet, and . Richard, Transcription and Separation of Drum Signals from Polyphonic Music, Accepté pour publication dans les IEEE Transactions on Audio, Speech, and Language Processing , Special Issue on Music Information Retrieval

?. O. Gillet, S. Essid, and G. Richard, On the Correlation of Automatic Audio and Visual Segmentations of Music Videos, IEEE Transactions on Circuits and Systems for Video Technology, pp.347-355, 2007.
DOI : 10.1109/TCSVT.2007.890831

?. O. Gillet and G. Richard, Drum Loops Retrieval from Spoken Queries, Journal of Intelligent Information Systems, vol.103, issue.1, pp.159-177, 2005.
DOI : 10.1007/s10844-005-0321-9

URL : https://hal.archives-ouvertes.fr/hal-00477665

?. O. Conférencesconf´conférences-internationales-avec-comitécomit´comité-de-lecture, G. Gillet, and . Richard, Supervised and unsupervised Sequence Modelling for Drum Transcription, SoumisàSoumis`Soumisà 8th International Conference on Music Information Retrieval (ISMIR'07), 2007.

?. K. Mcguinness, O. Gillet, N. O. Connor, and G. , Richard Visual Analysis of Drum Playing, Accepté Accepté`Acceptéà la 15th European Signal Processing Conference, 2007.

?. O. Gillet and G. Richard, ENST-drums : an extensive audio-visual database for drum signals processing, Proceedings of the 7th International Conference on Music Information Retrieval (ISMIR'06), 2006.

?. O. Gillet and G. Richard, Comparing Audio and Video Segmentations for Music Videos Indexing, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1661202

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.459.2159

?. O. Gillet and G. Richard, Indexing and Querying Drum Loops Databases, Proceedings of the 4th International Workshop on Content-Based Multimedia Indexing, 2005.

?. O. Gillet and G. Richard, Extraction and remixing of drum tracks from polyphonic music signals, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., 2005.
DOI : 10.1109/ASPAA.2005.1540232

?. O. Gillet and G. Richard, Drum Track Transcription of Polyphonic Music using Noise Subspace Projection, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR'05), 2005.

?. O. Gillet and G. Richard, Automatic Transcription of Drum Sequences using Audiovisual Features, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., 2005.
DOI : 10.1109/ICASSP.2005.1415682

?. O. Gillet and G. Richard, Automatic transcription of drum loops, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.
DOI : 10.1109/ICASSP.2004.1326815

?. O. Gillet and G. Richard, Automatic Labelling of Tabla Signals, Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR'03), 2003.

?. G. Bascoul, O. Gillet, and E. G. Laurent, Marginal effects analysis : Identifying the most effective marginal levers in decision making. SoumisàSoumis`Soumisà Marketing Science, 2007.