Polyphonic music transcription by non-negative sparse coding of power spectra, Proceedings of the International Conference on Music Information Retrieval, pp.318-325, 2004. ,
Sinusoidal model based on instantaneous frequency attractors, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1292-1300, 2006. ,
DOI : 10.1109/TSA.2005.858545
Accurate tempo estimation based on harmonic + noise decomposition, EURASIP Journal on Advances in Signal Processing, vol.2007, issue.1, 2007. ,
DOI : 10.1016/0047-259X(86)90017-5
A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture, IEEE Transactions on Signal Processing, vol.58, issue.1, pp.121-133, 2010. ,
DOI : 10.1109/TSP.2009.2030854
URL : https://hal.archives-ouvertes.fr/inria-00489529
On the stability of multiplicative update algorithms. Application to non-negative matrix factorization, Institut TELECOM ,
A robust mid-level representation for harmonic content in music signals, Proceedings of the International Conference on Music Information Retrieval, pp.311-322, 2005. ,
Séparation de plusieurs sources sonores avec un seul microphone, 2003. ,
Non negative sparse representation for Wiener based source separation with a single sensor, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., pp.613-616, 2003. ,
DOI : 10.1109/ICASSP.2003.1201756
URL : https://hal.archives-ouvertes.fr/inria-00574784
Audio source separation with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.191-199, 2006. ,
DOI : 10.1109/TSA.2005.854110
URL : https://hal.archives-ouvertes.fr/inria-00544949
Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.538-549, 2010. ,
DOI : 10.1109/TASL.2010.2041381
URL : https://hal.archives-ouvertes.fr/inria-00557088
Calculation of a constant Q spectral transform, The Journal of the Acoustical Society of America, vol.89, issue.1, pp.425-434, 1991. ,
DOI : 10.1121/1.400476
Tracking melody in polyphonic audio, Music Information Retrieval Evaluation eXchange, 2008. ,
Multiple f0 estimation in polyphonic music (MIREX 2008). Extended abstract for the Music Information Retrieval Evaluation eXchange, 2008. ,
Component separation with flexible models. Application to the separation of astrophysical emissions ,
Monte Carlo methods for Tempo Tracking and Rhythm Quantization, Journal of Artificial Intelligence Research, vol.18, pp.45-81, 2003. ,
Rhythm Quantization for Transcription, Proceedings of the AISB'99 Symposium on Musical Creativity, pp.140-146, 1999. ,
DOI : 10.2307/3680894
Constrained non-negative matrix factorisation method for EEG analysis in early detection of Alzheimer's disease, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp.893-896, 2006. ,
Multi-Pitch Estimation, 2009. ,
Generalized component analysis and blind source separation methods for analyzing multichannel brain signals, 2004. ,
Generalized independent component analysis and its applications in processing of multisensory biomedical data, Proceedings of IVth International Workshop Computational Problems of Electrical Engineering, pp.13-24, 2002. ,
Independent component analysis, A new concept?, Signal Processing, vol.36, issue.3, pp.287-314, 1994. ,
DOI : 10.1016/0165-1684(94)90029-9
URL : https://hal.archives-ouvertes.fr/hal-00417283
Perceptually-based evaluation of the errors usually made when automatically transcribing music, ISMIR, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00452615
Bayesian harmonic models for musical signal analysis (with discussion), Bayesian Statistics VII, 2003. ,
Bayesian analysis of polyphonic western tonal music, The Journal of the Acoustical Society of America, vol.119, issue.4, pp.2498-2517, 2006. ,
DOI : 10.1121/1.2168548
URL : https://hal.archives-ouvertes.fr/inria-00120240
Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing, The Journal of the Acoustical Society of America, vol.93, issue.6, pp.3271-3290, 1993. ,
DOI : 10.1121/1.405712
YIN, a fundamental frequency estimator for speech and music, The Journal of the Acoustical Society of America, vol.111, issue.4, pp.1917-1930, 2002. ,
DOI : 10.1121/1.1458024
Maximum Likelihood from Incomplete Data via the EM Algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, issue.1, pp.1-38, 1977. ,
Generalized nonnegative matrix approximations with Bregman divergences, Proceeding of the Neural Information Processing Systems (NIPS) Conference, 2005. ,
Extraction of the Melody Pitch Contour from Polyphonic Audio. Extended abstract for the Music Information Retrieval Evaluation eXchange, 2005. ,
Audio melody extraction for MIREX 2009. Extended abstract for the Music Information Retrieval Evaluation eXchange, 2009. ,
Harmonically informed multi-pitch tracking, Proceedings of the International Society on Music Information Retrieval conference, pp.333-338, 2009. ,
Singer melody extraction in polyphonic signals using source separation methods, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.169-172, 2008. ,
DOI : 10.1109/ICASSP.2008.4517573
Single sensor singer/music separation using a source/filter model of the singer voice, ACOUSTICS, 2008. ,
Main melody extraction from polyphonic music excerpts using a source/filter model of the main source. Extended abstract for the Music Information Retrieval Evaluation eXchange, 2008. ,
An iterative approach to monaural musical mixture de-soloing, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.105-108, 2009. ,
DOI : 10.1109/ICASSP.2009.4959531
Main instrument separation from stereophonic audio signals using a source/filter model, European Signal Processing Conference (EUSIPCO), 2009. ,
A source/filter approach to audio melody extraction. Extended abstract for the Music Information Retrieval Evaluation eXchange, 2009. ,
Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.564-575, 2010. ,
DOI : 10.1109/TASL.2010.2041114
Beat Tracking by Dynamic Programming, Journal of New Music Research, vol.51, issue.1, pp.51-60, 2007. ,
DOI : 10.1155/2007/67215
Classification-based melody transcription, Machine Learning, pp.439-456, 2006. ,
DOI : 10.1007/s10994-006-8373-9
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.90.4819
Identifying 'cover songs' with chroma features and dynamic programming beat tracking, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pp.1429-1432, 2007. ,
Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.6, 2009. ,
DOI : 10.1109/TASL.2009.2038819
URL : https://hal.archives-ouvertes.fr/inria-00510392
Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.32, issue.6, pp.1109-1121, 1984. ,
DOI : 10.1109/TASSP.1984.1164453
Instrument recognition in polyphonic music based on automatic taxonomies, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.68-80, 2006. ,
DOI : 10.1109/TSA.2005.860351
URL : https://hal.archives-ouvertes.fr/hal-00477670
Musical instrument recognition by pairwise classification strategies, IEEE Transactions on Audio, Speech, and Language Processing, vol.14, issue.4, pp.1401-1412, 2006. ,
DOI : 10.1109/tsa.2005.860842
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.222.7986
Acoustic Theory of Speech Production, 1970. ,
DOI : 10.1515/9783110873429
Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, 2009. ,
DOI : 10.1016/j.sigpro.2007.01.024
Towards an inverse constant Q transform, 120th Audio Engineering Society Convention, 2006. ,
Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation, Computational Intelligence and Neuroscience, vol.2008, 2008. ,
DOI : 10.1109/TSA.2005.858005
Multimodal similarity between musical streams for cover version detection, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010. ,
DOI : 10.1109/ICASSP.2010.5495217
URL : https://hal.archives-ouvertes.fr/hal-01132553
F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp.14-19, 2006. ,
DOI : 10.1109/ICASSP.2006.1661260
An F0 estimation method of vocal part in polyphonic music by using statistical modelling of singing voice and Viterbi search, pp.3682-3693, 2008. ,
Transcription and Separation of Drum Signals From Polyphonic Music, IEEE Transactions on Audio, Speech, and Language Processing, pp.529-540, 2008. ,
Melodic Description of Audio Signals for Music Content Processing, 2002. ,
A quantitative comparison of different approaches for melody extraction from polyphonic audio recordings, 2006. ,
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.757-760, 2000. ,
DOI : 10.1109/ICASSP.2000.859070
A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals, Speech Communication, vol.43, issue.4, pp.311-329, 2004. ,
DOI : 10.1016/j.specom.2004.07.001
PreFEst: A Predominant-F0 Estimation method for polyphonic musical audio signals, Proceedings of the 2nd Music Information Retrieval Evaluation eXchange, 2005. ,
RWC music database: Popular, classical, and jazz music databases, Proceedings of the International Conference on Music Information Retrieval, pp.287-288, 2002. ,
Signal estimation from modified short-time Fourier transform, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.32, pp.236-242, 1984. ,
Desoloing monaural audio using mixture models, Proceedings of the International Conference on Music Information Retrieval, 2007. ,
Musical instrument recognition in polyphonic audio using source-filter model for sound separation, Proceedings of the International Society for Music Information Retrieval Conference, pp.327-332, 2009. ,
Etude de la source glottique en voix parlée et chantée, 2001. ,
Measurement of pitch by subharmonic summation, The Journal of the Acoustical Society of America, vol.83, issue.1, pp.257-264, 1988. ,
DOI : 10.1121/1.396427
Singing pitch extraction from monaural polyphonic songs by contextual audio modeling and singing harmonic enhancement, Proceedings of the International Society for Music Information Retrieval conference, pp.26-30, 2009. ,
Temporal integration for audio classification with application to musical instrument classification, IEEE Transactions on Audio, Speech and Language Processing, vol.17, issue.1, pp.174-186, 2009. ,
Blind separation of disjoint orthogonal signals: demixing N sources from 2 mixtures, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), pp.2985-2988, 2000. ,
DOI : 10.1109/ICASSP.2000.861162
Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture, Signal Processing, vol.24, issue.1, pp.1-10, 1991. ,
DOI : 10.1016/0165-1684(91)90079-X
Multipitch estimation and sound separation by the spectral smoothness principle, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), pp.3381-3384, 2001. ,
DOI : 10.1109/ICASSP.2001.940384
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.2, pp.255-266, 2008. ,
DOI : 10.1109/TASL.2007.908129
Analysis, synthesis, and perception of voice quality variations among female and male talkers, The Journal of the Acoustical Society of America, vol.87, issue.2, pp.820-857, 1990. ,
DOI : 10.1121/1.398894
Comparison of subjective and objective evaluation methods for audio source separation, The Journal of the Acoustical Society of America, vol.123, issue.5, p.3569, 2008. ,
DOI : 10.1121/1.2934636
Nonlinear programming, Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability, pp.481-492, 1951. ,
Normalized Cuts for Predominant Melodic Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.2, pp.278-290, 2008. ,
DOI : 10.1109/TASL.2007.909260
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.184.9752
Resonance and the Perception of Musical Meter, Connection Science, vol.55, issue.2-3, p.3, 1994. ,
DOI : 10.1007/978-3-662-22492-2
Single channel speech and background segregation through harmonic-temporal clustering, Proceedings of the WASPAA 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.279-282, 2007. ,
Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction, Proceedings of the SAPA 2008 ISCA Workshop on Statistical and Perceptual Audition, pp.23-28, 2008. ,
Algorithms for Non-negative Matrix Factorization, Advances in Neural Information Processing Systems, pp.556-562, 2001. ,
Learning the parts of objects by nonnegative matrix factorization, Nature, vol.401, pp.788-791, 1999. ,
Décompositions parcimonieuses structurées : application à la représentation objet de la musique : modèles de signaux, algorithmes et applications, 2007. ,
Separation of Singing Voice From Music Accompaniment for Monaural Recordings, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.4, p.1475, 2007. ,
DOI : 10.1109/TASL.2006.889789
A Wavelet tour of signal processing, 2008. ,
Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, vol.41, issue.12, pp.3397-3415, 1993. ,
DOI : 10.1109/78.258082
A Connectionist Approach to Automatic Transcription of Polyphonic Piano Music, IEEE Transactions on Multimedia, vol.6, issue.3, pp.439-449, 2004. ,
DOI : 10.1109/TMM.2004.827507
Audio Melody Extraction Based on Timbral Similarity of Melodic Fragments, EUROCON 2005, The International Conference on "Computer as a Tool", 2005. ,
DOI : 10.1109/EURCON.2005.1630193
Speech enhancement using a soft-decision noise suppression filter, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.2, pp.137-145, 1980. ,
Speech analysis/Synthesis based on a sinusoidal representation, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.34, issue.4, pp.744-754, 1986. ,
DOI : 10.1109/TASSP.1986.1164910
Simulation of mechanical to neural transduction in the auditory receptor, The Journal of the Acoustical Society of America, vol.79, issue.3, pp.702-711, 1986. ,
DOI : 10.1121/1.393460
Complex-valued sparse representation based on smoothed l0 norm, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3881-3884, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-00271364
Shift invariant multilinear decomposition of neuroimaging data, NeuroImage, pp.1439-1450, 2008. ,
Template-based chord recognition: influence of the chord types, Proceedings of the International Society for Music Information Retrieval conference, pp.153-158, 2009. ,
Adaptation de modèles statistiques pour la séparation de sources mono-capteur ,
Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010. ,
DOI : 10.1109/TASL.2009.2031510
Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1564-1578, 2007. ,
DOI : 10.1109/TASL.2007.899291
URL : https://hal.archives-ouvertes.fr/inria-00544774
Melody Detection in Polyphonic Audio, 2006. ,
On the detection of melody notes in polyphonic audio, Proceedings of the International Conference on Music Information Retrieval, pp.11-15, 2005. ,
Simultaneous estimation of chord progression and downbeats from an audio file, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.121-124, 2008. ,
Cubyhum: A fully operational query by humming system, ISMIR 2002 Conference Proceedings, pp.187-196, 2002. ,
Template-Based Estimation of Time-Varying Tempo, EURASIP Journal on Advances in Signal Processing, vol.2007, issue.1, 2007. ,
DOI : 10.1109/5.18626
Beat-marker location using a probabilistic framework and linear discriminant analysis, Proceedings of the Digital Audio Effects (DAFX) conference, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01106384
Algorithms for nonnegative independent component analysis, IEEE Transactions on Neural Networks, vol.14, issue.3, pp.534-543, 2003. ,
DOI : 10.1109/TNN.2003.810616
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.19.4128
Melody Transcription From Music Audio: Approaches and Evaluation, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.4, pp.1247-1256, 2007. ,
DOI : 10.1109/TASL.2006.889797
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.331.1287
Melody characterization by a genetic fuzzy system, Proceedings of the 5th Sound and Music Computing Conference, pp.15-23, 2008. ,
A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989. ,
Separating a foreground singer from background music, International Symposium on Frontiers of Research on Speech and Music (FRSM), 2007. ,
Melody extraction using harmonic matching. Music Information Retrieval Evaluation eXchange, 2008. ,
A pattern recognition approach for melody track selection in MIDI files, Proceedings of the International Society for Music Information Retrieval conference, pp.8-12, 2006. ,
One microphone source separation, Advances in Neural Information Processing Systems, pp.793-799, 2001. ,
Query by humming of midi and audio using locality sensitive hashing, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2249-2252, 2008. ,
DOI : 10.1109/ICASSP.2008.4518093
Polyphonic music transcription using note event modeling, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005. ,
DOI : 10.1109/ASPAA.2005.1540233
Modelling of note events for singing transcription, Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004. ,
Accompaniment separation and karaoke application based on automatic melody transcription, 2008 IEEE International Conference on Multimedia and Expo, pp.1417-1420, 2008. ,
DOI : 10.1109/ICME.2008.4607710
Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music, Computer Music Journal, vol.1, issue.4, pp.72-86, 2008. ,
DOI : 10.1109/18.87000
Transcription of the singing melody in polyphonic music, Proceedings of the International Conference on Music Information Retrieval, pp.222-227, 2006. ,
Tempo and beat analysis of acoustic musical signals, The Journal of the Acoustical Society of America, vol.103, issue.1, pp.588-601, 1998. ,
DOI : 10.1121/1.421129
Chroma binary similarity and local alignment applied to cover song identification, IEEE Transactions on Audio, Speech and Language Processing, vol.16, pp.1138-1151, 2008. ,
Professionally produced music recordings Internet page, 2008. ,
Auditory model inversion for sound separation, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing, 1994. ,
DOI : 10.1109/ICASSP.1994.389714
Non-negative matrix factorization for polyphonic music transcription, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684), pp.177-180, 2003. ,
DOI : 10.1109/ASPAA.2003.1285860
Sparse and shift-invariant feature extraction from non-negative data, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ,
DOI : 10.1109/ICASSP.2008.4518048
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.219.3967
A scale for the measurement of a psychological magnitude: loudness, Psychological Review, vol.43, issue.5, pp.405-416, 1936. ,
DOI : 10.1037/h0058773
Transcription of vocal melodies using voice characteristics and algorithm fusion. Extended abstract for the Music Information Retrieval Evaluation eXchange, 2006. ,
URL : https://hal.archives-ouvertes.fr/inria-00544277
Musical source separation using time-frequency source priors, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.91-98, 2006. ,
DOI : 10.1109/TSA.2005.860342
URL : https://hal.archives-ouvertes.fr/inria-00544269
Performance measurement in blind audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1462-1469, 2006. ,
DOI : 10.1109/TSA.2005.858005
URL : https://hal.archives-ouvertes.fr/inria-00544230
Harmonic and inharmonic Nonnegative Matrix Factorization for Polyphonic Pitch transcription, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.109-112, 2008. ,
DOI : 10.1109/ICASSP.2008.4517558
URL : https://hal.archives-ouvertes.fr/inria-00544183
The 2008 Signal Separation Evaluation Campaign: A Community-Based Approach to Large-Scale Evaluation, Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA), pp.734-741, 2009. ,
DOI : 10.1109/TASL.2007.899176
URL : https://hal.archives-ouvertes.fr/inria-00544168
MTG MASS database, 2008. ,
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.3, pp.1066-1074, 2007. ,
DOI : 10.1109/TASL.2006.885253
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.330.1508
Sound Source Separation in Monaural Music Signals, 2006. ,
Analysis of polyphonic audio using source-filter model and non-negative matrix factorization, Advances in Models for Acoustic Processing, Neural Information Processing Systems Workshop, 2006. ,
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Transactions on Information Theory, vol.13, issue.2, pp.260-269, 1967. ,
Elimination of Biases in Loudness Judgments for Tones, The Journal of the Acoustical Society of America, vol.48, issue.6B, pp.1397-1403, 1970. ,
DOI : 10.1121/1.1912298
Beat tracking using the delta-phase matrix, Groupe AAO : Audio, Acoustique et Ondes, Télécom ParisTech, 2009. ,
Automatic generation of lead sheets from polyphonic music signals, Proceedings of International Society for Music Information Retrieval Conference, pp.26-30, 2009. ,
A variational EM algorithm for learning eigenvoice parameters in mixed signals, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.113-116, 2009. ,
DOI : 10.1109/ICASSP.2009.4959533
Speech separation using speaker-adapted eigenvoice speech models, Computer Speech & Language, vol.24, issue.1, pp.16-29, 2010. ,
DOI : 10.1016/j.csl.2008.03.003
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.148.6572
Using OQSTFT and a modified SHS to detect the melody in polyphonic music (MIREX 2009). Extended abstract for the Music Information Retrieval Evaluation eXchange, 2009. ,