S. Akaho, A kernel method for canonical correlation analysis, Proceedings of International Meeting on Psychometric Society (IMPS2001), 2001.

S. Amari and H. Nagaoka, Methods of Information Geometry, 2001.

N. Aronszajn, Theory of reproducing kernels, Transactions of the American Mathematical Society, vol.68, pp.337-404, 1950.

F. Bach and M. Jordan, Kernel independent component analysis, Journal of Machine Learning Research, vol.3, pp.1-48, 2002.

F. R. Bach and M. I. Jordan, Predictive low-rank decomposition for kernel methods, Proceedings of the 22nd international conference on Machine learning, ICML '05, 2005.
DOI : 10.1145/1102351.1102356

F. R. Bach, G. R. Lanckriet, and M. I. Jordan, Multiple kernel learning, conic duality, and the SMO algorithm, Twenty-first international conference on Machine learning, ICML '04, 2004.
DOI : 10.1145/1015330.1015424

G. Bejerano and G. Yona, Modeling protein families using probabilistic suffix trees, Proceedings of the third annual international conference on Computational molecular biology, RECOMB '99, pp.15-24, 1999.
DOI : 10.1145/299432.299445

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.21.2389

A. Ben-Hur and D. L. Brutlag, Remote homology detection: a motif based approach, ISMB (Supplement of Bioinformatics), pp.26-33, 2003.
DOI : 10.1093/bioinformatics/btg1002

C. Berg, J. P. Christensen, and P. Ressel, Harmonic Analysis on Semigroups. Number 100 in Graduate Texts in Mathematics, 1984.

A. Berlinet and C. Thomas-Agnan, Reproducing Kernel Hilbert Spaces in Probability and Statistics, 2003.
DOI : 10.1007/978-1-4419-9096-9

D. S. Bernstein, Matrix Mathematics: Theory, Facts, and Formulas with Application to Linear Systems Theory, 2005.
DOI : 10.1515/9781400833344

B. E. Boser, I. M. Guyon, and V. N. Vapnik, A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory, COLT '92, pp.144-152, 1992.
DOI : 10.1145/130385.130401

S. Boyd and L. Vandenberghe, Convex Optimization, 2004.

M. P. Brown, R. Hughey, A. Krogh, I. S. Mian, K. Sjölander et al., Using Dirichlet mixture priors to derive hidden Markov models for protein families, Proc. of First Int. Conf. on Intelligent Systems for Molecular Biology, pp.47-55, 1993.

M. P. Brown, W. N. Grundy, D. Lin, N. Cristianini, C. W. Sugnet et al., Knowledge-based analysis of microarray gene expression data by using support vector machines, Proc. Natl. Acad. Sci. USA, vol.97, issue.1, pp.262-267, 2000.
DOI : 10.1073/pnas.97.1.262

O. Catoni, Statistical learning theory and stochastic optimization, Ecole d'été de probabilités de Saint-Flour XXXI - 2001, Number 1851 in Lecture Notes in Mathematics, 2004.

O. Chapelle, P. Haffner, and V. Vapnik, Support vector machines for histogram-based image classification, IEEE Transactions on Neural Networks, vol.10, issue.5, p.1055, 1999.
DOI : 10.1109/72.788646

O. Chapelle, V. Vapnik, O. Bousquet, and S. Mukherjee, Choosing multiple parameters for support vector machines, Machine Learning, p.131, 2002.

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, vol.20, issue.3, p.273, 1995.
DOI : 10.1007/BF00994018

T. M. Cover and J. A. Thomas, Elements of Information Theory, 1991.

L. Csató and M. Opper, Sparse On-Line Gaussian Processes, Neural Computation, vol.14, issue.3, pp.641-668, 2002.
DOI : 10.1162/089976602317250933

F. Cucker and S. Smale, On the mathematical foundations of learning, Bulletin of the American Mathematical Society, vol.39, issue.01, p.39, 2002.
DOI : 10.1090/S0273-0979-01-00923-5

M. Cuturi and K. Fukumizu, Multiresolution kernels, arxiv cs, 2005.

M. Cuturi, K. Fukumizu, and J. Vert, Semigroup kernels on measures, Journal of Machine Learning Research, vol.6, pp.1169-1198, 2005.

M. Cuturi and J. Vert, The context-tree kernel for strings, Neural Networks, vol.18, issue.8, p.18, 2005.
DOI : 10.1016/j.neunet.2005.07.010

URL : https://hal.archives-ouvertes.fr/hal-00433583

M. Cuturi and J. Vert, Semigroup kernels on finite sets, Advances in Neural Information Processing Systems 17, pp.329-336, 2005.

C. Davis, All convex invariant functions of hermitian matrices, Archiv der Mathematik, vol.8, issue.4, pp.276-278, 1957.
DOI : 10.1007/BF01898787

J. Dieudonné, Calcul Infinitésimal, 1968.

R. Durbin, S. Eddy, A. Krogh, and G. Mitchison, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, 1998.

J. Eichhorn and O. Chapelle, Object categorization with SVM: Kernels for local features, 2004.

D. M. Endres and J. E. Schindelin, A new metric for probability distributions, IEEE Transactions on Information Theory, vol.49, issue.7, pp.1858-1860, 2003.
DOI : 10.1109/TIT.2003.813506

E. Eskin, W. Noble, and Y. Singer, Protein Family Classification Using Sparse Markov Transducers, Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, 2000.
DOI : 10.1089/106652703321825964

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.6705

B. Fuglede and F. Topsøe, Jensen-Shannon divergence and Hilbert space embedding, International Symposium on Information Theory (ISIT 2004), Proceedings, p.31, 2004.
DOI : 10.1109/ISIT.2004.1365067

K. Fukumizu, F. Bach, and A. Gretton, Consistency of kernel canonical correlation analysis, 2005.

K. Fukumizu, F. R. Bach, and M. I. Jordan, Dimensionality reduction for supervised learning with reproducing kernel Hilbert spaces, Journal of Machine Learning Research, vol.5, pp.73-99, 2004.

F. Girosi, M. Jones, and T. Poggio, Regularization Theory and Neural Networks Architectures, Neural Computation, vol.7, issue.2, pp.219-269, 1995.
DOI : 10.1162/neco.1995.7.2.219

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.48.9258

M. Gribskov and N. L. Robinson, Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching, Computers & Chemistry, vol.20, issue.1, pp.25-33, 1996.
DOI : 10.1016/S0097-8485(96)80004-0

B. Haasdonk, Feature space interpretation of SVMs with indefinite kernels, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.27, issue.4, pp.482-492, 2005.
DOI : 10.1109/TPAMI.2005.78

B. Haasdonk and D. Keysers, Tangent distance kernels for support vector machines, Object recognition supported by user interaction for service robots, pp.864-868, 2002.
DOI : 10.1109/ICPR.2002.1048439

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.2952

T. Hastie, R. Tibshirani, and J. Friedman, Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2001.

D. Haussler, Convolution kernels on discrete structures, 1999.

M. Hein and O. Bousquet, Hilbertian metrics and positive definite kernels on probability measures, Proceedings of AISTATS 2005, 2005.

S. Hua and Z. Sun, Support vector machine approach for protein subcellular localization prediction, Bioinformatics, vol.17, issue.8, pp.721-728, 2001.
DOI : 10.1093/bioinformatics/17.8.721

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.138.889

T. Hubbard, A. Murzin, S. Brenner, and C. Chothia, SCOP: a structural classification of proteins database, Nucleic Acids Research, pp.236-239, 1997.

T. Jaakkola, M. Diekhans, and D. Haussler, A Discriminative Framework for Detecting Remote Protein Homologies, Journal of Computational Biology, vol.7, issue.1-2, pp.95-114, 2000.
DOI : 10.1089/10665270050081405

T. S. Jaakkola and D. Haussler, Exploiting Generative Models in Discriminative Classifiers, Advances in Neural Information Processing Systems 11, 1999.

T. Jebara, R. Kondor, and A. Howard, Probability product kernels, Journal of Machine Learning Research, vol.5, pp.819-844, 2004.

T. Joachims, Text categorization with Support Vector Machines: Learning with many relevant features, 1997.
DOI : 10.1007/BFb0026683

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.11.6124

T. Joachims, Learning to Classify Text Using Support Vector Machines: Methods, Theory, and Algorithms, 2002.
DOI : 10.1007/978-1-4615-0907-3

H. Kashima, K. Tsuda, and A. Inokuchi, Marginalized kernels between labeled graphs, Proceedings of the Twentieth International Conference on Machine Learning, pp.321-328, 2003.

G. S. Kimeldorf and G. Wahba, Some results on Tchebycheffian spline functions, Journal of Mathematical Analysis and Applications, vol.33, issue.1, pp.82-95, 1971.
DOI : 10.1016/0022-247X(71)90184-3

URL : http://doi.org/10.1016/0022-247x(71)90184-3

M. Koecher, Positivitätsbereiche im R^n, American Journal of Mathematics, vol.79, issue.3, pp.575-596, 1957.
DOI : 10.2307/2372563

R. Kondor and T. Jebara, A kernel between sets of vectors, Proceedings of the Twentieth International Conference on Machine Learning, pp.361-368, 2003.

R. Kondor and J. Lafferty, Diffusion kernels on graphs and other discrete input spaces, Proceedings of the Nineteenth International Conference on Machine Learning, pp.315-322, 2002.

J. Lafferty and G. Lebanon, Information diffusion kernels, Advances in Neural Information Processing Systems 14, 2002.

J. Lafferty and G. Lebanon, Diffusion kernels on statistical manifolds, Journal of Machine Learning Research, vol.6, pp.129-163, 2005.

G. R. Lanckriet, T. De Bie, N. Cristianini, M. I. Jordan, and W. S. Noble, A statistical framework for genomic data fusion, Bioinformatics, vol.20, issue.16, pp.2626-2635, 2004.
DOI : 10.1093/bioinformatics/bth294

C. Leslie, E. Eskin, and W. S. Noble, The spectrum kernel: a string kernel for SVM protein classification, Proceedings of the Pacific Symposium on Biocomputing 2002, pp.564-575, 2002.

C. Leslie, E. Eskin, J. Weston, and W. S. Noble, Mismatch string kernels for SVM protein classification, Advances in Neural Information Processing Systems 15, 2003.

A. S. Lewis, The mathematics of eigenvalue optimization, Mathematical Programming, pp.155-176, 2003.
DOI : 10.1007/s10107-003-0441-3

M. Li, X. Chen, X. Li, B. Ma, and P. Vitányi, The Similarity Metric, IEEE Transactions on Information Theory, vol.50, issue.12, pp.3250-3264, 2004.
DOI : 10.1109/TIT.2004.838101

L. Liao and W. S. Noble, Combining pairwise sequence similarity and support vector machines for remote protein homology detection, Proceedings of the sixth annual international conference on Computational biology, RECOMB '02, pp.225-232, 2002.
DOI : 10.1145/565196.565225

H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, and C. Watkins, Text classification using string kernels, Journal of Machine Learning Research, vol.2, pp.419-444, 2002.

P. Mahé, N. Ueda, T. Akutsu, J. Perret, and J. Vert, Extensions of marginalized graph kernels, Twenty-first international conference on Machine learning, ICML '04, pp.552-559, 2004.
DOI : 10.1145/1015330.1015446

G. Matheron, Les variables régionalisées et leur estimation, 1965.

T. Melzer, M. Reiter, and H. Bischof, Nonlinear Feature Extraction Using Generalized Canonical Correlation Analysis, Proceedings of International Conference on Artificial Neural Networks (ICANN), pp.353-360, 2001.
DOI : 10.1007/3-540-44668-0_50

J. Mercer, Functions of Positive and Negative Type, and Their Connection with the Theory of Integral Equations, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.83, issue.559, pp.415-446, 1909.
DOI : 10.1098/rspa.1909.0075

K. S. Miller, Some Eclectic Matrix Theory, 1987.

P. J. Moreno, P. P. Ho, and N. Vasconcelos, A Kullback-Leibler divergence based kernel for SVM classification in multimedia applications, Advances in Neural Information Processing Systems 16, 2004.

I. Nemenman, F. Shafee, and W. Bialek, Entropy and inference, revisited, Advances in Neural Information Processing Systems 14, 2002.

W. S. Noble and L. Liao, Combining pairwise sequence similarity and support vector machines for remote protein homology detection, Proceedings of the Sixth Annual International Conference on Research in Computational Molecular Biology, pp.225-232, 2002.

C. S. Ong, A. J. Smola, and R. C. Williamson, Learning the kernel with hyperkernels, Journal of Machine Learning Research, vol.6, pp.1043-1071, 2005.

F. Österreicher and I. Vajda, A new class of metric divergences on probability spaces and its applicability in statistics, Annals of the Institute of Statistical Mathematics, vol.55, issue.3, pp.639-653, 2003.
DOI : 10.1007/BF02517812

E. Parzen, Extraction and Detection Problems and Reproducing Kernel Hilbert Spaces, Journal of the Society for Industrial and Applied Mathematics Series A Control, vol.1, issue.1, pp.35-62, 1962.
DOI : 10.1137/0301004

J. Platt, Fast training of support vector machines using sequential minimal optimization, Advances in Kernel Methods: Support Vector Learning, 1999.

M. Pontil and A. Verri, Support vector machines for 3D object recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.6, pp.637-646, 1998.
DOI : 10.1109/34.683777

L. Ralaivola, J. S. Swamidass, H. Saigo, and P. Baldi, Graph kernels for chemical informatics, Neural Networks, vol.18, issue.8, p.18, 2005.
DOI : 10.1016/j.neunet.2005.07.009

C. Rao, S. Amari, O. Barndorff-Nielsen, R. Kass, S. Lauritzen et al., Differential metrics in probability spaces, Differential Geometry in Statistical Inference, 1987.

G. Rätsch and S. Sonnenburg, Accurate splice site prediction for Caenorhabditis elegans, Kernel Methods in Computational Biology, 2004.

W. Rudin, Fourier Analysis on Groups, 1962.

G. Salton, Automatic Text Processing, 1989.

I. Schoenberg, Positive definite functions on spheres, Duke Mathematical Journal, vol.9, issue.1, pp.96-108, 1942.
DOI : 10.1215/S0012-7094-42-00908-6

URL : http://projecteuclid.org/download/pdf_1/euclid.dmj/1077493072

B. Schölkopf, A. Smola, and K. Müller, Nonlinear Component Analysis as a Kernel Eigenvalue Problem, Neural Computation, vol.10, issue.5, pp.1299-1319, 1998.
DOI : 10.1162/089976698300017467

B. Schölkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2002.

B. Schölkopf, K. Tsuda, and J. Vert, Kernel Methods in Computational Biology, 2004.

B. Schölkopf, J. Weston, E. Eskin, C. Leslie, and W. S. Noble, A Kernel Approach for Learning from almost Orthogonal Patterns, Proceedings of ECML 2002 13th European Conference on Machine Learning, pp.511-528, 2002.
DOI : 10.1007/3-540-36755-1_44

M. Seeger, Covariance kernels from Bayesian generative models, Advances in Neural Information Processing Systems 14, pp.905-912, 2002.

M. Seeger, Gaussian Processes for Machine Learning, International Journal of Neural Systems, vol.14, issue.02, pp.69-106, 2004.
DOI : 10.1142/S0129065704001899

H. Shimodaira, K. Noma, M. Nakai, and S. Sagayama, Dynamic time-alignment kernel in support vector machine, Advances in Neural Information Processing Systems 14, 2002.

V. Sindhwani, P. Niyogi, and M. Belkin, Beyond the point cloud, Proceedings of the 22nd international conference on Machine learning, ICML '05, 2005.
DOI : 10.1145/1102351.1102455

N. Smith and M. Gales, Speech recognition using SVMs, Advances in Neural Information Processing Systems 14, 2002.

A. N. Tikhonov and V. Y. Arsenin, Solutions of Ill-Posed Problems, 1977.

K. Tsuda, S. Akaho, and K. Asai, The EM algorithm for kernel matrix completion with auxiliary data, Journal of Machine Learning Research, vol.4, pp.67-81, 2003.

K. Tsuda, S. Akaho, M. Kawanabe, and K. Müller, Asymptotic Properties of the Fisher Kernel, Neural Computation, vol.16, issue.1, pp.115-137, 2004.
DOI : 10.1093/bioinformatics/16.9.799

K. Tsuda, M. Kawanabe, G. Rätsch, S. Sonnenburg, and K. Müller, A New Discriminative Kernel from Probabilistic Models, Neural Computation, vol.14, issue.10, pp.2397-2414, 2002.
DOI : 10.1023/A:1007618119488

K. Tsuda, T. Kin, and K. Asai, Marginalized kernels for biological sequences, Bioinformatics, vol.18, issue.Suppl 1, pp.268-275, 2002.
DOI : 10.1093/bioinformatics/18.suppl_1.S268

K. Tsuda and W. Noble, Learning kernels from biological networks by maximizing entropy, Bioinformatics, vol.20, issue.Suppl 1, pp.326-333, 2004.
DOI : 10.1093/bioinformatics/bth906

K. Tsuda, G. Rätsch, and M. K. Warmuth, Matrix exponentiated gradient updates for on-line learning and Bregman projection, Journal of Machine Learning Research, vol.6, pp.995-1018, 2005.

V. N. Vapnik, Statistical Learning Theory, 1998.

J. Vert, A tree kernel to analyse phylogenetic profiles, Bioinformatics, vol.18, issue.Suppl 1, pp.276-284, 2002.
DOI : 10.1093/bioinformatics/18.suppl_1.S276

URL : https://hal.archives-ouvertes.fr/hal-00433591

J. Vert and M. Kanehisa, Graph-driven features extraction from microarray data using diffusion kernels and kernel CCA, Advances in Neural Information Processing Systems 15, 2003.

J. Vert, H. Saigo, and T. Akutsu, Local alignment kernels for protein sequences, Kernel Methods in Computational Biology, 2004.

J. Vert and Y. Yamanishi, Supervised graph inference, Advances in Neural Information Processing Systems 17, 2005.

R. Vert and J. Vert, Consistency and convergence rates of one-class SVM and related algorithms, 2005.

G. Wahba, Spline Models for Observational Data, CBMS-NSF Regional Conference Series in Applied Mathematics, vol.59, 1990.
DOI : 10.1137/1.9781611970128

C. Watkins, Dynamic alignment kernels, Advances in Large Margin Classifiers, pp.39-50, 2000.

F. M. Willems, Y. M. Shtarkov, and T. J. Tjalkens, The context-tree weighting method: basic properties, IEEE Transactions on Information Theory, vol.41, issue.3, pp.653-664, 1995.
DOI : 10.1109/18.382012

L. Wolf and A. Shashua, Learning over sets using kernel principal angles, Journal of Machine Learning Research, vol.4, pp.913-931, 2003.

D. Zhang, X. Chen, and W. S. Lee, Text classification with kernels on the multinomial manifold, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '05, pp.266-273, 2005.
DOI : 10.1145/1076034.1076081

J. Zhu and T. Hastie, Kernel Logistic Regression and the Import Vector Machine, Advances in Neural Information Processing Systems 14, pp.1081-1088, 2002.
DOI : 10.1198/106186005X25619