Distributed clustering algorithms over a cloud computing platform

Abstract : The subjects addressed in this thesis are inspired by research problems faced by the Lokad company. These problems relate to the challenge of designing efficient parallelization techniques for clustering algorithms on a Cloud Computing platform. Chapter 2 provides an introduction to Cloud Computing technologies, especially those devoted to intensive computations. Chapter 3 describes in more detail Microsoft's Cloud Computing offering: Windows Azure. Chapter 4 details technical aspects of cloud application development and provides some cloud design patterns. Chapter 5 is dedicated to the parallelization of a well-known clustering algorithm: Batch K-Means. It provides insights into the challenges of a cloud implementation of distributed Batch K-Means, especially the impact of communication costs on the efficiency of the implementation. Chapters 6 and 7 are devoted to the parallelization of another clustering algorithm, Vector Quantization (VQ). Chapter 6 analyzes different parallelization schemes for VQ and presents the speedups to convergence that each provides. Chapter 7 provides a cloud implementation of these schemes. It highlights that the online nature of the VQ technique is what enables an asynchronous cloud implementation, which drastically reduces the communication costs introduced in Chapter 5.

https://pastel.archives-ouvertes.fr/tel-00744768
Contributor : Abes Star
Submitted on : Wednesday, June 4, 2014 - 5:53:19 PM
Last modification on : Wednesday, February 20, 2019 - 2:40:59 PM
Long-term archiving on : Thursday, September 4, 2014 - 1:10:48 PM

File

These_Durut_-_V1.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00744768, version 2

Citation

Matthieu Durut. Distributed clustering algorithms over a cloud computing platform. Other [cs.OH]. Télécom ParisTech, 2012. English. ⟨NNT : 2012ENST0055⟩. ⟨tel-00744768v2⟩
