<h1>Parallel clustering with k-medoids algorithm</h1>

<p>Joint work with <a href="http://eric.univ-lyon2.fr/~jcugliari/">Jairo Cugliari</a>.</p>

<hr/>

<p>This C program runs the k-medoids algorithm on several subsets of one (presumably large) dataset.
The computed medoids are then merged iteratively until a final set of k centers is obtained.</p>
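
<p>The two-stage idea can be sketched as follows. This is a minimal illustration and not the code of this repository:
it assumes scalar data points, an absolute-difference dissimilarity, contiguous chunks as subsets, and a single merge
level. The names <code>dist</code>, <code>cost_of</code>, <code>kmedoids</code> and <code>kmedoids_chunked</code> are
hypothetical. The per-chunk calls are independent, so they could run in parallel; here they run sequentially.</p>

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical dissimilarity: absolute difference between two scalars. */
static double dist(double a, double b) { return a > b ? a - b : b - a; }

/* Sum of distances from x[i] to every point currently labelled c. */
static double cost_of(const double *x, const size_t *lab, size_t n,
                      size_t i, size_t c)
{
    double s = 0.0;
    for (size_t j = 0; j < n; j++)
        if (lab[j] == c) s += dist(x[i], x[j]);
    return s;
}

/* Simple alternating k-medoids on x[0..n-1]; writes k medoid values to med. */
void kmedoids(const double *x, size_t n, size_t k, double *med)
{
    assert(k > 0 && k <= n);
    size_t *idx = malloc(k * sizeof *idx);   /* indices of current medoids */
    size_t *lab = malloc(n * sizeof *lab);   /* cluster label of each point */
    assert(idx != NULL && lab != NULL);

    /* Farthest-first initialization (an arbitrary choice for this sketch). */
    idx[0] = 0;
    for (size_t c = 1; c < k; c++) {
        size_t far = 0;
        double farD = -1.0;
        for (size_t i = 0; i < n; i++) {
            double d = dist(x[i], x[idx[0]]);
            for (size_t m = 1; m < c; m++)
                if (dist(x[i], x[idx[m]]) < d) d = dist(x[i], x[idx[m]]);
            if (d > farD) { farD = d; far = i; }
        }
        idx[c] = far;
    }

    int changed = 1;
    while (changed) {
        changed = 0;
        /* Assignment step: attach each point to its nearest medoid. */
        for (size_t i = 0; i < n; i++) {
            lab[i] = 0;
            for (size_t c = 1; c < k; c++)
                if (dist(x[i], x[idx[c]]) < dist(x[i], x[idx[lab[i]]]))
                    lab[i] = c;
        }
        /* Update step: in each cluster, keep the member of least total distance. */
        for (size_t c = 0; c < k; c++) {
            size_t best = idx[c];
            double bestCost = cost_of(x, lab, n, best, c);
            for (size_t i = 0; i < n; i++) {
                if (lab[i] != c) continue;
                double cost = cost_of(x, lab, n, i, c);
                if (cost < bestCost) { best = i; bestCost = cost; }
            }
            if (best != idx[c]) { idx[c] = best; changed = 1; }
        }
    }
    for (size_t c = 0; c < k; c++) med[c] = x[idx[c]];
    free(idx);
    free(lab);
}

/* Cluster each of `parts` contiguous chunks, then cluster the pooled medoids. */
void kmedoids_chunked(const double *x, size_t n, size_t parts, size_t k,
                      double *med)
{
    assert(parts > 0 && n / parts >= k);
    double *pool = malloc(parts * k * sizeof *pool);
    assert(pool != NULL);
    for (size_t p = 0; p < parts; p++) {
        size_t lo = p * n / parts, hi = (p + 1) * n / parts;
        kmedoids(x + lo, hi - lo, k, pool + p * k);
    }
    kmedoids(pool, parts * k, k, med);   /* single merge level in this sketch */
    free(pool);
}
```

<p>With more subsets, the merge step could itself be applied to groups of pooled medoids and repeated until only k
centers remain, which is closer to the iterative merging described above.</p>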

<hr/>

<p>The folder "communication/" contains the LaTeX sources (and generated PDF files) of a short paper submitted
to the Journées de Statistique in Rennes, France (2014), as well as the slides presented at this event.</p>

<p>The other folder contains all the C code, but not the EDF (French electricity company) datasets, which
are not public. Since the (de)serialization process in code/src/TimeSeries/ is tailored to these data,
this small part of the code must be adapted to read any other custom time-series files.</p>
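
<p>As an illustration of what such an adaptation might look like, the sketch below parses one series from a line of
text. Everything here is an assumption: the input format (one series per line, comma-separated values) and the function
name <code>parse_series_line</code> are hypothetical and do not reflect the actual interface in
code/src/TimeSeries/.</p>

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical format: one series per text line, values separated by commas.
   Parses up to `max` values from `line` into `out`; returns the count read. */
size_t parse_series_line(const char *line, double *out, size_t max)
{
    assert(line != NULL && out != NULL);
    size_t n = 0;
    const char *p = line;
    while (n < max) {
        char *end;
        double v = strtod(p, &end);   /* strtod skips leading whitespace */
        if (end == p) break;          /* no number found: stop */
        out[n++] = v;
        p = end;
        while (*p == ' ' || *p == '\t') p++;
        if (*p != ',') break;         /* end of line or unexpected character */
        p++;                          /* skip the separator */
    }
    return n;
}
```

<p>A reader built on this would call the function once per input line and pass the resulting arrays to the clustering
code in whatever structure it expects.</p>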