80391519b7d8c2c8d8f3d6bf59e5141a8e82697e
[ppam-mpi.git] / README.md
1 # Parallel clustering with a k-medoids algorithm
2
3 Joint work with [Jairo Cugliari](http://eric.univ-lyon2.fr/~jcugliari/)
4
5 ---
6
7 This C program runs the k-medoid algorithm on several subsets of one (presumably big) dataset.
8 The computed medoids are then merged iteratively, until we get a final set of k centers.
9
10 ---
11
12 The folder "communication/" contains latex sources (and generated pdf files) of a short paper submitted
13 to the Journées de Statistique in Rennes, France (2014), and also the slides presented at this event.
14
15 The other folder contains all the C code; but not the EDF (french electricity company) datasets, because they
16 are not public. Since the (de)serialization process in code/src/TimeSeries/ is tailored for these data,
17 it is necessary to adapt this small part of the code to use any other custom time-series files.