X-Git-Url: https://git.auder.net/css/rpsls.css?a=blobdiff_plain;f=code%2FREADME;h=6a539afe37f422a70ebd0953ecc4675860a52219;hb=b81706231c887d593a0eed89940427b577c57ff0;hp=5546cf67e3cc56e9e59dd40d98c9e0a2c476febc;hpb=311c5c078a9fc8ec6bb449fa685dbb1657d75d39;p=epclust.git diff --git a/code/README b/code/README index 5546cf6..6a539af 100644 --- a/code/README +++ b/code/README @@ -1,14 +1,39 @@ -To compile source code : +0) Download & compile Benjamin's specific library - (mkdir -p (src|test)/build - cd (src|test)/build - cmake .. + git clone git@auder.net:cgds + cd cgds + bash makeMakefile.sh src + make src + sudo make install + +Make sure that the install destination is on the LD_LIBRARY_PATH environment variable. + +1) Compile source code for 1st stage clustering + + mkdir -p build/stage1/src + cd build/stage1/src + cmake ../../../stage1/src make -Usage : +2) #repeat previous lines for stage 2 ??? + + +NOTA: Need to have openmpi, mpich (compiler for mpi) libxml, and libgsl installed. + + +Usage (stage 1) : + +Serialize input data using + + ppam.exe serialize inputfile_edf outputfile_edf 1 0 + +# 1 indicates data is by column +# 0 means process all the rows mpirun -np nbProcess ppam.exe cluster ifilename nbSeriesInChunk nbClusters randomize p_for_dissims +## ex. > mpirun -np 4 ./ppam.exe cluster ~/tmp/2009.bin 5000 200 1 2 + Where : nbProcess = number of simultaneous processes ifilename = path to serialized dataset (read below) @@ -17,6 +42,12 @@ Where : randomize = 1 to dispatch time-series at random. 0 to process them in order p_for_dissims = the 'p' of L_p distance used to compute dissimilarities + +The results are stored in ppamResult.xml (curves ids and ranks) while ppamFinalSeries.bin +are the curves used in the last clustering step. The ranks in ppamResult.xml refer to the +curves in ppamFinalSeries.bin + + Note : custom [de]serialization. Consider writing your own in src/TimeSeries/ folder if you plan to test the package.