X-Git-Url: https://git.auder.net/?a=blobdiff_plain;f=TODO;h=3c1fd78ec8aadcc1ff7a9df076932c2db515f99b;hb=dc1aa85a96bbf815b0d896c22a9b4a539a9e8a9c;hp=cd454f21189bebff085bdbcad0d368c9c560be4d;hpb=65bd7506a22f58e425d5de32cd58b70efad2b2ab;p=epclust.git diff --git a/TODO b/TODO index cd454f2..3c1fd78 100644 --- a/TODO +++ b/TODO @@ -41,3 +41,28 @@ distor = getDistor("../old_C_code/build", "ppamResult.xml", "2009.bin") ?? Piste à explorer pour les comparaisons: H20 + +renvoyer nombre d'individues par classe ? (+ somme ?) +hypothèse : données déjà ordonnées 48 1/2H sur 365j +utiliser du mixmod avec modèles allongés +doit toutner sur machine plutôt standard, utilisateur "lambda" +utiliser Rcpp ? + +===== + +trategies for upscaling +From 25K to 25M : in 1000 chunks of 25K +Reference values : + K 0 = 200 super consumers (SC) + K ∗ = 15 nal clusters +1st strategy + Do 1000 times ONLY Energycon's 1st-step strategy on 25K clients + With the 1000 × K 0 SC perform a 2-step run leading to K ∗ clusters + +--> il faut s'arranger pour que + +2nd strategy + Do 1000 times Energycon's 2-step strategy on 25K clients leading to + 1000 × K ∗ intermediate clusters + Treat the intermediate clusters as individual curves and perform a + single 2-step run to get K ∗ nal clusters