KmL: k-means for longitudinal data

KmL, a new package implementing k-means, is presented. Its advantage over existing procedures is that it is designed specifically for longitudinal data. The package can handle missing values. It also provides an easy way to run the algorithm several times, varying the starting conditions and the number of clusters sought. Its graphical interface helps the user choose the appropriate number of clusters when the classic criterion is not efficient. Simulations on both artificial and real data are presented, and the performance of k-means on longitudinal data is compared with Proc Traj results. The simulations show that KmL (like Proc Traj) gives acceptable results for all polynomial examples, even with high levels of noise. On non-polynomial trajectories, KmL gives much better results. KmL is not model-based, which can be an advantage (nonparametric, more flexible) but also a disadvantage (no scope for testing goodness of fit).
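The idea of rerunning k-means on trajectories with different starting conditions and different numbers of clusters can be illustrated with a minimal sketch. This is not the KmL package (which is written for R) and does not reproduce its API. The function names (`distance`, `kmeans_once`, `best_partitions`), the handling of missing values (skipping time points where either value is missing and rescaling the distance), and the use of the within-cluster sum of squares as the selection score are all assumptions made for illustration.

```python
import random


def distance(a, b):
    """Euclidean distance over the time points observed in both trajectories.

    Missing values are encoded as None (an assumption of this sketch).
    """
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return float("inf")
    # Rescale so that pairs with fewer shared time points stay comparable.
    return (sum((x - y) ** 2 for x, y in pairs) * len(a) / len(pairs)) ** 0.5


def mean_trajectory(trajs, n_times):
    """Point-wise mean of a group of trajectories, ignoring missing values."""
    mean = []
    for t in range(n_times):
        vals = [tr[t] for tr in trajs if tr[t] is not None]
        mean.append(sum(vals) / len(vals) if vals else None)
    return mean


def kmeans_once(trajs, k, rng, max_iter=100):
    """One k-means run from a random start; returns (labels, within_ss)."""
    n_times = len(trajs[0])
    centers = [list(c) for c in rng.sample(trajs, k)]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: distance(tr, centers[j])) for tr in trajs
        ]
        if new_labels == labels:
            break  # assignments are stable: converged
        labels = new_labels
        for j in range(k):
            members = [tr for tr, lab in zip(trajs, labels) if lab == j]
            if members:  # keep the old center if a cluster becomes empty
                centers[j] = mean_trajectory(members, n_times)
    within = sum(distance(tr, centers[lab]) ** 2 for tr, lab in zip(trajs, labels))
    return labels, within


def best_partitions(trajs, ks=(2, 3, 4), n_restarts=10, seed=0):
    """For each k, rerun k-means n_restarts times and keep the best run."""
    rng = random.Random(seed)
    best = {}
    for k in ks:
        runs = [kmeans_once(trajs, k, rng) for _ in range(n_restarts)]
        best[k] = min(runs, key=lambda run: run[1])
    return best


# Toy data: three flat trajectories and three rising ones, with gaps.
low = [[0.0, 0.1, 0.0, 0.1], [0.1, None, 0.1, 0.0], [0.0, 0.0, 0.1, 0.1]]
high = [[0.0, 1.0, 2.0, 3.0], [0.1, 1.1, None, 3.1], [0.0, 0.9, 2.1, 3.0]]
result = best_partitions(low + high, ks=(2, 3), n_restarts=10)
labels, within = result[2]
```

The within-cluster sum of squares is comparable only between runs that share the same k, so choosing the number of clusters itself needs a separate criterion, which this sketch does not implement.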
References in zbMATH (referenced in 4 articles, 1 standard article)
- Proïa, Frédéric; Pernet, Alix; Thouroude, Tatiana; Michel, Gilles; Clotault, Jérémy: On the characterization of flowering curves using Gaussian mixture models (2016)
- Goffard, Pierre-Olivier; Guerrault, Xavier: Is it optimal to group policyholders by age, gender, and seniority for BEL computations based on model points? (2015)
- Schels, Martin; Scherer, Stefan; Glodek, Michael; Kestler, Hans A.; Palm, Günther; Schwenker, Friedhelm: On the discovery of events in EEG data utilizing information fusion (2013)
- Genolini, Christophe; Falissard, Bruno: KmL: k-means for longitudinal data (2010)