plfit
plfit: Fitting power-law distributions to empirical data. This program fits power-law distributions to empirical (discrete or continuous) data, according to the method of Clauset, Shalizi and Newman. Power-law distributions occur in many situations of scientific interest and have significant consequences for our understanding of natural and man-made phenomena. Unfortunately, the detection and characterization of power laws is complicated by the large fluctuations that occur in the tail of the distributions – the part of the distributions representing large but rare events – and by the difficulty of identifying the range over which power-law behavior holds. Commonly used methods for analyzing power-law data, such as least-squares fitting, can produce substantially inaccurate estimates of parameters for power-law distributions, and even in cases where such methods return accurate answers they are still unsatisfactory because they give no indication of whether the data obey a power law at all. We present a principled statistical framework for discerning and quantifying power-law behavior in empirical data. Our approach combines maximum-likelihood fitting methods with goodness-of-fit tests based on the Kolmogorov - Smirnov (KS) statistic and likelihood ratios. We evaluate the effectiveness of the approach with tests on synthetic data and give critical comparisons to previous approaches. We also apply the proposed methods to twenty-four real-world data sets from a range of different disciplines, each of which has been conjectured to follow a power-law distribution. In some cases we find these conjectures to be consistent with the data, while in others the power law is ruled out.
Keywords for this software
References in zbMATH (referenced in 239 articles , 1 standard article )
Showing results 221 to 239 of 239.
Sorted by year (- Garcia Garcia, Juan Manuel: A fixed-point algorithm to estimate the Yule-Simon distribution parameter (2011)
- Li, Wentian: On parameters of the human genome (2011)
- Mora, Thierry; Bialek, William: Are biological systems poised at criticality? (2011)
- Ribeiro, Leonardo Andrade; Härder, Theo: Generalizing prefix filtering to improve set similarity joins (2011) ioport
- Selvam, A. M.: Signatures of universal characteristics of fractal fluctuations in global mean monthly temperature anomalies (2011)
- Turnu, I.; Concas, G.; Marchesi, M.; Pinna, S.; Tonelli, R.: A modified Yule process to model the evolution of some object-oriented system properties (2011) ioport
- Zweig, Katharina A.: Good versus optimal: why network analytic methods need more systematic evaluation (2011) ioport
- Concas, Giulio; Marchesi, Michele; Murgia, Alessandro; Tonelli, Roberto: An empirical study of social networks metrics in object-oriented software (2010) ioport
- James, Alex; Pitchford, Jonathan W.; Plank, Michael J.: Efficient or inaccurate? Analytical and numerical modelling of random search strategies (2010)
- Maillart, T.; Sornette, D.: Heavy-tailed distribution of cyber-risks (2010)
- Politi, M.; Scalas, E.; Fulger, D.; Germano, G.: Spectral densities of Wishart-Lévy free stable random matrices (2010)
- Sator, N.; Hietala, H.: Damage in impact fragmentation (2010)
- Zhang, Jiang; Guo, Liangpeng: Scaling behaviors of weighted food webs as energy transportation networks (2010)
- Brenes, David J.; Gayo-Avello, Daniel: Stratified analysis of AOL query log (2009) ioport
- Clauset, Aaron; Shalizi, Cosma Rohilla; Newman, M. E. J.: Power-law distributions in empirical data (2009)
- Hecker, Michael; Goertsches, Robert Hermann; Engelmann, Robby; Thiesen, Hans-Jürgen; Guthke, Reinhard: Integrative modeling of transcriptional regulation in response to antirheumatic therapy (2009) ioport
- Klimek, Peter; Thurner, Stefan; Hanel, Rudolf: Pruning the tree of life: (k)-core percolation as selection mechanism (2009)
- Ni, Xiao-Hui; Jiang, Zhi-Qiang; Zhou, Wei-Xing: Degree distributions of the visibility graphs mapped from fractional Brownian motions and multifractal random walks (2009)
- Gnutzmann, Hinnerk: Network formation under cumulative advantage: Evidence from the Cambridge high-tech cluster (2008)