The Ngram Statistics Package (NSP) is a flexible and easy-to-use software tool that supports the identification and analysis of Ngrams, that is, sequences of N tokens in online text. We have designed and implemented NSP to be easy to customize to particular problems while remaining general enough to serve a broad range of needs. This paper provides an introduction to NSP, raises some general issues in Ngram analysis, and summarizes several applications where NSP has been successfully employed. NSP is written in Perl and is freely available under the GNU General Public License.
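To make the notion of an Ngram concrete, here is a minimal sketch in Python of counting Ngrams in a short text. It is not NSP's code or interface (NSP is written in Perl). The function name `count_ngrams`, the lowercasing, and the whitespace tokenization are assumptions made for illustration only.

```python
from collections import Counter


def count_ngrams(text, n=2):
    """Count every run of n consecutive tokens in text.

    Assumption: tokens are lowercased, whitespace-separated words.
    A real tool would let the user define what counts as a token.
    """
    tokens = text.lower().split()
    # Zip n shifted copies of the token list so that each tuple
    # holds n consecutive tokens; short inputs yield no tuples.
    ngrams = zip(*(tokens[i:] for i in range(n)))
    return Counter(ngrams)


counts = count_ngrams("the cat sat on the mat and the cat slept", n=2)
for ngram, freq in counts.most_common(3):
    print(" ".join(ngram), freq)
```

Frequency counts of this kind are the raw material for Ngram analysis; how tokens are defined is a design choice that changes which Ngrams are found.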
References in zbMATH (referenced in 3 articles, 1 standard article)
- Giles, Kendall E.; Trosset, Michael W.; Marchette, David J.; Priebe, Carey E.: Iterative denoising (2008)
- Vechtomova, Olga: Noun phrases in interactive query expansion and document ranking (2006)
- Banerjee, Satanjeev; Pedersen, Ted: The design, implementation, and use of the Ngram Statistics Package (2003)