TnT -- Statistical Part-of-Speech Tagging. TnT, the short form of Trigrams’n’Tags, is a very efficient statistical part-of-speech tagger that is trainable on different languages and virtually any tagset. The component for parameter generation trains on tagged corpora. The system incorporates several methods of smoothing and of handling unknown words. TnT is not optimized for a particular language. Instead, it is optimized for training on a large variety of corpora. Adapting the tagger to a new language, new domain, or new tagset is very easy. Additionally, TnT is optimized for speed. The tagger is an implementation of the Viterbi algorithm for second order Markov models. The main paradigm used for smoothing is linear interpolation, the respective weights are determined by deleted interpolation. Unknown words are handled by a suffix trie and successive abstraction.

References in zbMATH (referenced in 20 articles )

Showing results 1 to 20 of 20.
Sorted by year (citations)

  1. Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Vilares Ferro, Jesús: Absolute convergence and error thresholds in non-active adaptive sampling (2022)
  2. Dong, Jichang; Dai, Wei; Li, Jingjing: Exploring the linear and nonlinear causality between Internet big data and stock markets (2020)
  3. Forsati, Rana; Shamsfard, Mehrnoush: Hybrid PoS-tagging: a cooperation of evolutionary and statistical approaches (2014)
  4. Silva, Ana Paula; Silva, Arlindo; Rodrigues, Irene: An approach to the POS tagging problem using genetic algorithms (2014) ioport
  5. Kornai, András: Probabilistic grammars and languages (2011)
  6. Ponzetto, Simone Paolo; Strube, Michael: Taxonomy induction based on a collaboratively built knowledge repository (2011) ioport
  7. Rupnik, Jan; Grčar, Miha; Erjavec, Tomaž: Improving morphosyntactic tagging of Slovene language through meta-tagging (2010)
  8. Agić, Željko; Dovedan, Zdravko; Tadić, Marko: Improving part-of-speech tagging accuracy for Croatian by morphological analysis (2009)
  9. Biemann, Chris: Unsupervised part-of-speech tagging in the large (2009) ioport
  10. Carl, Michael; Melero, Maite; Badia, Toni; Vandeghinste, Vincent; Dirix, Peter; Schuurman, Ineke; Markantonatou, Stella; Sofianopoulos, Sokratis; Vassiliou, Marina; Yannoutsou, Olga: METIS-II: Low resource machine translation (2008) ioport
  11. Ramakrishnan, Ganesh; Joshi, Sachindra; Balakrishnan, Sreeram; Srinivasan, Ashwin: Using ILP to construct features for information extraction from semi-structured text (2008)
  12. Saquete, E.; Ferrández, O.; Ferrández, S.; Martínez-Barco, P.; Muñoz, R.: Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization (2008) ioport
  13. Filippova, Katja; Strube, Michael: The German Vorfeld and local coherence (2007)
  14. Alba, Enrique; Luque, Gabriel; Araujo, Lourdes: Natural language tagging with genetic algorithms (2006)
  15. Crego, Josep Maria; Mariño, José B.: Improving statistical MT by coupling reordering and decoding (2006) ioport
  16. Shen, Hong; Sarkar, Anoop: Voting between multiple data representations for text chunking (2005)
  17. Cohen, K. Bretonnel; Hunter, Lawrence: Natural language processing and systems biology (2004)
  18. Tufiş, Dan; Barbu, Ana Maria: Revealing translators’ knowledge: Statistical methods in constructing practical translation lexicons for language and speech processing (2002)
  19. Brants, Thorsten: Tnt - A statistical part-of-speech tagger (2000) ioport
  20. Brants, Thorsten: Estimating hidden Markov model topologies (1998)