• SRILM

  • Referenced in 30 articles [sw09928]
  • language model types based on N-gram statistics, as well as several related tasks, such...
  • Rouge

  • Referenced in 19 articles [sw15452]
  • number of overlapping units such as n-gram, word sequences, and word pairs between...
  • Swoogle

  • Referenced in 14 articles [sw25441]
  • system which can use either character N-Gram or URIrefs as keywords to find relevant...
  • Julius

  • Referenced in 5 articles [sw21656]
  • researchers and developers. Based on word N-gram and context-dependent HMM, it can perform ... word-pair context approximation, rank/score pruning, N-gram factoring, cross-word context dependency handling, enveloped...
  • textcat

  • Referenced in 2 articles [sw07329]
  • textcat Package for n-Gram Based Text Categorization in R. Identifying the language used will ... text categorization based on character n-gram frequencies have been particularly successful. This paper presents ... extension package textcat for n-gram based text categorization which implements both the Cavnar ... approach as well as a reduced n-gram approach designed to remove redundancies...
  • Bugram

  • Referenced in 1 article [sw40141]
  • Bugram: bug detection with n-gram language models. To improve software reliability, many rule-based ... approach—Bugram—that leverages n-gram language models instead of rules to detect bugs. Bugram ... models program tokens sequentially, using the n-gram language model. Token sequences from the program...
  • ProphetNet

  • Referenced in 1 article [sw35776]
  • ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training. This paper presents ... novel self-supervised objective named future n-gram prediction and the proposed n-stream self ... optimized by n-step ahead prediction that predicts the next n tokens simultaneously based ... each time step. The future n-gram prediction explicitly encourages the model to plan...
  • tokenizers

  • Referenced in 1 article [sw16425]
  • stringi’ package. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences...
  • SummaryEvaluation

  • Referenced in 2 articles [sw33890]
  • SummaryEvaluation using n-gram graphs. This project utilizes the JInsect toolkit to implement three automatic...
  • corpus

  • Referenced in 2 articles [sw30858]
  • computing term occurrence frequencies, including n-grams...
  • DKPro Similarity

  • Referenced in 2 articles [sw16551]
  • ranging from ones based on simple n-grams and common subsequences to high-dimensional vector...
  • PPDB

  • Referenced in 2 articles [sw33749]
  • scores computed from the Google n-grams and the Annotated Gigaword corpus. Our re- lease...
  • Sally

  • Referenced in 2 articles [sw08470]
  • string features, such as words or n-grams of words. The implementation of Sally builds...
  • Rishi

  • Referenced in 2 articles [sw21756]
  • uncommon server ports. By using n-gram analysis and a scoring system, we are able...
  • chrF

  • Referenced in 1 article [sw36203]
  • chrF: character n-gram F-score for automatic MT evaluation. We propose ... character n-gram F-score for automatic evaluation of ma- chine translation output. Character...
  • Charagram

  • Referenced in 1 article [sw26535]
  • Embedding Words and Sentences via Character n-grams. We present Charagram embeddings, a simple approach ... sentence is represented using a character n-gram count vector, followed by a single nonlinear...
  • textreg

  • Referenced in 1 article [sw35678]
  • package textreg: n-Gram Text Regression, aka Concise Comparative Summarization. Function for sparse regression...
  • libTextCat

  • Referenced in 1 article [sw24541]
  • classification technique described in Cavnar & Trenkle, ”N-Gram-Based Text Categorization” [1]. It was primarily ... list of the most frequent n-grams occurring in a document, ordered by frequency. Fingerprints...
  • Pattern

  • Referenced in 1 article [sw08487]
  • parser), natural language processing (tagger/chunker, n-gram search, sentiment analysis, WordNet), machine learning (vector space...
  • DAFSA

  • Referenced in 1 article [sw32807]
  • sequences (typically character strings or n-grams) in the form of a directed acyclic graph...