YAP3: improved detection of similarities in computer program and other texts.

In spite of years of effort, plagiarism in student assignment submissions still causes considerable difficulties for course designers; if students' work is not their own, how can anyone be certain they have learnt anything? YAP is a system for detecting suspected plagiarism in computer programs and other texts submitted by students. The paper reviews YAP3, the third version of YAP, focusing on its novel underlying algorithm, Running-Karp-Rabin Greedy-String-Tiling (RKR-GST). The algorithm was developed after experience with YAP and other systems showed that students shuffle independent code segments. YAP3 is able to detect transposed subsequences, and is less perturbed by spurious additional statements. The paper concludes with a discussion of a recent extension of YAP to English texts, further illustrating the flexibility of the YAP approach.
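The tiling idea behind the abstract's claims can be illustrated with a minimal sketch of plain greedy string tiling over token lists. This is not the YAP3 implementation: it omits the Karp-Rabin hashing that RKR-GST uses to find matches quickly, and the function names, the `min_match` parameter, the similarity formula, and the token names in the usage example are assumptions chosen for illustration.

```python
def greedy_string_tiling(a, b, min_match=2):
    """Cover token lists a and b with non-overlapping common substrings.

    Returns a list of tiles (start_in_a, start_in_b, length).
    Simplified sketch: brute-force matching, no Karp-Rabin hashing.
    """
    marked_a = [False] * len(a)
    marked_b = [False] * len(b)
    tiles = []
    while True:
        max_len = min_match
        matches = []
        # Find all maximal matches of unmarked tokens with the longest length.
        for i in range(len(a)):
            for j in range(len(b)):
                k = 0
                while (i + k < len(a) and j + k < len(b)
                       and a[i + k] == b[j + k]
                       and not marked_a[i + k] and not marked_b[j + k]):
                    k += 1
                if k > max_len:
                    matches = [(i, j, k)]
                    max_len = k
                elif k == max_len:
                    matches.append((i, j, k))
        if not matches:
            break
        # Turn matches into tiles, skipping any that overlap a tile
        # already placed during this round.
        for i, j, k in matches:
            if any(marked_a[i:i + k]) or any(marked_b[j:j + k]):
                continue
            for off in range(k):
                marked_a[i + off] = True
                marked_b[j + off] = True
            tiles.append((i, j, k))
    return tiles


def similarity(a, b, min_match=2):
    """Fraction of tokens in both lists covered by tiles (assumed metric)."""
    if not a and not b:
        return 0.0
    covered = sum(length for _, _, length in greedy_string_tiling(a, b, min_match))
    return 2 * covered / (len(a) + len(b))


# Hypothetical token streams: the second swaps two independent segments.
original = ["read", "parse", "check", "sort", "merge", "print"]
shuffled = ["sort", "merge", "print", "read", "parse", "check"]
# The third adds one spurious statement in the middle.
padded = ["read", "parse", "check", "noop", "sort", "merge", "print"]
```

In this sketch, `similarity(original, shuffled)` is 1.0 because both transposed segments are tiled independently of their order, while the extra token in `padded` only splits one long tile into two and leaves the remaining tokens covered.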

References in zbMATH (referenced in 7 articles)

  1. Pilehvar, Mohammad Taher; Navigli, Roberto: From senses to texts: an all-in-one graph-based approach for measuring semantic similarity (2015)
  2. Qu, Wei; Jia, Yuanyuan; Jiang, Michael: Pattern mining of cloned codes in software systems (2014)
  3. Arbuckle, Tom: Studying software evolution using artefacts’ shared information content (2011)
  4. Bronstein, Alexander M.; Bronstein, Michael M.; Bruckstein, Alfred M.; Kimmel, Ron: Partial similarity of objects, or how to compare a centaur to a horse (2009)
  5. Lukácsy, Gergely; Szeredi, Péter: Plagiarism detection in source programs using structural similarities (2009)
  6. Degrave, François; Vanhoof, Wim: Towards a normal form for Mercury programs (2008)
  7. Chen, Xin; Francia, Brent; Li, Ming; McKinnon, Brian; Seker, Amit: Shared information and program plagiarism detection (2004)