ProTDB

ProTDB: probabilistic data in XML. Whereas traditional databases manage only deterministic information, many database applications involve uncertain data. This paper presents the Probabilistic Tree Data Base (ProTDB), which manages probabilistic data represented in XML. Our approach differs from previous efforts to develop probabilistic relational systems in that we build a probabilistic XML database. This design is driven by application needs involving data that is not readily amenable to a relational representation. XML data poses several modeling challenges, arising from its structure, from the possibility of associating uncertainty at multiple granularities, and from the possibility of missing and repeated sub-elements. We present a probabilistic XML model that addresses all of these challenges. We devise an implementation of XML query operations using our probability model and demonstrate its efficiency experimentally. We have used ProTDB to manage data from two application areas: protein chemistry data from the bioinformatics domain, and information extraction data obtained from the web using a natural language analysis system. We present a brief case study of the latter to demonstrate the value of probabilistic XML data management.
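
To make the idea of uncertainty at multiple granularities concrete, here is a minimal Python sketch. It is not ProTDB's actual implementation or API. It assumes that each element carries a hypothetical `prob` attribute giving its probability of existing given that its parent exists, and that the absolute probability of an element is the product of these values along its path from the root. The sample document, the attribute name, and the function `absolute_probabilities` are illustrative assumptions only.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: "prob" is an assumed attribute name holding
# the probability that an element exists, given that its parent exists.
# Repeated sub-elements (two phones) each carry their own probability.
DOC = """
<person prob="0.9">
  <name prob="1.0">Ann</name>
  <phone prob="0.6">555-0100</phone>
  <phone prob="0.3">555-0199</phone>
</person>
"""


def absolute_probabilities(node, parent_prob=1.0, path=""):
    """Yield (path, text, probability) for a node and all its descendants.

    The absolute probability is the parent's absolute probability times
    the node's own conditional probability (1.0 if the attribute is absent).
    """
    prob = parent_prob * float(node.get("prob", "1.0"))
    here = f"{path}/{node.tag}"
    yield here, (node.text or "").strip(), prob
    for child in node:
        yield from absolute_probabilities(child, prob, here)


root = ET.fromstring(DOC)
results = list(absolute_probabilities(root))

for path, text, prob in results:
    print(f"{path:16} {text:10} {prob:.2f}")
```

In this sketch, uncertainty attached to the whole `person` element propagates to every sub-element, while each `phone` keeps its own finer-grained uncertainty.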

This software is also peer reviewed by journal TOMS.

