NetLogger: a toolkit for distributed system performance analysis.

Diagnosing and debugging performance problems on complex distributed systems requires end-to-end performance information at both the application and system levels. We describe a methodology, called NetLogger, that enables real-time diagnosis of performance problems in such systems. The methodology includes tools for generating precision event logs, an interface to a system event-monitoring framework, and tools for visualizing the log data and the real-time state of the distributed system. Low overhead is an important requirement for such tools, so we evaluate the efficiency of the monitoring itself. The approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system.

References in zbMATH (referenced in 14 articles)

  1. Hierons, Robert M.; Merayo, Mercedes G.; Núñez, Manuel: Implementation relations and test generation for systems with distributed interfaces (2012)
  2. Carpen-Amarie, Alexandra; Costan, Alexandru; Cai, Jing; Antoniu, Gabriel; Bougé, Luc: Bringing introspection into BlobSeer: towards a self-adaptive distributed data management system (2011)
  3. Callaghan, Scott; Deelman, Ewa; Gunter, Dan; Juve, Gideon; Maechling, Philip; Brooks, Christopher; Vahi, Karan; Milner, Kevin; Graves, Robert; Field, Edward; Okaya, David; Jordan, Thomas: Scaling up workflow-based applications (2010)
  4. Pandey, Nirved; Sharma, G.K.: Startup comparison for message passing libraries with DTM on Linux clusters (2007)
  5. Hallal, H.H.; Boroday, S.; Petrenko, A.; Ulrich, A.: A formal approach to property testing in causally consistent distributed traces (2006)
  6. Robinson, William N.: A requirements monitoring framework for enterprise systems (2006)
  7. Robinson, William N.: A requirements monitoring framework for enterprise systems (2006)
  8. Truong, Hong-Linh; Fahringer, Thomas; Dustdar, Schahram: Dynamic instrumentation, performance monitoring and analysis of grid scientific workflows (2005)
  9. Kulkarni, Devdatta; Sosonkina, Masha: A framework for integrating network information into distributed iterative solution of sparse linear systems (2003)
  10. Balis, Bartosz; Bubak, Marian; Funika, Włodzimierz; Szepieniec, Tomasz; Wismüller, Roland: An infrastructure for grid application monitoring (2002)
  11. Bubak, Marian; Funika, Włodzimierz; Wismüller, Roland: The CrossGrid performance analysis tool for interactive grid applications (2002)
  12. Ponce, O.; Cuevas, J.; Fuentes, A.; Marco, J.; Marco, R.; Martínez-Rivero, C.; Menéndez, R.; Rodríguez, D.: Training of neural networks: Interactive possibilities in a distributed framework (2002)
  13. Balaton, Zoltán; Kacsuk, Péter; Podhorszki, Norbert: Application monitoring in the grid with GRM and PROVE (2001)
  14. Sosonkina, Masha; Chen, Gan: Design of a tool for providing dynamic network information to an application (2001)