dproc -- extensible run-time resource monitoring for cluster applications In this paper we describe the dproc (distributed/proc) kernel-level mechanisms and abstractions,which provide the building blocks for implementation of efficient, cluster-wide, and application-specific performance monitoring. Such monitoring functionality may be constructed at any time, both before and during application invocation, and can include dynamic run-time extensions. This paper (i) presents dproc’s implementation in a Linux-based cluster of SMP-machines, and (ii) evaluates its utility by construction of sample monitoring functionality. Full version of this paper can be found at: http://www.cc.gatech.edu/systems/projects/dproc/
Keywords for this software
References in zbMATH (referenced in 3 articles , 1 standard article )
Showing results 1 to 3 of 3.
- Cicotti, P.; Taufer, M.; Chien, Andrew A.: Globus: A metacomputing infrastructure toolkit (2005)
- Agarwala, Sandip; Poellabauer, Christian; Kong, Jiantao; Schwan, Karsten; Wolf, Matthew: System-level resource monitoring in high-performance computing environments (2003)
- Jancic, J.; Poellabauer, C.; Schwan, K.; Wolf, M.; Bright, N.: dproc -- extensible run-time resource monitoring for cluster applications (2002)