NAS Parallel Benchmarks

The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and consist of five kernels and three pseudo-applications in the original ”pencil-and-paper” specification (NPB 1). The benchmark suite has been extended to include new benchmarks for unstructured adaptive mesh, parallel I/O, multi-zone applications, and computational grids. Problem sizes in NPB are predefined and indicated as different classes. Reference implementations of NPB are available in commonly-used programming models like MPI and OpenMP (NPB 2 and NPB 3).

References in zbMATH (referenced in 95 articles )

Showing results 1 to 20 of 95.
Sorted by year (citations)

1 2 3 4 5 next

  1. Kell, Nathaniel; Havill, Jessen: Improved upper bounds for online malleable job scheduling (2015)
  2. Zhukov, V.; Krasnov, M.; Novikova, N.; Feodoritova, O.: Multigrid effectiveness on modern computing architectures (2015)
  3. Cruz, Eduardo H.M.; Diener, Matthias; Alves, Marco A.Z.; Navaux, Philippe O.A.: Dynamic thread mapping of shared memory applications by exploiting cache coherence protocols (2014)
  4. Didelot, Sylvain; Carribault, Patrick; Pérache, Marc; Jalby, William: Improving MPI communication overlap with collaborative polling (2014)
  5. Escudero-Sahuquillo, Jesus; Garcia, Pedro J.; Quiles, Francisco J.; Reinemo, Sven-Arne; Skeie, Tor; Lysne, Olav; Duato, Jose: A new proposal to deal with congestion in InfiniBand-based fat-trees (2014)
  6. Schneider, Timo; Gerstenberger, Robert; Hoefler, Torsten: Application-oriented ping-pong benchmarking: how to assess the real communication overheads (2014)
  7. Cores, Iván; Rodríguez, Gabriel; Martín, Mará J.; González, Patricia; Osorio, Roberto R.: Improving scalability of application-level checkpoint-recovery by reducing checkpoint sizes (2013)
  8. de Carvalho, Francisco Heron jun.; de Rezende, Cenez Araújo: A case study on expressiveness and performance of component-oriented parallel programming (2013)
  9. Emeneker, Wesley; Apon, Amy: On modeling contention for shared caches in multi-core processors with techniques from ecology (2013)
  10. Lerida, Josep L.; Solsona, Francesc; Hernandez, Porfidio; Gine, Francesc; Hanzich, Mauricio; Conde, Josep: State-based predictions with self-correction on enterprise desktop grid environments (2013)
  11. Pennycook, S.J.; Hammond, S.D.; Wright, S.A.; Herdman, J.A.; Miller, I.; Jarvis, S.A.: An investigation of the performance portability of OpenCL (2013)
  12. Sundriyal, Vaibhav; Sosonkina, Masha; Gaenko, Alexander; Zhang, Zhao: Energy saving strategies for parallel applications with point-to-point communication phases (2013)
  13. Taboada, Guillermo L.; Ramos, Sabela; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón: Java in the high performance computing arena: research, practice and experience (2013)
  14. Viñas, Moisés; Bozkus, Zeki; Fraguela, Basilio B.: Exploiting heterogeneous parallelism with the heterogeneous programming library (2013)
  15. Wu, Xingfu; Taylor, Valerie: Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers (2013)
  16. Dong, Bin; Li, Xiuqiao; Wu, Qimeng; Xiao, Limin; Ruan, Li: A dynamic and adaptive load balancing strategy for parallel file system with large-scale I/O servers (2012)
  17. Fraguela, Basilio B.; Bikshandi, Ganesh; Guo, Jia; Garzarán, María J.; Padua, David; Von Praun, Christoph: Optimization techniques for efficient HTA programs (2012)
  18. Hori, Atsushi; Lee, Jinpil; Sato, Mitsuhisa: Audit: a new synchronization API for the GET/PUT protocol (2012)
  19. Mello Schnorr, Lucas; Huard, Guillaume; Navaux, Philippe Olivier Alexandre: A hierarchical aggregation model to achieve visualization scalability in the analysis of parallel applications (2012)
  20. Salnikov, A.N.; Andreev, D.Yu.; Lebedev, R.D.: Toolkit for analyzing the communication environment characteristics of a computational cluster based on MPI standard functions (2012)

1 2 3 4 5 next