NAS Parallel Benchmarks

The NAS Parallel Benchmarks (NPB) are a small set of programs designed to help evaluate the performance of parallel supercomputers. The benchmarks are derived from computational fluid dynamics (CFD) applications and, in the original "pencil-and-paper" specification (NPB 1), consist of five kernels and three pseudo-applications. The suite has since been extended with benchmarks for unstructured adaptive meshes, parallel I/O, multi-zone applications, and computational grids. Problem sizes in NPB are predefined and grouped into classes. Reference implementations are available in commonly used programming models such as MPI and OpenMP (NPB 2 and NPB 3).

References in zbMATH (referenced in 119 articles)

Showing results 1 to 20 of 119.

  1. Jongmans, Sung-Shik T.Q.; Arbab, Farhad: Data optimizations for constraint automata (2016)
  2. Kell, Nathaniel; Havill, Jessen: Improved upper bounds for online malleable job scheduling (2015)
  3. Zhukov, V.; Krasnov, M.; Novikova, N.; Feodoritova, O.: Multigrid effectiveness on modern computing architectures (2015)
  4. Cruz, Eduardo H.M.; Diener, Matthias; Alves, Marco A.Z.; Navaux, Philippe O.A.: Dynamic thread mapping of shared memory applications by exploiting cache coherence protocols (2014)
  5. Didelot, Sylvain; Carribault, Patrick; Pérache, Marc; Jalby, William: Improving MPI communication overlap with collaborative polling (2014)
  6. Escudero-Sahuquillo, Jesus; Garcia, Pedro J.; Quiles, Francisco J.; Reinemo, Sven-Arne; Skeie, Tor; Lysne, Olav; Duato, Jose: A new proposal to deal with congestion in InfiniBand-based fat-trees (2014)
  7. Schneider, Timo; Gerstenberger, Robert; Hoefler, Torsten: Application-oriented ping-pong benchmarking: how to assess the real communication overheads (2014)
  8. Cores, Iván; Rodríguez, Gabriel; Martín, María J.; González, Patricia; Osorio, Roberto R.: Improving scalability of application-level checkpoint-recovery by reducing checkpoint sizes (2013)
  9. de Carvalho, Francisco Heron jun.; de Rezende, Cenez Araújo: A case study on expressiveness and performance of component-oriented parallel programming (2013)
  10. Emeneker, Wesley; Apon, Amy: On modeling contention for shared caches in multi-core processors with techniques from ecology (2013)
  11. Lerida, Josep L.; Solsona, Francesc; Hernandez, Porfidio; Gine, Francesc; Hanzich, Mauricio; Conde, Josep: State-based predictions with self-correction on enterprise desktop grid environments (2013)
  12. Pennycook, S.J.; Hammond, S.D.; Wright, S.A.; Herdman, J.A.; Miller, I.; Jarvis, S.A.: An investigation of the performance portability of OpenCL (2013)
  13. Sundriyal, Vaibhav; Sosonkina, Masha; Gaenko, Alexander; Zhang, Zhao: Energy saving strategies for parallel applications with point-to-point communication phases (2013)
  14. Taboada, Guillermo L.; Ramos, Sabela; Expósito, Roberto R.; Touriño, Juan; Doallo, Ramón: Java in the high performance computing arena: research, practice and experience (2013)
  15. Viñas, Moisés; Bozkus, Zeki; Fraguela, Basilio B.: Exploiting heterogeneous parallelism with the heterogeneous programming library (2013)
  16. Wu, Xingfu; Taylor, Valerie: Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers (2013)
  17. Dong, Bin; Li, Xiuqiao; Wu, Qimeng; Xiao, Limin; Ruan, Li: A dynamic and adaptive load balancing strategy for parallel file system with large-scale I/O servers (2012)
  18. Fraguela, Basilio B.; Bikshandi, Ganesh; Guo, Jia; Garzarán, María J.; Padua, David; Von Praun, Christoph: Optimization techniques for efficient HTA programs (2012)
  19. Hori, Atsushi; Lee, Jinpil; Sato, Mitsuhisa: Audit: a new synchronization API for the GET/PUT protocol (2012)
  20. Mello Schnorr, Lucas; Huard, Guillaume; Navaux, Philippe Olivier Alexandre: A hierarchical aggregation model to achieve visualization scalability in the analysis of parallel applications (2012)