Code optimizations for complex microprocessors applied to CFD software. Improving large scale, time-dependent numerical simulation of the Navier-Stokes equations is critical for the future of Computational Fluid Dynamics (CFD) in engineering applications. Unfortunately, these computations require massive, and generally expensive, computing resources. With the continuing advances in commodity computer hardware, an alternative approach to computationally expensive CFD computations is emerging in the form of PC clusters. However, to take advantage of clusters most CFD programs require extensive modifications so that they run efficiently on cache-based microprocessor systems. This paper presents techniques and tools that we have developed and used to optimize Navier--Stokes solvers on a single node of a PC cluster through the example CFD code DNSTool. After describing DNSTool, the paper demonstrates how this code is tuned to improve performance through profiling the computational cost of each of the subroutines, adapting the code for cache-based memory systems, and including SWAR (SIMD within a register) based routines. The effect of these improvements is to halve the computational cost on a single node, which in turn significantly increases the performance of the code on a PC cluster.

Keywords for this software

Anything in here will be replaced on browsers that support the canvas element