This chapter describes the efforts to improve performance of the widely used WSM6 microphysics package used in numerical weather prediction via a variety of techniques including threading, vectorization, array alignment, improving data locality, and optional use of compile-time constants for loop and array index bounds. Code examples are included to illustrate each technique along with its performance benefits on Intel Xeon processors and Intel Xeon Phi coprocessors. Use of tools such as the Intel Thread Inspector to speed the performance tuning process are also described.
Not available
Authors who have authored or contributed to this publication.
Not available