Significance of Results
The I/O intensive MSC/Nastran Vendor_2008 benchmark test suite was used
to compare the performance on a Sun Fire X2270 server
when using SSDs internally instead
The effect on performance from increasing memory to augment I/O caching
was also examined.
The Sun Fire X2270 server was equipped with Intel QC Xeon X5570 processors
The positive effect of adding memory to increase I/O caching is offset
to some degree by the reduction in memory frequency with additional DIMMs
in the bays of each memory channel on each cpu socket for these Nehalem
- SSDs can significantly improve NASTRAN performance especially on
runs with larger core counts.
Additional memory in the server can also increase performance,
however in some systems additional memory can decrease memory GHz
so this may offset the benefits of increased capacity.
If SSDs are not used striped disks will often improve performance
of IO-bound MCAE applications.
To obtain the highest performance it is recommended that SSDs be
used and servers be configured with the largest memory possible
without decreasing memory GHz. One should always look at the
workload characteristics and compare against this benchmark to
correctly set expectations.
SSD vs. HDD Performance
The performance of two striped 30GB SSDs was compared to two striped
7200 rpm 500GB SATA drives on a Sun Fire X2270 server.
- At the 8-core level (maximum cores for a single node) SSDs were
2.2x faster for the larger xxocmd2 and the smaller xlotdf1 cases.
- For 1-core results SSDs are up to 3% faster.
On the smaller mdomdf1 test case there was no increase in
performance on the 1-, 2-, and 4-cores configurations.
Performance Enhancement with I/O Memory Caching
Performance for Nastran can often be increased by additional memory
to provide additional in-core space to cache I/O and thereby reduce
the IO demands.
The main memory was doubled from 24GB to 48GB. At the 24GB level one 4GB DIMM
was placed in the first bay of each of the 3 CPU memory channels on each
of the two CPU sockets on the Sun Fire X2270 platform.
This configuration allows a memory frequency of 1333MHz.
At the 48GB level a second 4GB DIMM was placed in the second bay of each of the 3 CPU memory channels
on each socket. This reduces the memory frequency to 1066MHz.
Adding Memory With HDDs (SATA)
The additional server memory increased the performance when running with
the slower SATA drives at the higher core levels (e.g. 4- & 8-cores on
a single node)
- The larger xxocmd2 case was 42% faster and the smaller xlotdf1 case was 32% faster at the maximum 8-core level on a single system.
The special I/O intensive getrag case was 8% faster at the 1-core level.
Adding Memory With SDDs
At the maximum 8-core level (for a single node)
the larger xxocmd2 case was 47% faster in overall run time.
The effects were much smaller at lower core counts and in the tests at the 1-core level most test cases ran from 5% to 14% slower with the slower CPU memory frequency dominating over the
added in-core space available for I/O caching vs. direct transfer to SSD.
Only the special I/O intensive getrag case was an exception running 6% faster at the 1-core level.
Increasing performance with Two Striped (SATA) Drives
The performance of multiple striped drives was also compared to
single drive. The study compared two striped internal 7200 rpm
500GB SATA drives to a singe single internal SATA drive.
On a single node with 8 cores, the largest test
xx0cmd2 was 40% faster, a smaller test case xl0tdf1 was 33% faster and even
the smallest test case mdomdf1 case was 12% faster.
On 1-core the added boost in performance with striped disks was
from 4% to 13% on the various test cases.
- One 1-core the special I/O-intensive test case getrag was 29% faster.
Times in table are elapsed time (sec).
MSC/Nastran Vendor_2008 Benchmark Test Suite
||Sun Fire X2270
2 x X5570 QC 2.93 GHz
2 x 7200 RPM SATA HDDs
|Sun Fire X2270
2 x X5570 QC 2.93 GHz
2 x SSDs
Results and Configuration Summary
Sun Fire X2270
1 2-socket rack mounted server
2 x 2.93 GHz QC Intel Xeon X5570 processors
2 x internal striped SSDs
2 x internal striped 7200 rpm 500GB SATA drives
O/S: Linux 64-bit SUSE SLES 10 SP 2
Application: MSC/NASTRAN MD 2008
Benchmark: MSC/NASTRAN Vendor_2008 Benchmark Test Suite
HP MPI: 02.03.00.00  Linux x86-64
Voltaire OFED-220.127.116.11_5 GridStack for SLES 10
The benchmark tests are representative of typical
MSC/Nastran applications including both SMP and DMP runs involving
linear statics, nonlinear statics, and natural frequency extraction.
The MD (Multi Discipline) Nastran 2008 application performs both
structural (stress) analysis and thermal analysis. These analyses may
be either static or transient dynamic and can be linear or nonlinear
as far as material behavior and/or deformations are concerned. The
new release includes the MARC module for general purpose nonlinear
analyses and the Dytran module that employs an explicit solver to
analyze crash and high velocity impact conditions.
- As of the Summer '08 there is now an official
Solaris X64 version of the MD Nastran 2008 system that is certified
- The memory requirements for the test cases in the new
MSC/Nastran Vendor 2008 benchmark test suite range from
a few hundred megabytes to no more than 5 GB.
Please go here for a more
complete description of the tests.
Key Points and Best Practices
For more on Best Practices of SSD on HPC applications also see the Sun Blueprint:
Additional information on the MSC/Nastran Vendor 2008 benchmark test
Based on the maximum physical memory on a platform the user
can stipulate the maximum portion of this memory that can be
allocated to the Nastran job. This is done on the command line with
the mem= option. On Linux based systems where the
platform has a large amount of memory and where the model does not
have large scratch I/O requirements the memory can be allocated to a
tmpfs scratch space file system. On Solaris X64 systems
advantage can be taken of ZFS for higher I/O performance.
The MSC/Nastran Vendor 2008 test cases don't
scale very well, a few not at all and the rest on up to 8 cores at
The test cases for the MSC/Nastran module all have a
substantial I/O component where 15% to 25% of the total run times
are associated with I/O activity (primarily scratch files). The
required scratch file size ranges from less than 1 GB on up to about
140 GB. Performance will be enhanced by using the fastest available
drives and striping together more than one of them or using a high
performance disk storage system, further enhanced as indicated here
by implementing the Lustre based I/O system. High performance
interconnects such as Infiniband for inter node cluster message
passing as well as I/O transfer from the storage system can also
enhance performance substantially.
MSC.Software is a registered trademark of MSC.
All information on the MSC.Software website is
copyrighted. MSC/Nastran Vendor 2008 results
from http://www.mscsoftware.com and
this report as of June 9, 2009.