Perflib and parallel regions

If perflib is called from within a parallel region, you get the serial version rather than a parallel version. This is probably better than ending up with N\^2 threads when you set OMP_NUM_THREADS to be N.

Comments:

Strange. I would have expected a global thread pool (in which case I can't see the issue to handle nested parallelism) or at least a heuristic of starting parallel code with a value of OMP_NUM_THREADS divided by the number of parallel threads started.

I guess there is room for improvement here...

Posted by Marc on September 30, 2007 at 03:13 AM PDT #

Post a Comment:
Comments are closed for this entry.
About

Darryl Gove is a senior engineer in the Solaris Studio team, working on optimising applications and benchmarks for current and future processors. He is also the author of the books:
Multicore Application Programming
Solaris Application Programming
The Developer's Edge

Search

Categories
Archives
« April 2014
SunMonTueWedThuFriSat
  
1
2
5
6
8
9
10
12
13
14
15
18
19
20
21
22
23
24
25
26
27
28
29
30
   
       
Today
Bookmarks
The Developer's Edge
Solaris Application Programming
Publications
Webcasts
Presentations
OpenSPARC Book
Multicore Application Programming
Docs