Skip to main content

developerWorks >  Power Architecture  >  Forums  >  Cell Broadband Engine Architecture forum  >  developerWorks

linpack for cell clusters    Point your RSS reader here for a feed of the latest messages in this thread


Tags for this thread: 

     

 
 

My developerWorks
 Welcome, Guest
Sign in or register
This question is answered.

Permlink Replies: 10 - Pages: 1 - Last Post: Nov 11, 2009 9:38 AM Last Post By: khanna
khanna

Posts: 55
Registered: Mar 04, 2007 01:09:48 PM
linpack for cell clusters
Posted: Aug 30, 2008 02:39:06 PM
 
Click to report abuse...   Click to reply to this thread Reply
Hi all

Does anyone here have a standard linpack package for Cell based clusters? A number of folks have asked me about the linpack score of my
PS3 cluster. I'd like to try it out, if its something I can do pretty quick.

Of course, there is no point doing this if there is no Cell-optimized linpack suite. I've seen impressive linpack scores for Cell at many
places on the web so it obviously exists. The question is if there is a Cell cluster (message-passing) based suite as well?

Thanks.
mkistler

Posts: 535
Registered: Nov 04, 2005 02:29:23 PM
Re: linpack for cell clusters
Posted: Oct 09, 2008 09:23:17 AM   in response to: khanna in response to: khanna's post
 
Click to report abuse...   Click to reply to this thread Reply
Well ... this is not exactly what you asked for, but we have made available a set of modifications to Linpack that enable it to run on clusters of QS22s. This is available at

http://www.netlib.org/benchmark/hpl/

This is essentially the same code we used to obtain the 170.7 GFLOPS Linpack result for a QS22 that was published in the Linpack report (http://www.netlib.org/benchmark/performance.ps) in June, and that was used for the two QS22-based clusters that appeared in the Top500 list in June.

Mike Kistler
gbn

Posts: 1
Registered: Nov 11, 2008 08:38:08 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 11, 2008 09:11:28 PM   in response to: khanna in response to: khanna's post
 
Click to report abuse...   Click to reply to this thread Reply
I've been running this patch on my QS22 cluster (BladeCenter H with 12 QS22 blades, each with 2 PowerXCell 8i processors).

Problem: Only half of the SPEs are running the xhpl process. This is shown by looking in /spu, at spu-top, and other means. I've confirmed that I can get other programs (such as the 'mandelbrot' demo) to run on all 16 SPEs. But xhpl only runs on the first 8, regardless of how many MPI processes, numactl, and other things I tried.

In the hpl/accel/lib directory, there is a variable in hpl_accel_global.h that sounds like it should help: HPL_ACCEL_CMD_ENTRIES. I set this to 16 (from 8), also HPL_ACCEL_SPES in hpl_accel_spu.h. This increases the number of threads (tasks?) on the SPEs, but they are still only running on all 8 of the SPEs on the first Cell BE, and none of the second Cell's SPEs.

I've followed all the directions in the README supplied by the patch (referenced at the netlib site), recompiled, etc. I have the 3.0.5 SDK, not yet 3.1.

When I ran with just 8 tasks per node, HPL hasn't finished yet (I let it run for over 16 hours, before killing it and experimenting with other options). Maybe this is a symptom, or maybe unrelated.

Has anyone else encountered this? There are relatively few QS22 systems out there, and relatively few people trying to run Linpack, and maybe not everyone looks at whether all SPEs are busy. But I did, and am perplexed!

I did try setting the environment variable BLAS_NUMSPES=16, but this seemed to have no impact. Again, with other applications (not the patched HPL), I'm able to run all 16 SPEs successfully.

TIA. gbn

PS: here's what spu-top looks like. There are 16 tasks, but they are all running on only 8 SPEs:

spu-top: SPU View
Cpu(s) load avg: 0.22, 0.05, 0.06
Spu(s) load avg: 16.07, 16.67, 16.85
Cpu(s): 24.6%us,  1.0%sys,  0.0%wait,  0.0%nice, 74.4%idle
Spu(s): 49.7%us,  0.1%sys,  0.0%wait, 50.2%idle
 
SPE  %SPU  %USR  %SYS  %WAI S   SLB  HFLT  mFLT  MFLT  IRQ2 PPE_LIB
  0 100.0  99.8   0.2   0.0 U 54010   580   578     0  2812	  0
  1 100.0  99.8   0.2   0.0 U 55130   586   583     0  3072	  0
  2 100.0  99.8   0.2   0.0 U 53983   477   475     0  2802	  0
  3 100.0  99.8   0.2   0.0 U 56609   671   669     0  3275	  0
  4 100.0  99.8   0.2   0.0 U 54455   714   709     0  2972	  0
  5 100.0  99.8   0.2   0.0 U 58496   872   865     0  3652	  0
  6 100.0  99.8   0.2   0.0 U 54710   820   815     0  2980	  0
  7 100.0  99.8   0.2   0.0 U 63905  2179  2175     0  4938	  0
  8   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
  9   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 10   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 11   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 12   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 13   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 14   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
 15   0.0   0.0   0.0   0.0 I     0     0     0     0     0	  0
IBM employee or contractor CellServ

Posts: 1,346
Registered: Jul 17, 2006 06:15:39 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 12, 2008 10:12:49 AM   in response to: gbn in response to: gbn's post
 
Click to report abuse...   Click to reply to this thread Reply
gbn - please check your private messages for more info.

--
IBM SDK Service Administrator
khanna

Posts: 55
Registered: Mar 04, 2007 01:09:48 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 08, 2009 09:39:31 AM   in response to: CellServ in response to: CellServ's post
 
Click to report abuse...   Click to reply to this thread Reply
I'm having the same issue on my QS22 system i.e. only 8 SPEs are in use with the patched HPL. Please advise ..

Thanks.
mkistler

Posts: 535
Registered: Nov 04, 2005 02:29:23 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 08, 2009 11:15:27 AM   in response to: khanna in response to: khanna's post
 
Click to report abuse...   Click to reply to this thread Reply
Please post the specific versions of the OS and OpenMPI you are using.

Mike
khanna

Posts: 55
Registered: Mar 04, 2007 01:09:48 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 10, 2009 07:32:27 PM   in response to: mkistler in response to: mkistler's post
 
Click to report abuse...   Click to reply to this thread Reply
Hi again, Mike

How are you doing? Its been a while since we interacted.

I'm running Fedora 9 on the QS22 blades and using OpenMPI 1.2.4

Thanks,
Gaurav
mkistler

Posts: 535
Registered: Nov 04, 2005 02:29:23 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 10, 2009 10:43:16 PM   in response to: khanna in response to: khanna's post
 
Click to report abuse...   Click to reply to this thread Reply
Gaurav,

Well howdy! I didn't recognize the userid.

I recommend that you move up to OpenMPI 1.3, and make sure you have the latest version of the Linux kernel for Cell installed (which I believe is kernel-2.6.25.14-108.20090129bsc.ppc64.rpm, available from http://www.bsc.es/projects/deepcomputing/linuxoncell/cellsimulator/sdk3.1/updates/.

I strongly suspect that one of these things will resolve your problem, but if not, please post back and we'll get to the bottom of it.

Mike
khanna

Posts: 55
Registered: Mar 04, 2007 01:09:48 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 11, 2009 08:34:47 AM   in response to: mkistler in response to: mkistler's post
 
Click to report abuse...   Click to reply to this thread Reply
Thanks Mike. Will do and give it a try. One quick question though; is one supposed to launch 2 MPI tasks on a single QS22 blade or just one (I'm using just one ..)?

Gaurav
mkistler

Posts: 535
Registered: Nov 04, 2005 02:29:23 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 11, 2009 09:32:51 AM   in response to: khanna in response to: khanna's post
 
Click to report abuse...   Click to reply to this thread Reply
You want two MPI tasks per QS22 ... or said differently, you want one MPI task per PowerXCell 8i processor. This is probably why you only saw 8 SPEs being used.

Mike
khanna

Posts: 55
Registered: Mar 04, 2007 01:09:48 PM
Re: linpack for cell clusters (not all SPEs being used)
Posted: Nov 11, 2009 09:38:44 AM   in response to: mkistler in response to: mkistler's post
 
Click to report abuse...   Click to reply to this thread Reply
Ah .. that is definitely the reason then. Thanks, Mike.

Gaurav
 Tags
Help

Use the search field to find all types of content in My developerWorks with that tag.

Use the slider bar to see more or fewer tags.

Popular tags shows the top tags for this particular type of content or application that you're viewing.

My tags shows your tags for this particular type of content or application that you're viewing.

 

MoreLess 


Point your RSS reader here for a feed of the latest messages in all forums