What's going on (from twitter)

I enjoyed reading Ian’s analysis on how to interpret “faster” when it comes to comparing infrastructure-as-a-service and a supercomputer.

Snippet:

For example, let's say we want to run the LU benchmark, which (based on the numbers in Ed's paper) when run on 32 processors takes ~25 secs on the supercomputer and ~100 secs on EC2. Now let's add in queue and startup time:

  • On EC2 , I am told that it may take ~5 minutes to start 32 nodes (depending on image size), so with high probability we will finish the LU benchmark within 100 + 300 = 400 secs.
  • On the supercomputer , we can use Rich Wolksi's QBETS queue time estimation service to get a bound on the queue time. When I tried this in June, QBETS told me that if I wanted 32 nodes for 20 seconds, the probability of me getting those nodes within 400 secs was only 34%--not good odds.

So, based on the QBETS predictions, if I had to put money on which system my application would finish first, I would have to go for EC2.

Comments
#

Posted by hard drive (email) on Sat, 26 Sep, 2009 @ 05:33 UTC
#

While reading, I was expecting to hear an argument about how it's

faster on EC2 because otherwise you would have to build your own

infrastructure, and in the meantime you could do a huge amount of

work.

I was very surprised to find that the expected queueing delay for a

job that size would make EC2 the winner because of EC2's faster

startup time.

Posted by hard drive (email) on Sat, 26 Sep, 2009 @ 05:34 UTC
Name:
Comment:
Antispam: Please type only the digits in "iwheo6"
Email:
URI:
Notify me of new comments