1TB sort done in 68 seconds by Google

,

This 3 times faster than the previous known record.

For those who don’t know, Jim Gray et al established a series of tests, including the 1TB sort, in order to give database vendors a playground for honest comparisons. The results are maintained online. Here are the two related papers:

Google managed to sort 1TB in 68secs using their MapReduce infrastructure on 1,000 machines. Then, they attempted to sort 1PB of data on 4,000 machines. It’s interesting how when sorting 1PB of data one hits the hard disk failure rates.

Interesting stuff. I am looking forward to the paper.

One response to “1TB sort done in 68 seconds by Google”

  1. That is impressive; I can see an arms race developing here. What I’d like to see is an estimate of the number or megajoules used in the operation, possibly along with the number of MJ to upload the data and keep it stored for a month. Then we can measure performance not just by wall time, but energy efficiency.