The way CUDA deals with thread death in its current iterations is lacking. If they make that more graceful, you can expect to see some insane speedups.
"An idealist is one who, on noticing that a rose smells better than a cabbage, concludes that it will also make better soup." - H.L. Mencken