
Revisiting Amdahl's Law

Soulskill posted about 10 months ago | from the looking-for-new-loopholes dept.

Programming 54

An anonymous reader writes "A German computer scientist is taking a fresh look at the 46-year-old Amdahl's law, which was the first serious attempt to quantify the limits of parallel computing relative to serial computing. The fresh look considers software development models as a way to work around those limits. 'DEEP keeps the code parts of a simulation that can only be parallelized up to a concurrency of p = L on a Cluster Computer equipped with fast general-purpose processors. The highly parallelizable parts of the simulation are run on a massively parallel Booster system with a concurrency of p = H, H >> L. The Booster is equipped with many-core Xeon Phi processors and connected by a 3D-torus network of sub-microsecond latency based on EXTOLL technology. The DEEP system software dynamically distributes the tasks to the most appropriate parts of the hardware in order to achieve the highest computational efficiency.' Amdahl's law has been revisited many times, most notably by John Gustafson."
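For reference, the law itself fits in a few lines. A minimal sketch of the classic fixed-workload formulation (the variable names are ours, not DEEP's), in Python:

def amdahl_speedup(serial_fraction, n_processors):
    # Classic fixed-workload form: the parallel part scales with n_processors,
    # the serial part does not.
    parallel_fraction = 1.0 - serial_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_processors)

print(amdahl_speedup(0.05, 64))     # ~15.4x on 64 processors
print(amdahl_speedup(0.05, 10**9))  # ~20x: a 5% serial part caps speedup at 1/0.05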

54 comments

Buzzword-heavy (4, Insightful)

Animats (122034) | about 10 months ago | (#44046925)

The article makes little sense. The site of the DEEP project [deep-project.eu] is more useful. It has the look of an EU publicly funded boondoggle. Those have a long history; see Plan Calcul [wikipedia.org], the 1966 plan to create a major European computing industry. That didn't do too well.

The trouble with supercomputers is that only governments buy them. When they do, they tend not to use them very effectively. The US has pork programs like the Alabama Supercomputer Center [asc.edu]. One of their main activities is providing the censorware for Alabama schools [asc.edu].

There's something to be said for trying to come up with better ways of making sequential computation more parallel. But the track record of failures is discouraging. The game industry beat its head against the wall for five years trying to get the Cell processor in the PS3 to do useful work. Sony has given up; the PS4 is an ordinary shared-memory multiprocessor. So are all the Xbox machines.

It's encouraging to see how much useful work people are getting out of GPUs, though.

Re:Buzzword-heavy (0)

Anonymous Coward | about 10 months ago | (#44046975)

Thank you for not forcing me to read that retarded string of characters that Slashdot apparently likes to call "news".

Re:Buzzword-heavy (3, Interesting)

cold fjord (826450) | about 10 months ago | (#44047029)

The article makes sense, but I don't think the work appears to be especially innovative even if it could be very useful.

It is more than governments that buy supercomputers. They are also used in industry for things like oil and gas exploration, economic modeling, and weather forecasting. Universities and research organizations also use them for a variety of purposes. Time on an actual supercomputer tends to be highly valuable and sought after. You may disagree with a particular use, but that is a separate question from whether the machines are being used effectively.

The Secret Lives of Supercomputers, Part 1 [technewsworld.com]

"It is probably the biggest trend in supercomputers -- the movement away from ivory-tower research and government-sponsored research to commerce and business," Michael Corrado, an IBM spokesperson, told TechNewsWorld. In 1997, there were 161 supersystems deployed in business and industry, but that figure grew to 287 by June 2008, he noted. "More than half the list reside in commercial enterprises. That's a huge shift, and it's been under way for years."

Uses for supercomputers [zdnet.com]

Re:Buzzword-heavy (-1)

Anonymous Coward | about 10 months ago | (#44047361)

GPUs are not multiprocessors ... they are CPUs which can do very fast operations on images... one at a time.
But they still are not multiprocessors.
THIS is why it's easy to get performance out of them.

Re:Buzzword-heavy (3, Interesting)

rioki (1328185) | about 10 months ago | (#44047749)

You might want to read / view these slides: An Introduction to Modern GPU Architecture [nvidia.com]. Especially slide 42.

Modern GPUs are massively parallel in their execution. Yes, they work "only" on one image, but when rendering one scene the shaders work in parallel. For example, a fragment (aka per-pixel) shader will be run in parallel for each pixel, limited by the number of available shader units (aka cores). THIS is why you get the awesome performance: small, self-contained programs running in parallel.

Re:Buzzword-heavy (2)

Bengie (1121981) | about 10 months ago | (#44062259)

GPU cores are broken into groups. Each group must be executing the exact same instruction at the exact same time. Branches are horrible for performance, as they will force some cores to stop computing altogether while waiting for the branch to finish.

There are many concurrent algorithms that don't keep the execution path in perfect sync. This is where a many-core CPU will take out a GPU in performance. GPUs also have horrible random access and very small caches. Actually, the per-core cache of GPUs has been going down over the years for both nVidia and AMD.

GPUs are excellent for what they're good at, and horrible for everything else. If I could remember the link, there was a 100GFlop Intel CPU kicking the crap out of a 2Tflop nVidia GPU on transcoding, even though both code-paths were highly optimized for each architecture. It just so happened that the algorithm used did not play well with GPUs.
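A back-of-the-envelope way to see the divergence penalty described above. This is a toy lock-step model with invented cycle counts, not any real GPU's scheduler:

# Toy SIMT model: all lanes in a warp execute in lock step, so a divergent
# branch pays for both paths instead of only the one each lane needs.
def warp_cycles(lane_takes_branch, cost_if, cost_else):
    if all(lane_takes_branch):
        return cost_if
    if not any(lane_takes_branch):
        return cost_else
    return cost_if + cost_else  # both sides run; masked-off lanes just wait

print(warp_cycles([True] * 32, 40, 10))                      # uniform warp: 40
print(warp_cycles([i % 2 == 0 for i in range(32)], 40, 10))  # divergent warp: 50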

Re:Buzzword-heavy (0)

Anonymous Coward | about 10 months ago | (#44047567)

As someone who actually gets paid by one of those EU publicly funded boondoggles, I'd have to say they do quite well (at paying me).

I'd even go so far as to say that there are some genuinely interesting results that come out of these projects.
There is just a lot of additional work that goes into massaging your results into something that fits the EU's mission statements and goals.

And of course the overall goals of developing some platform for EU industry or whatever is total BS.
Research projects like this will never accomplish something like that.
That mission is basically to develop solutions to problems that no one has.

Posting anonymous for obvious reasons.

Re:Buzzword-heavy (2)

smallfries (601545) | about 10 months ago | (#44047855)

How dare you criticise the author - he is a physicist and he has stooped to coming and telling us computer science types how to do it properly!

There is a deeply appropriate xkcd, but I cannot be bothered to find it. Decoding the garbage in the PCWorld story tells us that he is going to break Amdahl's Law by dynamically partitioning the workload between a fast single-threaded processor and many slower parallel processors. I would guess that by failing to make a fair comparison they can claim that the portion running under the boosted clock somehow beats the bounds predicted by Amdahl's law. Sadly it does not, as the law is worded in terms of the proportion of the code that can be executed on the parallel architecture.

It is quite possible that much of the hyperbole was added as sales pitch, which is a little unfortunate as the dynamic partitioning and the toolchain support are far more interesting anyway.

Re:Buzzword-heavy (1)

mysidia (191772) | about 10 months ago | (#44048129)

claim that the portion running under the boosted clock somehow beats the bounds predicted by Amdahl's law.

Right... their system cannot 'break' Amdahl's law. They bypass it by allowing the sequential portion of the workload to run on faster hardware, and the parallel portion of the workload to run on the massively parallel (but slower) architecture.

Designing an approach that allows better parallel computing despite Amdahl's law does not necessarily imply breaking the law.

It's more like: working cleverly within the constraints of Amdahl's law, to evade the issue.

For it to work, they have to have that super-fast sequential platform though.

Unless there continue to be linear speed performance improvements in the fast sequential execution architecture (and not just more cores on the commodity PCs), they will still run into Amdahl's law.
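A rough model of that "bypass": the hard-to-parallelize part runs on a faster clock, the rest fans out over many slow cores. The numbers below are invented purely for illustration:

def runtime(t_serial, t_parallel, serial_boost, booster_cores):
    # Serial-ish part on a node with a serial_boost-times faster clock,
    # parallel part spread over booster_cores slower cores.
    return t_serial / serial_boost + t_parallel / booster_cores

baseline = 10.0 + 90.0                   # 100 s on one reference core
split = runtime(10.0, 90.0, serial_boost=2.0, booster_cores=60)
print(baseline / split)                  # ~15.4x, and still capped near 20x
                                         # (serial_boost / serial fraction) no matter
                                         # how many booster cores you add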

Re:Buzzword-heavy (1)

postbigbang (761081) | about 10 months ago | (#44049923)

Go back further, to von Neumann, and you'll see that this is a hybrid model, where the state machine is respected, with management processes acting as controller daemons to child processes. It's not really a bypass, just a hybrid representation, as the distributed portions still respect Amdahl's precepts.

Re:Buzzword-heavy (1)

smallfries (601545) | about 10 months ago | (#44058731)

Your phrasing is kind of hard to parse - I actually can't tell if you are agreeing with what I wrote or arguing in a passive-aggressive way. That probably means I have had too many arguments with passive-aggressive people recently and need to learn to read things more neutrally again. But yes, that is what I was pointing out: tweaking the frequency in the fast sequential part is still covered by Amdahl's law, contrary to their wild hyperbole.

Re:Buzzword-heavy (2)

rgbatduke (1231380) | about 10 months ago | (#44048601)

Hey, don't disrespect physicists in parallel computing. Some of us actually understand how to do it properly and agree with what you state. Superlinear speedup is not precisely unknown, but it is rare and depends on architectural "tricks" that typically preserve Amdahl's law at a low level but apparently violate it at a higher level. In the naivest, stupidest example, if we counted processors instead of cores, even embarrassingly parallel code would exhibit superlinear speedup on a single-processor, multi-core system. Replace core count with internal ALUs, pipelines, SIMD/MIMD in the architecture, onboard vector units, etc., and one can get the same sort of thing per core for just the right code.

I am deeply skeptical of any sort of toolset that purports to be able to either statically or dynamically partition a given set of upper-level code to get superlinear speedup. I won't say it is impossible to build a set that "works" for some fraction of the parallelizable code in the Universe, but given the complex tradeoffs between computation and communication in different communication topologies and task partitionings, this is not a problem that has a simple universal solution. I suspect that in lots of cases an experienced parallel programmer could spend half a day analyzing the code and architecture and beat (or tie, as the usual rule is going to be no meaningful superlinear speedup for coarse-grained parallel or embarrassingly parallel code) the output from an automated tool.

An interesting example of a tool that does this sort of tuning (semi-empirically!) and works is ATLAS, the Automatically Tuned Linear Algebra Software. Basically it does a search of the space of partitionings and algorithms to determine the best combination of the two for performing the Basic Linear Algebra Subprograms (BLAS) and then implements it in a transparent library. It is semi-empirical because it is nearly impossible to predict the overall effect of every combination of SSE support, clock speed, bus speed, and core architecture -- it is a lot easier to just go and find out. But the problem ATLAS solves is comparatively simple relative to even static task partitioning on heterogeneous computational resources with variable costs for core-to-core communication, especially in today's multicore world where one has different speeds between cores on a processor, between processors in a system, between systems, and between general-purpose processors and special-purpose (e.g. GPU/vector) processors, and where the communication topology itself can have a major impact on the kind of parallel speedup any given task sees.
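For flavor, the kind of empirical search described here can be sketched in a few lines. This is a toy autotuner under the stated assumption that you already have several candidate kernels to choose from; ATLAS itself searches a far richer space of blockings, unrollings and kernels:

import time

def benchmark(kernel, *args, repeats=5):
    # Time a candidate empirically instead of trying to predict its cost.
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        kernel(*args)
        best = min(best, time.perf_counter() - start)
    return best

def tune(candidates, *args):
    # candidates: {name: callable}; returns the fastest variant on this machine.
    timings = {name: benchmark(k, *args) for name, k in candidates.items()}
    return min(timings, key=timings.get), timings

# Hypothetical use with several blocked matrix-multiply variants:
#   best, timings = tune({"block16": mm16, "block32": mm32, "block64": mm64}, A, B)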

This, then, is the interesting part, as you note, and who knows, maybe they've built a sufficiently intelligent system to get nominally superlinear speedup (or hell, who cares, just getting close to optimal speedup, sublinear or not) from a meaningful fraction of the space of possible parallelizable code. But God couldn't get superlinear speedup on fine-grained synchronous parallel code with long-range coupling out of any multi-node scalable parallel architecture available in the real world today, no matter how fancy the partitioning tool.

rgb

Re:Buzzword-heavy (0)

Anonymous Coward | about 10 months ago | (#44052551)

How dare you criticise the author - he is a physicist and he has stooped to coming and telling us computer science types how to do it properly!

He's a computational physicist, moron. Did you even read the guy's credentials?

Re:Buzzword-heavy (1)

RabidReindeer (2625839) | about 10 months ago | (#44047945)

The trouble with supercomputers is that only governments buy them.

Actually, not so. For about 15 minutes, I once owned a supercomputer myself, believe it or not.

It wasn't a major supercomputer, but it was classified as a true supercomputer, and I was acting as an intermediary for an oil industry company that had offended the seller, so the seller wouldn't sell directly to them.

Governments are definitely big consumers of supercomputers, but universities also do a lot of computationally-intensive work, not all of which is necessarily government-funded. I've already mentioned oil companies. I'm sure there are other cases.

Re:Buzzword-heavy (2)

rgbatduke (1231380) | about 10 months ago | (#44048801)

Double ditto. I've written magazine articles on Beowulf-style supercomputers I've built at home (I used to write a column for "Cluster World" magazine in the brief time that it existed, but I also wrote an article or two for more mainstream computer mags). I have also set up clusters for companies I've founded and helped others set up corporate clusters. Some of what are arguably the world's largest parallel supercomputers -- Google's cluster, for example -- are not government funded. Many others aren't government funded either; they are built by companies that sell products to many entities, among them (perhaps) the government. Aerospace engineering companies all need supercomputers to do computational fluid dynamics on hull designs, for example. Ordinary engineering companies use them to do finite element analysis. Gaming clusters are, by any sensible definition, highly parallelized, dynamically partitioning supercomputers.

Ever since the invention of PVM and open-source versions of MPI, anybody with a small pile of computational resources and a network has been able to implement a Beowulf-style supercomputer built from them, an architecture so successful that nearly all of the supercomputers built in the world today are basically "Beowulfs". I've helped a few dozen individuals (one at a time, not via my book or magazine articles) build Beowulf clusters at home just to dink around with for fun, or to learn a new job skill, or to set up a learning cluster at a small community college or university. No government funding, often out-of-pocket funding or repurposing old computers that were lying around. Not all of these clusters could beat Moore's Law, which inexorably eats Amdahl's lunch after a few years (that is, by the time they were built it was often the case that a single-processor, over-the-counter computer at the high end of clock speed would beat the small cluster made of older systems), but there is no doubt that they were supercomputer architectures with substantial (but sublinear) speedup compared to single-threaded execution times for a suitably parallelized chore.

Besides, it is useful to remember that your cell phone would have been considered a munition a bit over a decade ago. A better thing to state is that everybody buys supercomputers, because almost every processor-based system, from navigation systems in cars to cell phones to tablets to personal computers, is these days a supercomputer. My i7 laptop has four cores and eight contexts, and it exhibits linear speedup on in-cache embarrassingly parallel code out to eight simultaneous tasks, because Intel has done a pretty amazing job of internally parallelizing the execution subsystems for the contexts. It beats the hell out of almost all of the small clusters I've ever built, including clusters with many, many more nodes. Build even a small stack of i7 systems on a gigabit or better network -- two, for example -- and you've got a sixteen-core supercomputer with a complex communication topology (variable speeds and nonlinear thresholds, as the i7 does stop giving you the purely linear 8-way speedup for large enough tasks and drops down to a bit over four -- again an instance of "superlinear speedup" of parallel code, even WITHOUT using a fancy tool, if you simply count cores instead of contexts and ignore the internal parallelism that lets a single core manage memory I/O for one task while executing another).

rgb

Re:Buzzword-heavy (1)

grizdog (1224414) | about 10 months ago | (#44048319)

I agree. The article is next to worthless. In particular, it appears (and that is the problem - the article is just too vague) that they are not counting the GPU time against Amdahl's law. That's splitting hairs, at best.

There might be some "there there" if they tried to refine Amdahl's law to include different kinds of processors, and the kinds of physical restrictions they talk about. All the article does is say such a thing might be possible - I think we already knew that.

Re:Buzzword-heavy (1)

delt0r (999393) | about 10 months ago | (#44048327)

I use supercomputers all the time for my work. I am at a university, so this is perhaps a government one. But we are not idiots, and we use it quite effectively, thank you very much. Most of the supercomputers in the EU, at least, are for universities. They are mostly used quite well - at least all the ones I have used, which is quite a few of them.

Re:Buzzword-heavy (0)

Anonymous Coward | about 10 months ago | (#44050397)

Please don't use Alabama as an example of how effective government is, or what governments use supercomputers for. Not all states are Alabama--this is why I don't live there. Some states use their supercomputers for other things, like biochemistry, aerodynamics, climate modeling, or you name it. Things that private industry would never invest in because they're too selfish to see the benefit.

I'm tired of the government-bashing. Sure, there's pork, but there's way more pork in private industry. And services like NOAA are critical to everyone.

For every Alabama censoring project, there's private firms using supercomputers to cheat people out of money in the financial industry, or to make a buck by destroying ecosystems.

To be clear, I don't hate private industry. I'm just tiring of the unfair comparisons that get made, like somehow private industry is a lily-white utopia without any problems of its own. Both have their advantages and disadvantages.

FWIW, I agree with the general sentiment that this project seems like sort of a hand-waving exercise, which is common in research everywhere.

Xeon dream on (-1)

Anonymous Coward | about 10 months ago | (#44047033)

Xeon Phi = unavailable vaporware, doled out in single unit quantities in order to discourage folks from porting big science applications to CUDA.

This neverending vaporware is certainly not news for nerds or stuff that matters.

Re:Xeon dream on (3, Informative)

godrik (1287354) | about 10 months ago | (#44047167)

"Xeon Phi = unavailable vaporware"

You know, I wrote a paper on SpMV for Xeon Phi and I got quite a lot of people from all over the world asking me for clarification and for code. So it seems to be quite widespread. You can actually buy some online, Google points to several vendors.

"in order to discourage folks from porting big science applications to CUDA"

There are two things wrong with this statement. First of all, I do not think scientists are discouraged from giving CUDA a shot. Just check any scientific conference and you'll see GPUs and CUDA everywhere. Actually, we see so much GPU programming that it is getting boring.
Also, porting to CUDA is difficult and alien for most people. If we can get similar performance using a programming model people are used to, how is that not a good thing? What is so good about CUDA? It is pretty much just the only way to get good performance out of NVIDIA GPUs.

The tradeoff between performance, hardware cost and developer cost is a difficult one. I say let's throw them all in the arena and see what's left standing.

Disclaimer: my research is supported by both Intel and NVIDIA.

Re:Xeon dream on (2)

ImprovOmega (744717) | about 10 months ago | (#44051555)

Optimizing CUDA is almost, but not quite, as arcane as optimizing assembly code by hand. It requires a deep knowledge of the underlying architecture: the addressing, the memory read patterns, the role of each tier of memory and the cost of moving between tiers, the size restrictions on each buffer, and how to coalesce the whole mess into a coherent answer. I once got a 30% performance increase by offsetting the addressing on my memory buffers so that they didn't all start on 16-byte boundaries. That allowed the data to be read in parallel and avoided collisions from the different processes trying to access the same block at the same time. The problem is most programmers aren't particularly hardware oriented, so CUDA comes with a steep learning curve if you want to do it well.

Re:Xeon dream on (0)

Anonymous Coward | about 10 months ago | (#44054635)

"You can actually buy some online, Google points to several vendors"

Provide a link. The only thing that turns up are sucker traps that don't actually have phi for sale.

Re:Xeon dream on (0)

Anonymous Coward | about 10 months ago | (#44049481)

Actually, you can buy Xeon Phis. We have a pair in one of our machines. Also, I don't see why using a proprietary NVidia system is any better than using a proprietary Intel system. If you care about interoperability, you should be using OpenCL.

SMBC (2)

klapaucjusz (1167407) | about 10 months ago | (#44047095)

SMBC [smbc-comics.com]

Re:SMBC (1)

godrik (1287354) | about 10 months ago | (#44047185)

Yeah, there is nothing wrong with Amdahl's law. People who need to care about it clearly understand what it means. That is to say, when you increase parallelism, sequential parts become bottlenecks, and you need to re-engineer the problem/algorithm/architecture around that new bottleneck.

Re:SMBC (1)

Anonymous Coward | about 10 months ago | (#44047465)

"Sequential parts" usually mean "We won't know how to proceed further on until previous step is done". However, if you have really massive, 2^data_word_length or higher scale parallelism, then you can actually try guessing, and executing next step an all possible outcomes of previous step, then throwing away every result but one as previous step completes. Even if your parallelism is of lower scale, statistically it may still yield some speedup, whenever you happen to have a lucky guess. Sure beats letting all those processors just idling on helplessly.

Re:SMBC (2)

mysidia (191772) | about 10 months ago | (#44048155)

then you can actually try guessing, and executing next step an all possible outcomes of previous step, then throwing away every result but one as previous step completes.

However... this requires power consumption, and it still takes time and ties up your infrastructure working on the 'guess'. Meanwhile the previous step completes while your CPUs are all still busy working through the guesses, and you need additional sequential overhead to initiate and terminate the guessing process.

You show 'CPU usage' on your 'idle cores', but it's 99% waste heat.

Re:SMBC (1)

TheLink (130905) | about 10 months ago | (#44050965)

I've long wondered if you can set up a quantum computer to process "all possible paths" and then "collapse" onto the most probable right answer.

After all, you can have light beams (and other quantum-state-friendly stuff) that are superpositions of all possible states and perform functions on them.

Poor summary (5, Informative)

Anonymous Coward | about 10 months ago | (#44047315)

Amdahl's Law still stands. TFA is about changing the assumptions that Amdahl's Law is based on; instead of homogeneous parallel processing, you stick a few big grunty processors in for the serial components of your task, and a huge pile of basic processors for the embarrassingly parallel components. You're still limited by the fastest processing of non-parallel tasks, but by using a heterogeneous mix of processors you're not wasting CPU time (and thus power and money) leaving processors idle.

Repeat after me: (4, Insightful)

Mashdar (876825) | about 10 months ago | (#44048667)

Amdahl's Law only applies to individual algorithms. Amdahl's Law only applies to individual algorithms. Amdahl's Law only applies to individual algorithms.

Besides which, Amdahl's law is an obvious truth unless you can make a process take negative time. All attempts to make Amdahl's Law sound fancy or complicated are a disservice. All attempts to pigeonhole Amdahl's Law into only applying to parallel design are a disservice. Any attempts to "revisit" it are either fallacious or focus on algorithm changes, which Amdahl made no attempt to address.

Amdahl's law in a nutshell: if you spend 10% of your time on X and 90% of your time on Y, you will never get more than a 1/0.9 (roughly 1.11x) speedup by optimizing X, even if you manage to make X instantaneous. Another way to put it is that if Y takes 9 seconds, you are never going to get the process under 9 seconds by modifying X.

Re:Poor summary (1)

Impy the Impiuos Imp (442658) | about 10 months ago | (#44048669)

Most of the cool stuff is pure parallel anyway, like the brain, or simulations of bodies made of atoms or cells. Plenty of room to grow regardless of some un-de-serializable algorithms.

Re:Poor summary (0)

Anonymous Coward | about 10 months ago | (#44055889)

So most of the cool stuff is pure nothing, right?

Re:Poor summary (0)

Anonymous Coward | about 10 months ago | (#44055929)

The smallest unit within any task is sequential. You're never "pure parallel".

please don't censor (-1)

Anonymous Coward | about 10 months ago | (#44047419)

Nuke nuke, submarine, allah, osama, obama, AES, encryption, DES, cracking, hacking crypto cryptanalysis hijack international diplomatic relations afghanistan iraq oil resources gas mining EVISCERATOR FASCINATOR HAVEQUICK SUITE A SUITE B MEDLEY, SHILLELAGH, BATON, SAVILLE, WALBURN, JOSEKI-1, LOFLAC, DRS, DNFQBT, ELLIPTIC CURVE P=NP bilderberg illuminati tavistock

Now that I've got your attention, NSA analyst who is reading this flagged message:

Do you think your illegal and invalid behavior will endure against universal karmic justice? Resign. Simply quit and get out with your soul still intact. YOUR choice to do what is right will cost you nothing and gain you everything.

All others who agree with this message need to repost. You have been asking what YOU can do? THIS. :yeah:

Re:please don't censor (-1, Offtopic)

umghhh (965931) | about 10 months ago | (#44047753)

Normally I just get pissed off and do something else after reading /. for too long - thanks to this AC post I laughed for a while. Give him some points!

Re:please don't censor (-1)

Anonymous Coward | about 10 months ago | (#44048087)

A repost wouldn't work; they filter those out. Just add a few random nefarious, but natural-sounding, sentences at the beginning of each post.

Not again (0)

Anonymous Coward | about 10 months ago | (#44047611)

Every time I hear someone start on about how Amdahl's law is wrong, it means one of two things:
1. They want your attention and their topic isn't interesting enough without resorting to controversial statements.
2. They don't understand Amdahl's law.

Also, unless you're presenting a summary of the history of computing, you really shouldn't have a figure of Moore's law.
Some people handle this well. When they get to that point in their presentation, they just say:
"And this is the mandatory picture of Moore's law."
And skip to the next slide.

Shi's Law, revisited (1)

G3ckoG33k (647276) | about 10 months ago | (#44047661)

In 2006 I submitted this (http://slashdot.org/comments.pl?sid=183461&cid=15153431):

"Researchers in the parallel processing community have been using Amdahl's Law and Gustafson's Law to obtain estimated speedups as measures of parallel program potential. In 1967, Amdahl's Law was used as an argument against massively parallel processing. Since 1988 Gustafson's Law has been used to justify massively parallel processing (MPP). Interestingly, a careful analysis reveals that these two laws are in fact identical. The well publicized arguments were resulted from misunderstandings of the nature of both laws.

This paper establishes the mathematical equivalence between Amdahl's Law and Gustafson's Law. We also focus on an often neglected prerequisite to applying the Amdahl's Law: the serial and parallel programs must compute the same total number of steps for the same input. There is a class of commonly used algorithms for which this prerequisite is hard to satisfy. For these algorithms, the law can be abused. A simple rule is provided to identify these algorithms.

We conclude that the use of the "serial percentage" concept in parallel performance evaluation is misleading. It has caused nearly three decades of confusion in the parallel processing community. This confusion disappears when processing times are used in the formulations. Therefore, we suggest that time-based formulations would be the most appropriate for parallel performance evaluation."

Maybe it will be helpful again.
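The equivalence claimed in that abstract is easy to check numerically once both laws are written in terms of processing times rather than a "serial percentage". A small sanity check (not the paper's derivation):

def speedups(t_serial, t_parallel_on_p, p):
    # One run, two bookkeepings: Amdahl's fraction is taken relative to the
    # 1-processor time, Gustafson's relative to the p-processor time.
    f = t_serial / (t_serial + p * t_parallel_on_p)   # Amdahl's serial fraction
    s = t_serial / (t_serial + t_parallel_on_p)       # Gustafson's serial fraction
    amdahl = 1.0 / (f + (1.0 - f) / p)
    gustafson = s + p * (1.0 - s)
    return amdahl, gustafson

print(speedups(t_serial=2.0, t_parallel_on_p=3.0, p=8))  # (5.2, 5.2) -- identical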

Re:Shi's Law, revisited (0)

Anonymous Coward | about 10 months ago | (#44052351)

Thank you for this excerpt; the paper link seems to be down, though!
A working link is: http://spartan.cis.temple.edu/shi/public_html/docs/amdahl/amdahl.html
(seems the Star Wars guy got replaced by a Hellenophile, as the server was called yoda before)

Not breaking Amdahls law (2)

sjames (1099) | about 10 months ago | (#44048687)

This most certainly does NOT break Amdahl's law. It simply partitions the problem to use the cheap gear for the embarrassingly parallel portion of the workload and the expensive gear for the harder to parallelize workload.

It necessarily cannot make a non-parallelizable portion (the serial part) run in parallel.

Note that what part of the problem is serial depends on the hardware. The lower the latency and the higher the bandwidth of the interconnect, the more of the problem you can get to run effectively in parallel. However, there comes a point where the problem cannot be decomposed further. The atoms that remain after that may all be run at once, but the individual atom will run serially. No matter what you do, 5*(2+3) can go no faster than serially adding and then multiplying (yes, you could do two multiplications in parallel and then add, but you gain nothing for it).

Re:Not breaking Amdahls law (0)

Anonymous Coward | about 10 months ago | (#44050051)

5*(2+3) decomposes to 4 * (2 + 3) + (2 + 3), which can be boosted by using the hardware address resolver (which is a 3-way native adder with bitshift).

Re:Not breaking Amdahls law (0)

Anonymous Coward | about 10 months ago | (#44050231)

Amdahl's law is about the bandwidth and the time to compute something.

For 5*(2+3) you have a fixed amount of time.
Using your special instruction, 4*(2+3) + (2+3) changes where the data goes (and thus the time to send/recv). You can use Amdahl's law to compare each one and see if you have decreased the time to send/recv the results.

You can also decompose it into 2+3+2+3+2+3+2+3+2+3, which is wildly parallel. But the cost to send it out and get the data back may be more expensive than just doing the multiply. You have to try it on your hardware to get an idea of what it would do, as decomposing it this way makes it highly dependent on your interconnects.

What Amdahl was getting at was: do not ignore your interconnects when working out the time to do something. Which *many* of our CS big-O analyses ignore.
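A toy illustration of that interconnect point, with invented costs, just to show how the send/receive term can swamp the arithmetic:

def farmed_out(n_chunks, work_per_chunk, latency_per_message):
    # Chunks run concurrently, but each costs a send and a receive on the wire.
    return work_per_chunk + 2 * n_chunks * latency_per_message

serial = 6e-9                               # one add, one multiply: nanoseconds
parallel = farmed_out(n_chunks=5, work_per_chunk=3e-9, latency_per_message=1e-6)
print(serial, parallel)                     # ~6e-09 vs ~1e-05: the interconnect wins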

It's revisited every day (0)

Anonymous Coward | about 10 months ago | (#44048755)

Amdahl came up with this law to voice his scepticism about parallel computing, in favor of better uniprocessors.

The law is revisited every day by anyone who needs to calculate the cost of software licenses.

Nowadays the cost of HW is minuscule compared to the cost of, for example, off-the-shelf Oracle databases that are licensed based on how many cores a system has.

Smart money would hoard dual-core Xeon processors and Oracle licenses while they are still available for these processors, as it can mean saving hundreds of thousands of dollars in licensing costs over the next few years.

There's only ONE law (0)

Anonymous Coward | about 10 months ago | (#44049155)

Brannigan's Law.

Most misunderstood 'law' (-1)

Anonymous Coward | about 10 months ago | (#44051345)

Cretins love to quote Amdahl's Law as the reason multi-core computing solutions are a waste of time. According to these cretins, no building site should ever employ more than 4 workers, because the overhead of using more people outweighs the extra work they can do.

Amdahl was a prize idiot working on the moronic idea of automatically converting ordinary single-thread algorithms into a parallel form that could execute faster on multiple computing cores. This was the kind of garbage promoted in the early days of so-called supercomputers. It allowed certain kinds of PhD employees to appear useful to the DoD.

What Amdahl was actually excited about was DEPENDENCIES in co-dependent computing threads. Amdahl came to the conclusion, already known to every toddler in the real world, that no task can usefully be split into many multiple parts if those parts need to keep referencing each other while they are being executed.

Let me make this simpler for those of you thick enough to think that knowing things like Amdahl's Law actually represents a 'skill' in computer science. Today the Xbox 360 and PS3 are around 8+ years old. However, the games they run today are infinitely more sophisticated than when the hardware first appeared. Why? Because each platform actually had multiple computing units, both on the CPU and GPU side. As game programmers learnt to use each of these units to their maximum processing capability, games became more sophisticated.

So how did they 'avoid' Amdahl's Law? Well, as I said at the top, Amdahl's Law is utter tosh in most circumstances clods think it applies in. Modern games use what is known as 'work unit' programming - a method EXACTLY analogous to the building site example I gave at the top. Essentially you break the process of producing each game frame down into as many self-contained computing units as possible. These units are designed to have limited and obvious dependencies on one another, just like different builders working on that site. These units are assigned to hardware computing cores as they become available.

The new consoles, the Xbox One and PS4, have EIGHT CPU cores and many hundreds of GPU processing elements. According to the dribblers that endlessly quote Amdahl's Law, this is a complete waste of time. After all, Amdahl's Law supposedly states that scaling beyond around 3.5 computing units makes extra computing units pointless. Now, this IS entirely true with non-core-aware computing on the Windows PC (which is the reason Intel has been so reluctant to go beyond 4 cores).

The Windows PC is actually a great place to explore Amdahl's Law, but in a different way. Windows makes no attempt to run a single thread on multiple cores at the same time, as clods like Amdahl tried to do. However, even so, the law still applies when your Windows session creates ever larger pools of threads from multiple applications that Windows distributes across all available x86 cores. Two x86 cores just about double one. The third core acts like 0.5 extra cores. The fourth core adds really very little. Cores beyond this are a waste of time for the most part (AMD's major problem with its 8-core Bulldozer/Piledriver parts).

Of course, core-aware software CAN, if it has the right kinds of algorithms, scale perfectly to more than 4 cores. Such software is vanishingly rare, though.

Anyway, please stop quoting Amdahl's Law - it really never had anything useful to say even when it was first coined, and today it is used purely to troll forums where people are developing for modern multi-core computing systems.
