Software to Make Blue Gene Top 200 Teraflops

Become a fan of Slashdot on Facebook

Software to Make Blue Gene Top 200 Teraflops 171

Posted by timothy on Friday June 23, 2006 @04:33PM from the crunching-rather-than-taste dept.

An anonymous reader writes "New Scientist has a story about the most intensive computer program ever created. It runs on IBM's big beast, Blue Gene/L, at Lawrence Livermore National Laboratory in California and carries out 207.3 teraflops (trillion cacluations per second). The program, called Qbox, performs very complex quantum calculations to simulate the behaviour of thousands of atoms in three dimensions. Wow."

This discussion has been archived. No new comments can be posted.

Software to Make Blue Gene Top 200 Teraflops

Load All Comments

Search 171 Comments Log In/Create an Account

Comments Filter:

Slight clarification (Score:4, Funny)

by LiquidCoooled ( 634315 ) writes: on Friday June 23, 2006 @04:34PM (#15592323) Homepage Journal

It does not perform very complex quantum calculations, instead
It simulates interactions between 1000 molybdenum atoms under high pressure using equations that take the quantum behaviour of electrons into account.

Also, when its not being used to dynamically model atomic structures, the IRS uses it to calculate Bill gates's taxes.

Share
twitter facebook
- Re:Slight clarification (Score:5, Funny)
  
  by rolfwind ( 528248 ) writes: on Friday June 23, 2006 @05:06PM (#15592565)
  
  And it almost makes the requirements for Vista!
  
  Parent Share
  twitter facebook
  - Re:Slight clarification (Score:2)
    
    by Fordiman ( 689627 ) writes:
    
    <obligatory>
    But can it run Linux?
    </obligatory>
  - Couldn't run vista.... (Score:2)
    
    by woolio ( 927141 ) writes:
    
    I think BG/L only has something like 256MB ram per node... And with no hard disk and no video card!
    
    But could it run Linux? Yep, it does!
- Re:Slight clarification (Score:3, Interesting)
  
  by Memnos ( 937795 ) writes:
  
  At the unfortunate risk of repeating myself on Slashdot (Oh, the Humanity!) you are correct. It is intrinsically impossible for a discrete-state system to model quantum mechanical events, unless you somehow sneaked under the Planck limit (There is no spoon..) So, they're faking it.. However, if it is a good model of "reality", then it is good science. If it can predict, it is useful.
  - Re:Slight clarification (Score:2, Informative)
    
    by Bill Barth ( 49178 ) writes:
    
    It's not "fake" so much as it's an approximation. I guarantee you the know by exactly how much they are in error (but not in what direction!). The Schroedinger Equation that is at the heart of this represents the probability (well its modulus does, at least) of something as a continuous function of space and time. These scientists make errors in that the equations that they use are discrete (in terms of mathematical degrees of freedom, strictly speaking, by discretizing space and time directly) models of th
  - Re:Slight clarification (Score:3, Insightful)
    
    by mfago ( 514801 ) writes:
    
    impossible for a discrete-state system to model quantum mechanical events
    Huh? QM was a while ago, but I'm afraid you'll have to give a reference or two. You're saying that Density Functional Theory [wikipedia.org] is impossible? The authors (of DFT) did win the Nobel proze a while ago, so I'm sure I'm missing something. Mind you, any implementation is only an approximation, but that's true of almost any computational method.
    - Re:Slight clarification (Score:2, Insightful)
      
      by Memnos ( 937795 ) writes:
      
      Yes. I am am saying that a discrete-state-system, such as a Markov chain, cannot follow quantum mechanical events. QM state reduction is not beforehand deterministic because it it follows a wave function that be known beforehand in its full vector state (e.g. position and velocity.) If you wish references I would need to look them up, except for my remembrances of Richard Feynman and Stephen Hawking lecturing to me on this subject, and my own experiments. But I can find them. That neither obviates your
  - Re:Slight clarification (Score:2)
    
    by aminorex ( 141494 ) writes:
    
    > However, if it is a good model of "reality", then it is good science. If it can predict, it is useful.
    
    Only if it is open source. Otherwise, it belongs in the Journal of Irreproducible Results. Unless I can reproduce the numerical experiment, the predictions are as meaningful as a call to the psychic friends network.
Yeah, but... (Score:3, Funny)

by wiz31337 ( 154231 ) writes: on Friday June 23, 2006 @04:34PM (#15592325)

Yeah, but can it beat Kasparov at chess?

Share
twitter facebook
- Re:Yeah, but... (Score:5, Funny)
  
  by elrous0 ( 869638 ) * writes: on Friday June 23, 2006 @04:53PM (#15592476)
  
  It's so powerful, it can beat Kasparov in chess and monitor millions of phone calls for the NSA *at the same time*!
  -Eric
  
  Parent Share
  twitter facebook
  - Re:Yeah, but... (Score:4, Funny)
    
    by elrous0 ( 869638 ) * writes: on Friday June 23, 2006 @05:14PM (#15592617)
    
    Geez, I am sick of getting modded down for this. Can /. please stop giving the White House unlimited mod points?
    -Eric
    
    Parent Share
    twitter facebook
    - - Re:Yeah, but... (Score:2, Insightful)
        
        by Gorshkov ( 932507 ) writes:
        
        The *real* sad truth is that inspite of the arrogance of some people, not voting the way you do does not mean that people are stupid, evil, or mindless. It just means they disagree with you.
        
        Re:Yeah, but... (Score:2)
        
        by Fordiman ( 689627 ) writes:
        
        Oh, my, GOD, thank you.
        
        It's good to see a sensible human around for once.
        
        Re:Yeah, but... (Score:3)
        
        by Doc Ruby ( 173196 ) writes:
        
        The real sad truth is that the arrogance of Bush voters lets you think that putting that criminal in charge of the world, then lying us into the Iraq War, letting New Orleans drown, spending our surplus into $9-45 TRILLION debt, shredding American global credibility and an endless list of other stupid, evil and/or mindless acts is just a "disagreement".
        
        That's your excuse, but turning your responsibility for electing that criminal into a "disagreement" with people who voted for a competent president, especia
    - - Re:Yeah, but... (Score:3, Insightful)
        
        by elrous0 ( 869638 ) * writes:
        
        we aren't loud and obnoxious like you.
        when you already control everything, you don't HAVE to be loud.
        As for obnoxious, well, that's in the eye of the beholder. For example, I consider monitoring my phone calls and locking people up without due process to be pretty damn obnoxious. But that's just me.
        -Eric
- Re:Yeah, but... (Score:3, Funny)
  
  by Sabaki ( 531686 ) writes:
  
  No, but it's already mapped his genome and is working on a clone that will be completely under its control.
Too bad for Q-box... (Score:4, Funny)

by vishbar ( 862440 ) writes: on Friday June 23, 2006 @04:38PM (#15592353)

New Scientist has a story about the most intensive computer program ever created.

Too bad for Q-Box that their title will be stripped of them so soon. Vista's almost here.

Wait a minute, Vista? Nevermind...Q-box should have it for a long while.

Share
twitter facebook
- Re:Too bad for Q-box... (Score:2, Funny)
  
  by bl00d6789 ( 714958 ) * writes:
  
  Since QBox's title is for requiring the most computing power to carry out its intended application, Vista may well unseat it. It's just that QBox's intended application is extremely complex quantum physics calculations, and Vista's intended application is letting people check their email. So... not quite a victory for Vista.
More importantly... (Score:2, Funny)

by batmn42 ( 158573 ) writes:

"Wow."

More importantly, at what FPS does it play WoW?

Though I wouldn't be surprised if it needs a new graphics card for Crysis [youtube.com]...
Only the most intensive USEFUL program (Score:5, Funny)

by stratjakt ( 596332 ) writes: on Friday June 23, 2006 @04:42PM (#15592383) Journal

I mean, I'm sure I could use up more than 200 teraflops with my "while (1);" program.

Share
twitter facebook
- Re:Only the most intensive USEFUL program (Score:3, Informative)
  
  by frank_adrian314159 ( 469671 ) writes:
  
  while(1); uses no FLOPS. OTOH, if you used while (1.0);...
  (And for those of you who are humor-impaired, I do realize that neither would use any FLOPS because they would both be optimized into L1: jmp L1).
  - Re:Only the most intensive USEFUL program (Score:3, Funny)
    
    by owlstead ( 636356 ) writes:
    
    I would not use an optimizing compiler on that one either :)
- Re:Only the most intensive USEFUL program (Score:2, Funny)
  
  by Ant P. ( 974313 ) writes:
  
  Don't be silly.
  
  Everyone knows Linux can finish that loop in 5 seconds.
  - "Infinite" loops (Score:2)
    
    by mangu ( 126918 ) writes:
    
    Linux can finish that loop in 5 seconds
    
    Not *that* infinite loop. The "infinite" loop that Linux and any other OS can finish in 5 seconds (if the CPU speed is right) is:
    int n; for (n = 1; n > 0; n++) ;
    This loop will actually finish because n will overflow and become negative after it reaches the largest value that can be represented as an integer in the machine it's running.
    - Re:"Infinite" loops (Score:2)
      
      by yarbo ( 626329 ) writes:
      
      yarbo@oxygen /crap/src/temp/inf $ cat inf.c
      int main(){
      int n;
      for (n = 1; n > 0; n++) ;
      return 0;
      }
      
      yarbo@oxygen /crap/src/temp/inf $ time ./inf
      
      real 0m6.761s
      user 0m6.748s
      sys 0m0.003s
      
      #and just for fun
      yarbo@oxygen /crap/src/temp/inf $ gcc -O2 -fomit-frame-pointer inf.c -o infO2
      yarbo@oxygen /crap/src/temp/inf $ time ./infO2
      
      real
- Re:Only the most intensive USEFUL program (Score:2)
  
  by necro2607 ( 771790 ) writes:
  
  It's pretty funny to see that mentioned because just the other night I was thinking about the "while" function as I've never really thought of this before but I realized you could just put a "1" in the parentheses and it would return true indefinitely and you could get some pretty fun results, perfect for prank-related endeavours and so on. Then I thought, "oh, like all those texts from the 90s [that I didn't understand]"... ;)
  - Re:Only the most intensive USEFUL program (Score:2)
    
    by Al Dimond ( 792444 ) writes:
    
    In C, and in most other languages, "while" is not a function, it's a looping construct. It doesn't return anything. Hate to be a pedant, but this is very important to understanding its operation.
- Re:Only the most intensive USEFUL program (Score:3, Insightful)
  
  by TheSkyIsPurple ( 901118 ) writes:
  
  Sorry, but I imagine you'd keep one of the many processors very busy, with the rest left idling away.
  Now, spawn a thread for each processor running this, and you might have something =-)
  - Re:Only the most intensive USEFUL program (Score:2)
    
    by CarpetShark ( 865376 ) writes:
    
    with a good compiler designed for the machine, something like:
    
    #define NUMPROCS x
    int array[NUMPROCS];
    
    function getval(int indx) {
    return array[indx];
    }
    
    while(1) {
    for(i=0; iNUMPROCS; i++) {
    array[i] ^= getval(i);
    }
    }
    
    should probably be optimised for multiple processors. I'm not sure how fine-grained the optimisation is, but I doubt you have to manually launch threads to get
- Re:Only the most intensive USEFUL program (Score:2)
  
  by dave1g ( 680091 ) writes:
  
  Hopefully that would use exactly ZERO FLOPS given that only the integer unit would be used and no floating point calculations would be made :-p
...wow... (Score:5, Interesting)

by sarlos ( 903082 ) writes: on Friday June 23, 2006 @04:43PM (#15592402)

So in essence, it takes about .2 teraflops per atom... And that was only after spending a lot of time condensing the algorithms. This makes me wonder two things. First, what do these equations look like such that it takes 200 gigaflops just to model one atom. Second, over what timeframe does this simulation take place? Are we talking real-time, calculating for 50 years, what?

Regardless, as a computer scientist, I say way to go to these guys, this is damn impressive.

Share
twitter facebook
- Re:...wow... (Score:2)
  
  by ScottLindner ( 954299 ) writes:
  
  I can't imagine it's real time. From what I understand, most chaotic simulations are far far slower than real time.
- it doesn't work like that (Score:4, Informative)
  
  by tpjunkie ( 911544 ) writes: on Friday June 23, 2006 @04:52PM (#15592468) Journal
  
  It doesn't take .2 teraflops to model one atom, or even two atoms, even account for effects on the quantum level.. However, when you take into acount that each atom will more or less interact with every other atom, you have a massive amount of interactions to model. Thats what takes so much processing power.
  
  Parent Share
  twitter facebook
- Re:...wow... (Score:3, Insightful)
  
  by MustardMan ( 52102 ) writes:
  
  So you're a computer scientist, but you apparantly don't understand Big-O notation or the concept that algorithms don't neccesarily scale linearly with the number of elements.
- Re:...wow... (Score:5, Informative)
  
  by mhore ( 582354 ) writes: on Friday June 23, 2006 @04:57PM (#15592507)
  
  So in essence, it takes about .2 teraflops per atom... And that was only after spending a lot of time condensing the algorithms. This makes me wonder two things. First, what do these equations look like such that it takes 200 gigaflops just to model one atom. Second, over what timeframe does this simulation take place? Are we talking real-time, calculating for 50 years, what?
  0.2 TFlops per atom, yes. But there are 1000 atoms, and it's molybdenum which has 42 eletrons... so that's 42,000 particles that all interact with each other. Still... that's not too many. But maybe they're considering interactions between nuclei, too. Who knows...
  As for your question about what the equations look like? They're probably very nasty integrals of sines and cosines and what not to various odd (read: strange) powers and stuff. I do fairly computationally intensive simulations on some big IBM machines and just simple equations can amount to quite a bit of calculations. Nothing like what these guys are doing, though.
  Finally... what time frame is the simulation over? I'd wager VERY SHORT times, maybe nanoseconds or something like that. Even casual "molecular dynamics" simulations can only probe very short timeframes. Their coarse-grained cousins can maybe do microseconds or milliseconds.
  Mike.
  
  Parent Share
  twitter facebook
  - - Re:...wow... (Score:2)
      
      by mhore ( 582354 ) writes:
      
      42 thousands particle? you have any idea what is size of electrons vs atoms? interaction between nuclei? do you understand what you talking about? sines and cosines.... adn this is 5 score comment! rated informative!
      I'll bite this time. Just once. I have a very good idea what I'm talking about. I said sines and cosines because to a first approximation the wavefunctions of the atoms probably resemble that, so I'd assume the interactions be built off of them in some fashion. I'm sure you took elementary qu
- Re:...wow... (Score:3, Informative)
  
  by exp(pi*sqrt(163)) ( 613870 ) writes:
  
  In a classical physical system the time to compute what happens to N particles typically grows as a polynomial in N. The masses and positions of the particles form a 6N dimensional space (3 for velocity, 3 for position) and you're typically trying to trace a path through that 6n-dimensional space.
  In quantum mechanics the state of the system is defined by a wavefunction on a 3N dimensional space. The state of a system is no longer a point, it's a *function* on a 3N dimensional space. That means that at any
  - Re:...wow... (Score:2)
    
    by mfago ( 514801 ) writes:
    
    Actually, in classical molecuar dynamics, the algorithm is usually N^2. However, in this case "N" is the number of _electrons_, not atoms, i.e. 42000 electrons.
    
    Oh, and this is not classical physics, but QM. Thus each electrons wave function has to be represented by a (possibly substantial) set of basis functions. Not sure if anyone's been able to get Density Functional Theory (DFT) to scale that high, but if so, DFT scales as (IIRC) either N^7 or N^9. Ouch! Sure there are tricks, such as pseudopotentials th
    - Re:...wow... (Score:2)
      
      by exp(pi*sqrt(163)) ( 613870 ) writes:
      
      the algorithm is usually N^2
      As I say, modulo a polynomial. The complexity of quantum systems typically grows exponentially because we're looking at the tensor product of the subsystems.
      I'd love to find out a bit more about the algorithms used here. And I'd be interested to know what kind of validation there is for the methods. I guess I can start here [wikipedia.org]. (My background is more particle physics than many-body systems.)
- Quantum Monte Carlo (Score:3, Informative)
  
  by poszi ( 698272 ) writes:
  
  First, what do these equations look like such that it takes 200 gigaflops just to model one atom.
  The article is light on details but I suppose the only quantum algorithm that can handle 1000 atoms is Quantum Monte Carlo [wikipedia.org]. The problem is that the algorithm is cubic with the number of particles (and has a huge prefactor). So in essence 1000 atoms is 1000^3=10^9 more time consuming than one. And I'm sure they still use dramatic simplifications, even though they have the most powerful computer. They probably
- Re:...wow... (Score:2)
  
  by Metasquares ( 555685 ) writes:
  
  As another computer scientist (specializing in algorithms), I think this is inefficient and needs further research :)
- - Re:...wow... (Score:2)
    
    by MustardMan ( 52102 ) writes:
    
    And that's why you're an engineer and not a scientist. You can't TEST a theory unless you can see whether that theory actually WORKS to predict what really happens. It's kind of hard to carry out the calculations by hand to test a model that includes a thousand atoms.
  - Re:...wow... (Score:3, Insightful)
    
    by gardyloo ( 512791 ) writes:
    
    Because those atoms do their things on VERY short timescales. There's no way you can probe what they're doing on short enough time (and length) scales, even with pump-probe laser experiments, and track movements. Possibly, in some very special circumstances, you can look at beginning and ending states, and then figure out intermediate states. However, in general, this isn't possible, and so we need such simulations to track in-between processes, especially in ergodic systems.
  - Re:...wow... (Score:3, Insightful)
    
    by poszi ( 698272 ) writes:
    
    Why not just get 1000 Molybdenum atoms and watch what the fuck they do
    Because they are apparently simulating them under extreme conditions that are present during nuclear explosions. And nuclear tests are banned.
    - Re:...wow... (Score:2)
      
      by owlstead ( 636356 ) writes:
      
      "Because they are apparently simulating them under extreme conditions that are present during nuclear explosions. And nuclear tests are banned."
      
      Never understood that stuff. You throw one, we throw 100, you throw 10000 and the earh is destroyed. It goes bang in a big way, and contaminates everything in the direct surrounding. What do you need a super-computer for? Why would you need to test such a thing in the first place?
      
      Now if they would put UD on it you could topple the top ranking in cancer research. /ra
- - Re:...wow... (Score:2)
    
    by Fordiman ( 689627 ) writes:
    
    It would be 999 relationships per atom, 999000 relationships in total. Each relationship would require vector calculations, positional calculations, calculations of EM field flux, electron position probability, etc, which would bring the calculations up into the thousands per relationship (I wouldn't be surprised if it's greater than 200,000 discrete operations). That brings us well into the teraflop range for realtime (one second virtual = one second real) processing.
Molest me not (Score:5, Funny)

by Weaselmancer ( 533834 ) writes: on Friday June 23, 2006 @04:44PM (#15592407)

The program, called Qbox, performs very complex quantum calculations to simulate the behaviour of thousands of atoms in three dimensions.

"Molest me not with this pocket calcualtor stuff." [earthstar.co.uk]

Share
twitter facebook
How to test a nuke.. without testing one (Score:4, Insightful)

by ScottLindner ( 954299 ) writes: on Friday June 23, 2006 @04:46PM (#15592418)

How do they know they got it right?

Share
twitter facebook
- Re:How to test a nuke.. without testing one (Score:2, Funny)
  
  by stratjakt ( 596332 ) writes:
  
  They simply check the result by hand.
  
  Call your broker, because it's a good time to invest in pencil and paper futures.
  - Re:How to test a nuke.. without testing one (Score:2)
    
    by ScottLindner ( 954299 ) writes:
    
    ROFL.. that's awesome.
    
    For the record.. I'm not throwing stones at them. It just struck me as a somewhat amusing way to think about it. How *do* they know they got it right?
- Re:How to test a nuke.. without testing one (Score:2, Interesting)
  
  by LiquidCoooled ( 634315 ) writes:
  
  Thats actually quite simple.
  
  If they are modelling everything without calibration from known experimental results then anything this machine can produce is as trustworthy as internet gossip.
  
  For instance, if you were creating a weather prediction machine (easier to explain), you would feed it with all your historical data and allow the calculations to run from a set date in the past. If the results matched up with actual observed results for the following day/week/periods then you begin to build confidence i
  - Re:How to test a nuke.. without testing one (Score:2)
    
    by ScottLindner ( 954299 ) writes:
    
    Yah.. I know. I was trying to be funny. Didn't work so well I guess.
- Re:How to test a nuke.. without testing one (Score:2)
  
  by owlstead ( 636356 ) writes:
  
  Why, they compare it with the 100ths of useless tests that pre-dated it of course. If it did not go "boom" it went wrong.
  - Re:How to test a nuke.. without testing one (Score:2)
    
    by ScottLindner ( 954299 ) writes:
    
    They weren't useless tests. They were measuring yield and I'm sure a wide variety of other things.
    
    However, the tests are useless in concept. I mean.. I personally would prefer they all were a dud.
- - Re:How to test a nuke.. without testing one (Score:2)
    
    by Fordiman ( 689627 ) writes:
    
    Spoken like a true creationist.
    
    Tell me, do you do barmitzvahs?
Smart, sure. But is it happy? (Score:5, Funny)

by fred_sanford ( 678924 ) writes: on Friday June 23, 2006 @04:48PM (#15592434)

Oblig. H2G2. "Here I am, brain the size of a planet and they ask me to take you down to the bridge. Call that job satisfaction? 'Cos I don't." - Marvin

Share
twitter facebook
Just wait... (Score:4, Informative)

by Raul654 ( 453029 ) writes: on Friday June 23, 2006 @04:48PM (#15592436) Homepage

BlueGene/L has a sister project, Cyclops64 (formerly known as BlueGene/C) due out sometime late in 2006 or early 2007. My research group is (a) helping IBM do hardware verification on it. and (b) developing the systems software for it [esp. the compiler]. Cyclops64 could very well blow BlueGene/L out of the water.

Share
twitter facebook
- Re:Just wait... (Score:2)
  
  by LWATCDR ( 28044 ) writes:
  
  The compiler sounds like about as much fun as the one for Cell.
  Sounds like a very interesting project. I guess you have no problem writing and debugging multithreaded code?
  - Re:Just wait... (Score:5, Interesting)
    
    by Raul654 ( 453029 ) writes: on Friday June 23, 2006 @05:41PM (#15592802) Homepage
    
    Cell was designed around one single objective - to get a clock rate as sickeningly high as possible, because clock speed cells. Trust me when I say that programmability was not (at all) a consideration (I should mention - my research group got one of the very first Cell processor's sent to the US. We are currently in the process of implimenting OpenMP on it to make it a little nicer to program).
    
    As far as writing multi-threaded code, I've spent the last 5 months rewriting the NAS CG benchmark to work effeciently on Cyclops64, which will probably play some part of my PhD thesis. (A sidenote: All of NASA's NAS implimentations are written in Fortran (except Integer Sort), which would have necessitated me rewriting NAS-CG in C. Fortunately, I didn't have to start from scratch, because the Japanese had already done the hard part [phase.hpcc.jp]).
    
    Parent Share
    twitter facebook
    - Re:Just wait... (Score:2)
      
      by LWATCDR ( 28044 ) writes:
      
      I did notice when I read the description of the Cyclops64 that the CPU seemed a bit more balanced than the Cell. It almost seemed like the inverse of the Cell with multiple threaded units tied to an FPU. I would guess that the FPU is optimized for double precision operations vs the Cell being optimized for single.
      Does the Cyclops64 support out of order execution?
      Just kind of wondering. My programing is limited to Xscale, Intel, and AMD cpus. The big cool toys fascinates me.
      - Re:Just wait... (Score:3, Insightful)
        
        by Raul654 ( 453029 ) writes:
        
        The compiler [the current version, at any rate] is based on gcc. So it sports the same out-of-order execution you would expect to get from compile-time optimization. I am not sure if it has hardware-based re-ordering. My guess would be that no, it does not, but without the Principles of Operation in front of me, I couldn't say (the advisor borrowed my paper copy for IPDPS 2006 and hasn't given it back yet).
        
        Re:Just wait... (Score:2)
        
        by LWATCDR ( 28044 ) writes:
        
        If the hardware doesn't support reordering wouldn't you get a big performance hit if the you use a gcc's standard optimization? I am just a compiler user not a writer so I could be totaly wrong, But if I don't ask I will never know. Over all it looks very cool but very programer dependant. For a super computer that isn't a terrible thing.
        I also assume that the interger units are basied on the Power ISA.
        
        Re:Just wait... (Score:2)
        
        by Raul654 ( 453029 ) writes:
        
        I don't think the integer units are based on anything. The whole chip is being custom designed from scratch. (Interestingly enough, the VHDL code for the chip is being written by only one guy - the project leader himself).
        
        As far as instruction re-ordering -- for parallel computation, the big peformance hits occur with waits, synchronizations/barriers, and locks/mutexes. Making these cheap and reducing the number of them is the biggest way to increase performance.
        
        Re:Just wait... (Score:2)
        
        by LWATCDR ( 28044 ) writes:
        
        "for parallel computation, the big performance hits occur with waits, synchronizations/barriers, and locks/mutexes"
        Is Cyclops64 using a shared memory system? Most clusters I have seen used message passing. On those systems your bottle necks tend to be in message passing.
        Yes mutexes are a lot of fun. I tend to use mutexes in my code just long enough to make a copy of the data structure for the thread to use. Yes it is cheating and relatively inefficient but it is also pretty safe and keep blocking to a minim
- Re:Just wait... (Score:2)
  
  by Kesch ( 943326 ) writes:
  
  How does it compare to ? [lanl.gov]
  - Re:Just wait... (Score:2)
    
    by Kesch ( 943326 ) writes:
    
    AHH! I forgot to close the HTML tag properly(Note to self: USE PREVIEW!)
    
    The question mark should have the word "Roadrunner" before it.
    
    Also for those who don't want to follow the link. Roadrunner is a supercomputer being developed at Los Alamos National Laboratory with aims to run at a sustained petaflop.
  - Re:Just wait... (Score:2)
    
    by Raul654 ( 453029 ) writes:
    
    The most important sentence in that article: "If a 'go' decision is made to pursue the goal of a sustained petaflop, a final phase would be executed, with plans for completion at the end of 2007" The whole world is racing to build the world's first computer to sustain one petaflop. It's only a matter of time. I'm told the Japanese project (which is already underway) is expected to finish sometime around 2008/2009. Our project, C64, has been going since 1999, and I think it's got a really good shot of being
- - Re:Just wait... (Score:5, Insightful)
    
    by Raul654 ( 453029 ) writes: on Friday June 23, 2006 @05:20PM (#15592660) Homepage
    
    C64 takes a totally different approach to high performance computing. Most supercomputer architectures are built around a moderate to large number of very, very fast (and power-hungry) processors. For example, Big Mac at Virgina Tech had something like 10,000 pentium 4 class processors. Cyclops64 is have an *enormous* number of processors (on the order of a million), but running only at 500 mhz, making them much easier to cool). The idea is to give the programmer more thread units than he knows what to do with, running very close together at a low level.
    
    Parent Share
    twitter facebook
    - Re:Just wait... (Score:2)
      
      by Cicero382 ( 913621 ) writes:
      
      Interesting. But if you use this approach to handle cooling problems, don't you lose processing power due to the effect of inter processor communications?
      
      We use a large(ish) cluster - admittedly not even remotely in your league, but our main limitation has always been node connection bandwidth/lag.
      - Re:Just wait... (Score:2)
        
        by Raul654 ( 453029 ) writes:
        
        I honestly have no idea about inter-chip communication issues. All of our work has focused on intra-chip issues.
  - Re:Just wait... (Score:2)
    
    by ChrisGilliard ( 913445 ) writes:
    
    RTFA, most programs use only 5% of the processing power according to TFA so 57% is pretty good.
    - - Re:Just wait... (Score:2)
        
        by Raul654 ( 453029 ) writes:
        
        You have to ask yourself - 57% of what? Of the peak theoretical performance. Peak theoretical performance is the performance you get under ludicrious circumstances - a continious stream of (only) multiply-and-accumulate instructions. (Each MDAC counts as two floating point operations, even though many architectures impliment it as a single instruction). Not only is it impossible to reach 100% effeciency with any program, but it's nearly impossible to even approach 100% if the program is supposed to do somet
        
        Re:Just wait... (Score:4, Informative)
        
        by Raul654 ( 453029 ) writes: on Friday June 23, 2006 @06:08PM (#15592955) Homepage
        
        What you are describing has already been done, and was done quite a while ago. Around 1990, NASA realized that the way we do parallel benchmarks sucks. The way most benchmarks (including hte parallel ones) work is that some organization posts the code, and people have to compile and run the code as-is. There's not much room there for optimization (other than tweaking the compiler flags, some trivial hardware settings, 'etc), which is essential to getting good parallel performance (because parallel machines vary so widely). So performance was tied very closely to the implimention over which nobody had any control.
        
        NASA approached the problem differently. Their numerical analysis group put out a set of "paper and pencil" benchmarks (based on real world problems that one would encounter, for example, fluid dynamics). The actual implimentation was left up to the individual companies. This is what we know today as the NAS benchmark suite.
        
        Parent Share
        twitter facebook
I suspect the answer ends up (Score:4, Funny)

by jhw539 ( 982431 ) writes: on Friday June 23, 2006 @04:52PM (#15592460)

42.

Share
twitter facebook
- Re:I suspect the answer ends up (Score:2)
  
  by Kesch ( 943326 ) writes:
  
  Since this machine is built with FLOPS in mind the answer is more likely
  
  42.0
- only (Score:2)
  
  by tacokill ( 531275 ) writes:
  
  Only if it's in base-13. [wikipedia.org]
- You are absolutely right! (Score:2)
  
  by mangu ( 126918 ) writes:
  
  FTFA: "It simulates interactions between 1000 molybdenum atoms under high pressure"
  
  And the atomic number of molybdenum [webelements.com] is... 42
Fill in Blank Please (Score:2, Funny)

by Frightening ( 976489 ) writes:

Imagine a _________ cluster of those.

Well done, you may now enter. Gaming room to the right, pron cubicles left, and crazy linux hardware center up ahead.
We hope you enjoy your stay at Geek Heaven.
HPCWire Interview (Score:4, Informative)

by multimediavt ( 965608 ) writes: on Friday June 23, 2006 @04:56PM (#15592499)

http://www.hpcwire.com/hpc/699401.html [hpcwire.com]

There's some additional info about BlueGene and what Livermore thinks of it here. What this interview neglects to mention is the millions of dollars being spent on IBM and internal developers to get this code (and any others) working on BlueGene. I was briefed by the hardware and software teams that built BlueGene and I can tell you, it's no easy task to bring apps to that platform. Kuznezov seems to trivialize it in the interview and I'm gonna have to go back and review the process again. Maybe it has changed since my briefing in early 2004, but somehow I doubt it.

Share
twitter facebook
- Re:HPCWire Interview (Score:2)
  
  by multimediavt ( 965608 ) writes:
  
  Ok, having reviewed the following document, IBM Redbook BlueGene/L: Application Development [ibm.com] I'll lighten my "it's no easy task to bring apps to [this] platform" statement. It does appear that IBM has done some considerable work on the APIs and MPI support. Like any big beasty, there's always something you have to do to code before it will run well. I'd say from having another look, that it's probably no harder to bring apps to BlueGene than it is to bring them to System X running Mac OS X. It ain't a pi
  - Re:HPCWire Interview (Score:2)
    
    by Kupek ( 75469 ) writes:
    
    You may be interested in a paper from PLDI of this year, Shared Memory Programming for Large Scale Machines [ualberta.ca]. They implemented UPC on BlueGene/L, and the paper explains the infrastructure they developed to do that. However, it's not about how to program on BlueGene/L.
wait, only 3 dimensions??? (Score:2)

by Fry-kun ( 619632 ) writes:

i thought there were more dimensions in the subatomic world [wikipedia.org] o_O
Screenshot here (Score:3, Funny)

by ultramk ( 470198 ) writes: <ultramk@noSPAm.pacbell.net> on Friday June 23, 2006 @05:16PM (#15592630)

I wonder what the cubes [ataritimes.com] represent?

Oh, wait. Qbox. Nevermind.

m-

Share
twitter facebook
Why not put that power to good use (Score:2)

by Skraut ( 545247 ) writes:

Like finding The Answer to The Ultimate Question Of Life, the Universe and Everything
- Re:Why not put that power to good use (Score:2)
  
  by Zero__Kelvin ( 151819 ) writes:
  
  Ah, hell ... I could do that on an old TRaSh/80:
  
  10 print "42"
  
  Those who don't know what the hell the parent poster and I are talking about obviously has not read their Douglas Adams [douglasadams.com]!
What are these simulations calculating? (Score:2)

by Vellmont ( 569020 ) writes:

Does anyone know what these calculations are trying to determine? In essence, what's the central problem to determining the reliability of old nuclear weapons? I would have thought they're doing simulations of detonation of these aged weapons, but the article talks about using molybdenum, which isn't a fissile material.
- - Re:What are these simulations calculating? (Score:2)
    
    by Vellmont ( 569020 ) writes:
    
    Huh. I had assumed the unreliability came from increased impurities in the plutonium do to natural decay and spontaneus fision, not radiation affecting the surrounding electronics, explosives, tamper, etc.
- - Re:What are these simulations calculating? (Score:2)
    
    by Fordiman ( 689627 ) writes:
    
    Actually, 2+2==5 for very large values of 2.
Thousands of atoms (Score:2)

by the Atomic Rabbit ( 200041 ) writes:

performs very complex quantum calculations to simulate the behaviour of thousands of atoms in three dimensions.
Sounds impressive, but that's only about a 10 atoms on a side.
- Re:Thousands of atoms (Score:2)
  
  by mangu ( 126918 ) writes:
  
  that's only about a 10 atoms on a side
  Exactly. That only goes to show how much CPUs still have to evolve. Every time someone mentions a new more powerful CPU here in /. there are people who ask "why, what's the use?". For many types of physical simulations, the most powerful CPUs in the world are still pathetically slow.
  And that's also a reason why carefully optimized code in C or Fortran with the inner loops written in assembler is still needed. Java, or Ruby, or Python, or any other interpreted language
A few more iterations (Score:3, Insightful)

by ender_ ( 131275 ) writes: on Friday June 23, 2006 @06:28PM (#15593088) Homepage

Imagine, if you will, taking this super-computing ability out a few years. Can the U.S. justify the invasion of a country X because X successfully simulated an attack on the U.S? Or maybe they just had the computing power to simulate it.

To the UN: We'd like you to look at these satellite images that clearly show a super computer simulating the destruction of the U.S. We have to take out these terrorists and we're willing to go it alone.

Afterward: Well it turns out that they didn't have the computing power at all, the images we had were of a mobile home park.

Share
twitter facebook
Wow indeed (Score:3, Interesting)

by mnmn ( 145599 ) writes: on Friday June 23, 2006 @09:47PM (#15594037) Homepage

Thousands of atoms. Shrodingers/Bohrs equations for all of them.

This has interesting consequences for the study of plastics, DNA, virii and other complex molecules.

Perhaps the program can run in a loop trying every possible atomic combination to produce the best of certain attributes, as in give me the hardest material or give me an easy to manufacture room temp superconductor. It bypasses the whole invention/discovery step.

Share
twitter facebook
- Re:Linux (Score:4, Funny)
  
  by spun ( 1352 ) writes: <loverevolutionary&yahoo,com> on Friday June 23, 2006 @04:39PM (#15592362) Journal
  
  Only after it calculates Bill Gates' taxes and beats Kasparov at chess, apparently.
  
  Parent Share
  twitter facebook
  - Re:Linux (Score:2)
    
    by MrSquirrel ( 976630 ) writes:
    
    The researchers wanted to use the computer to try and understand women, but realized that in order to have a computer answer something, it has to be based on logic.
    - Re:Linux (Score:2)
      
      by Tackhead ( 54550 ) writes:
      
      > The researchers wanted to use the computer to try and understand women, but realized that in order to have a computer answer something, it has to be based on logic.
      > --
      > *insert guitar solo here*
      "She don't read Slashdot, but software makes her Blue Gene talk!"
      - Dr. Hook and the Medicine Show
- Specs (Score:5, Informative)
  
  by neonprimetime ( 528653 ) writes: on Friday June 23, 2006 @04:39PM (#15592372)
  
  Specs here [ibm.com] and yes, Suse
  
  Parent Share
  twitter facebook
  - Re:Specs (Score:2)
    
    by multimediavt ( 965608 ) writes:
    
    Umm...no, the BlueGene system does not run Suse, the head nodes (which are IBM eServer or xServer nodes) run Suse. The BlueGene racks themselves run a stripped down Linux that is more like an embedded Linux with HPC support.
- Re:Linux (Score:2)
  
  by LWATCDR ( 28044 ) writes:
  
  "Does it run Linux(R)?"
  Umm.. Yes as do most supercomputers these days.
  http://www.top500.org/stats/26/osfam/ [top500.org]
  When it comes to supercomputers Linux is the OS of choice these days. Even Mac OS/X has five times the market share of Windows when you talk about the top 500 supercomputers. You see when it comes to doing real work Windows is just a hobbyist play thing. The big boys run Linux.
- !Physics + !CS = CRAP (Score:2)
  
  by woolio ( 927141 ) writes:
  
  The problem I see with projects like BG/L is one of two things a happen:
  
  1) Software is implemented by CS majors who have little understanding of the math and physics involved. They probably implement highly computationally intensive (inefficent) algorithms well (i.e. no bubblesort).
  
  2) Software is implemented by Physics majors who although knowing the syntax of C/fortran, don't understand how to write good programs. Their implementations are numerically correct, but highly inefficient [e.g. they use non-

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Slight clarification (Score:4, Funny)

Re:Slight clarification (Score:5, Funny)

Re:Slight clarification (Score:2)

Couldn't run vista.... (Score:2)

Re:Slight clarification (Score:3, Interesting)

Re:Slight clarification (Score:2, Informative)

Re:Slight clarification (Score:3, Insightful)

Re:Slight clarification (Score:2, Insightful)

Re:Slight clarification (Score:2)

Yeah, but... (Score:3, Funny)

Re:Yeah, but... (Score:5, Funny)

Re:Yeah, but... (Score:4, Funny)

Re:Yeah, but... (Score:2, Insightful)

Re:Yeah, but... (Score:2)

Re:Yeah, but... (Score:3)

Re:Yeah, but... (Score:3, Insightful)

Re:Yeah, but... (Score:3, Funny)

Too bad for Q-box... (Score:4, Funny)

Re:Too bad for Q-box... (Score:2, Funny)

More importantly... (Score:2, Funny)

Only the most intensive USEFUL program (Score:5, Funny)

Re:Only the most intensive USEFUL program (Score:3, Informative)

Re:Only the most intensive USEFUL program (Score:3, Funny)

Re:Only the most intensive USEFUL program (Score:2, Funny)

"Infinite" loops (Score:2)

Re:"Infinite" loops (Score:2)

Re:Only the most intensive USEFUL program (Score:2)

Re:Only the most intensive USEFUL program (Score:2)

Re:Only the most intensive USEFUL program (Score:3, Insightful)

Re:Only the most intensive USEFUL program (Score:2)

Re:Only the most intensive USEFUL program (Score:2)

...wow... (Score:5, Interesting)

Re:...wow... (Score:2)

it doesn't work like that (Score:4, Informative)

Re:...wow... (Score:3, Insightful)

Re:...wow... (Score:5, Informative)

Re:...wow... (Score:2)

Re:...wow... (Score:3, Informative)

Re:...wow... (Score:2)

Re:...wow... (Score:2)

Quantum Monte Carlo (Score:3, Informative)

Re:...wow... (Score:2)

Re:...wow... (Score:2)

Re:...wow... (Score:3, Insightful)

Re:...wow... (Score:3, Insightful)

Re:...wow... (Score:2)

Re:...wow... (Score:2)

Molest me not (Score:5, Funny)

How to test a nuke.. without testing one (Score:4, Insightful)

Re:How to test a nuke.. without testing one (Score:2, Funny)

Re:How to test a nuke.. without testing one (Score:2)

Re:How to test a nuke.. without testing one (Score:2, Interesting)

Re:How to test a nuke.. without testing one (Score:2)

Re:How to test a nuke.. without testing one (Score:2)

Re:How to test a nuke.. without testing one (Score:2)

Re:How to test a nuke.. without testing one (Score:2)

Smart, sure. But is it happy? (Score:5, Funny)

Just wait... (Score:4, Informative)

Re:Just wait... (Score:2)

Re:Just wait... (Score:5, Interesting)

Re:Just wait... (Score:2)

Re:Just wait... (Score:3, Insightful)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:5, Insightful)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:2)

Re:Just wait... (Score:4, Informative)

I suspect the answer ends up (Score:4, Funny)

Re:I suspect the answer ends up (Score:2)

only (Score:2)

You are absolutely right! (Score:2)

Fill in Blank Please (Score:2, Funny)

HPCWire Interview (Score:4, Informative)