Ask Slashdot: How Do I De-Dupe a System With 4.2 Million Files?

HiggsBison Only if you have 100 unique files (440 comments)

If you have 100 files all of one size, you'll have to do 4950 comparisons.

You only have to do 4950 comparisons if you have 100 unique files.

What I do is pop the first file from the list, to use as a standard, and compare all the files with it, block by block. If a block fails to match, I give up on that file matching the standard. The files that don't match generally don't go very far, and don't take much time. For the ones that match, I would have taken all that time if I was using a hash method anyway. As for reading the standard file multiple times: It goes fast because it's in cache.

The ones that match get taken from the list. Obviously I don't compare the ones which match with each other. That would be stupid.

Then I go back to the list and rinse/repeat until there are fewer than 2 files.

I have done this many times with a set of 3 million files which take up about 600GB.
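
For anyone who wants to try it, the loop described above can be sketched roughly like this in Python. This is only a minimal sketch, not the actual script: the names `same_content`, `group_duplicates`, and `BLOCK_SIZE` are made up for illustration, the 64 KiB block size is an assumption, and the input is assumed to be a list of paths that have already been grouped by file size.

```python
BLOCK_SIZE = 64 * 1024  # assumed block size; the comment above does not give one


def same_content(path_a, path_b, block_size=BLOCK_SIZE):
    """Compare two files block by block, giving up at the first mismatch."""
    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        while True:
            block_a = fa.read(block_size)
            block_b = fb.read(block_size)
            if block_a != block_b:
                return False  # mismatch (or one file ended early): stop reading
            if not block_a:
                return True  # both files ended together with no mismatch


def group_duplicates(paths):
    """Group identical files from a list of same-size files.

    Returns (groups, comparisons): each group is a list of paths with
    identical content, and comparisons counts the file-to-file compares.
    """
    remaining = list(paths)
    groups = []
    comparisons = 0
    while len(remaining) >= 2:
        standard = remaining.pop(0)  # first file becomes the standard
        matches, others = [], []
        for candidate in remaining:
            comparisons += 1
            if same_content(standard, candidate):
                matches.append(candidate)  # matched files leave the list
            else:
                others.append(candidate)
        if matches:
            groups.append([standard] + matches)
        remaining = others  # rinse/repeat with the files that did not match
    return groups, comparisons
```

With 100 same-size files that are all unique, this does 99 + 98 + ... + 1 = 4950 comparisons, which is the worst case quoted above. With 100 identical files it does 99 comparisons and stops after one pass.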

more than 2 years ago

F-Secure Report: Another SCADA Attack in Iran — This Time With AC/DC

HiggsBison Evergreen FTW! (253 comments)


Yes, sure, she has a pure, wonderful, beautiful voice, blah blah blah. But that's the point. In my experience, the notes she sings travel hundreds of yards down the corridor and infect everyone's office.

Play it over and over and over and over and over and ... people will be tearing their hair out. We could call it ... I don't know ... the Streisand Effect?

more than 2 years ago

If the Higgs Boson Is Found I'll.....

HiggsBison Re:Higgs Bosun? (253 comments)

Or maybe the Higgs Admiral?

more than 2 years ago

LHC Discovers New Particle That Looks Like the Higgs Boson

HiggsBison Re:Particle That Looks Like the Higgs Boson (396 comments)

Well, we can assume that it wasn't a Higgs Bison. CERN doesn't have a herd. Fermilab still has the advantage there.

more than 2 years ago

Lonesome George Is Dead At 100

HiggsBison Turtle rings? (154 comments)

I thought I remembered reading about this. Um... yeah, it works for turtles, too. Well, young turtles at least. Google for turtle count rings.

more than 2 years ago

Lonesome George Is Dead At 100

HiggsBison Re:Poor bastard... (154 comments)

Not to get too picky, but: I went to Wikipedia to find out what the heck a C. elegans was. (Nematode) Anyway, the split seems to be more like 99.95%/0.05%.

more than 2 years ago

Oracle Sues Lodsys For Patent Trolling

HiggsBison Re:confused (119 comments)

Mutual annihilation would be nice.

more than 2 years ago

The Future of Browser Choice

HiggsBison Uh... Java Icon on article? (188 comments)

This article has a Java Icon. Because "Java, JavaScript, whatever, it's all the same"? Perhaps "mobile" isn't the big threat here.

about 2 years ago

Backdoor In RuggedOS Systems: Infrastructure, Military Systems Vulnerable

HiggsBison It was a typo. (154 comments)

It was supposed to be RiggedOS.

more than 2 years ago

Canadian Telcos Lobby Against Pick-and-Pay TV

HiggsBison Unconvinced (244 comments)

The real answer is that you're not the customer. You're the product

If I'm being billed, I'm a customer.

Maybe I'm not a priority, but I'm still a customer, and if I don't like what I'm getting for the price, then they lose me as a customer.

And if I'm also a product, they lose that too.

more than 2 years ago

TSA 'Warning' Media About Reporting On Body Scanner Failures?

HiggsBison BS? Barbra Streisand? (465 comments)

I call BS.

I was wondering who would bring up the Barbra Streisand Effect first.

So now, Barbra Streisand is a Terrorist!

more than 2 years ago

Researchers Seek Help In Solving DuQu Mystery Language

HiggsBison Seriously! (131 comments)

I'm sure he did write assembly. But Object Oriented assembly?

I'm incredulous that you are incredulous. I thought I saw a book about that somewhere. So I walked over to my tall stack of random language books and there it is:
Object-Oriented Assembly Language, Len Dorfman, McGraw-Hill, 1990

I hereby thwack you upside the head.

more than 2 years ago

Exercise and Caffeine May Activate Metabolic Genes

HiggsBison Mostly right. (148 comments)

and I'd bet that someone's found a transcription factor somewhere that binds to methylated DNA and ...

I believe there are inhibitor regions which will, when not methylated, attract some special-purpose snotball (yeah, I'm gonna call that a technical term) which interferes with transcription. And then when methylated, these inhibitor regions fail to interfere.

more than 2 years ago

Aderall Or Nothing: Anatomy of the Great Amphetamine Drought

HiggsBison Re:You'd think, but... (611 comments)

they opt to make the most money with a supply ordained by the government

Logical, plain and simple. Sure. You don't think the pharmaceuticals had a hand in that "government" decision?

more than 2 years ago

Job Seeking Hacker Gets 30 Months In Prison

HiggsBison Re:$1 mil? Seriously? (271 comments)

Once the notice comes to IT that they've had a break-in you've got an awful lot of work to do.

Of course. Reactive security audits are much more expensive than proactive security audits. Life sucks when you are inept. What he did was inexcusable, but to put all the blame on a script kiddie is just unprofessional. If a criminal organization had broken in, it could have been way more expensive.

Concentrate on fixing the problem, not the blame.

more than 2 years ago

Alzheimer's Transmission Pathway Discovered

HiggsBison What are these 'cures' of which you speak? (154 comments)

While I agree that the pharmaceutical business is a complete disaster area in terms of cures-per-dollar ...

The pharmaceutical industry is not about prevention or cure, they are all about perpetual treatment.

more than 2 years ago

SpaceX Tries Out Its New SuperDraco Rocket Engine

HiggsBison Re:Names... (118 comments)

And when you make something that's like the thing with the cool name, but way above it, "Super" is often applied.

Meh. I'll wait for the SuperDuperDraco.

more than 2 years ago

America's Future Is In Software, Not Hardware

HiggsBison Oh, there's a leap in logic (630 comments)

If they are so highly skilled and have so much experience, why don't they start new companies ...

How does being a skilled, experienced computer scientist suddenly make me a good entrepreneurial business manager?

And where will all these unemployed software engineers get their start-up money?

more than 2 years ago

High School Students Send Lego Man 24 Kilometers High

HiggsBison Lego Man... (115 comments)

Lego Man, Lego Man, does whatever a Lego can...

more than 2 years ago

Cinnamon Gnome-Shell Fork Releases Version 1.2

HiggsBison Cinnamon... (81 comments)

"It's cinnamonnamony!"--The Swedish Chef

more than 2 years ago


