×

Announcing: Slashdot Deals - Explore geek apps, games, gadgets and more. (what is this?)

Thank you!

We are sorry to see you leave - Beta is different and we value the time you took to try it out. Before you decide to go, please take a look at some value-adds for Beta and learn more about it. Thank you for reading Slashdot, and for making the site better!

Comments

top

Common Crawl Foundation Providing Data For Search Researchers

Gumber Re:Saves you on bandwidth (61 comments)

Bitch moan, bitch moan. If I had a need for such a dataset, I think I'd be damn grateful that I didn't have to collect it myself. As for the cost of processing the pages, the article suggests that running a hadoop job on the whole dataset on EC2 might be in the neighborhood of $100. That's not that costly.

more than 3 years ago
top

Common Crawl Foundation Providing Data For Search Researchers

Gumber Re:Interesting, however (61 comments)

It may or may not be a small part of the problem, but it isn't a small problem to crawl that many web pages. This likely lets people save a lot of time and effort which they can then devote to their unique research.

Maybe it will cost a fortune to analyze that much data, but there isn't really anyway of getting around the cost if you need that much data. Besides, for what its worth, the linked article suggests that a hadoop run against the data costs about $100. I'm sure the real cost depends on the extent and efficiency of your analysis, but that is hardly "a fortune."

more than 3 years ago
top

Common Crawl Foundation Providing Data For Search Researchers

Gumber Re:Is this an Amazon sponsor thingy? (61 comments)

A conspiracy? You're going to have to pay someone for the compute time. It's not like a lot of people have big clusters lying around, so lot of people are going to opt to pay Amazon anyway.

As for selling access to the data on physical media, it doesn't look like there is anything to stop you from taking advantage of Amazon's Export Service to get the data set on physical media.

more than 3 years ago
top

Common Crawl Foundation Providing Data For Search Researchers

Gumber Re:Is this an Amazon sponsor thingy? (61 comments)

I don't get it. You are going to have to pay someone if you want to do any research on it. If you don't want to pay Amazon you could either crawl the data yourself, or pay the cost of transferring the data out of Amazon's cloud.

more than 3 years ago
top

Fedora Aims To Simplify Linux Filesystem

Gumber Re:When do we get compression? (803 comments)

Well, putting aside the fact that you are talking about filesystem internals, and the OP is talking about conventions for filesystem layout:

Disks are really big these days. The things people tend to fill them with are images, video and audio that is already in a compressed format. So, for the average user, directory compression isn't going to be a big win.

To put it more succinctly, this isn't an important filesystem feature.

more than 3 years ago
top

Boeing To Deliver First 787 Today

Gumber Re:I would be a bit worried to fly in this plane. (366 comments)

Aeronautical engineers involved with civillian passenger aircraft seem to have an appropriately conservative attitude about risk. That doesn't mean that there won't be problems when they try to innovate, but I have a hard time imagining that the 787 will actually go into commercial service without thorough vetting. Its still a new design, of course, and problems will be discovered and fixed once the aircraft are in regular use.

It may be less safe than, say, an older model with more real world use, or a new model with less ambitious design and technology, but it is less safe from a baseline with a remarkable degree of safety.

more than 3 years ago
top

Newly Digitized Film Shows Ed Catmull's 3D Graphics From 1972

Gumber Re:Disappointing lack of technical details. (95 comments)

Thanks for the clarification. I'd misread the 2.5 minute time as being the total throughput, not the time it took to output a single completed frame, but rereading the paper, it seems like it is indeed the time to expose a 1024x1024 frame. Its unclear to me how long the computation took.

more than 3 years ago
top

Newly Digitized Film Shows Ed Catmull's 3D Graphics From 1972

Gumber Re:Disappointing lack of technical details. (95 comments)

I dug into the technical details a bit and posted some of what I found on my blog, along with links to the papers describing the hand and facial animation work in more detail: http://geekfun.com/2011/09/03/early-cgi-animation-by-ed-catmull/

The short answer is that the facial animation was produced by software written in Fortran and run on a pair of PDP-10s, and the hand animation was likely running in the same environment. When each frame was finished, it was displayed on a CRT and captured to film using a 35mm animation camera. For the facial animation, each frame took about 2.5 minutes to render.

more than 3 years ago
top

Why Warriors, Not Geeks, Run US Cyber Command Posts

Gumber Um, what about all the gamers? (483 comments)

What about the hundreds of thousands of geeks who have been refining their command of strategy and tactics since they were old enough to hold a mouse?

I can tell you one thing, the US is f-ucked in the event of a major cyberattack if someone as clueless as this clown is in charge.

more than 4 years ago
top

Man Attacked In Ohio For Providing Iran Proxies

Gumber Beware Agent Provocateurs (467 comments)

Assuming this story is true, I'd be concerned that this is an attempt to draw the US Government into a confrontation that will help the hard-liners in Iran. As for who would want such a thing.

Clearly the hard-liners would like to try, once again, to get people to rally behind them in the face of "the great satan." You'd also have to look at the US Neocons, many of whom would like to remove any sympathy for Iran or Iranians that gets in the way of their long-disgraced axis-of-evil BS. And then there is Israel. At least some in Israel are on the same page as the neocons, though I wouldn't want to suggest that their position is universally held.

Anyway, I'm suspicious of the motives of anyone who wants to use this as anything but a reason to get the cops and/or FBI on the case.

more than 5 years ago
top

Getting a Grip on Google Code

Gumber Good for Django (91 comments)

This is a nice design win for Django as a web framework. I wonder how much of the stack he ended up using and whether he used the ORM layer at all.

about 8 years ago

Submissions

Gumber hasn't submitted any stories.

Journals

Gumber has no journal entries.

Slashdot Login

Need an Account?

Forgot your password?