Google Acquires Metaweb

Follow Slashdot blog updates by subscribing to our blog RSS feed

Google Acquires Metaweb 63

Posted by Soulskill on Friday July 16, 2010 @04:33PM from the nothing-to-do-with-neal-stephenson dept.

eldavojohn writes "A startup called Metaweb (looks like an ontological, entity-based approach to Web 2.0 tagging) has been acquired by Google. You can find out what they're about from a super marketing fluff video they put together. The neat thing about Metaweb is that the database of entities it has is free. Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?"

This discussion has been archived. No new comments can be posted.

Google Acquires Metaweb

Load All Comments

Search 63 Comments Log In/Create an Account

Comments Filter:

Silly Logic (Score:5, Insightful)

by Monkeedude1212 ( 1560403 ) writes: on Friday July 16, 2010 @04:35PM (#32931708) Journal

Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?"
If Metaweb doesn't work at Google's Scale, then it couldn't compete with them.

Share
twitter facebook
- Re: (Score:1, Insightful)
  
  by Svartalfar ( 867908 ) writes:
  
  This wouldn't be the first time a large company bought a start up to prevent it from possibly ever becoming a competitor though. Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
  - Re: (Score:1)
    
    by jbell730 ( 1586063 ) writes:
    
    Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
    See also: Microsoft.
  - Re: (Score:2)
    
    by yppiz ( 574466 ) * writes:
    
    This is a good point. I don't know if this was a factor in Google's acquisition, but Powerset (acquired by Microsoft and now part of Bing) uses Metaweb's Freebase.
  - Re: (Score:2)
    
    by RockDoctor ( 15477 ) writes:
    
    Better to spend peanuts now to buy the possibility then spend a fortune later to fight off a competitor you didn't see coming.
    If you didn't "see it coming" in five years from now, then you either didn't know of it's existence today, or you made an error of judgment today, or you forgot tomorrow what you knew of today. In which cases you still deserve to die (in a corporate sense) for being incompetent.
    Sounds like this company/ idea has been around a while and Google have decided that their ideas/ technologi
- - Re:Silly Logic (Score:4, Funny)
    
    by Monkeedude1212 ( 1560403 ) writes: on Friday July 16, 2010 @05:50PM (#32932696) Journal
    
    I'm not sure how you got from "Machines automatically creating and ranking indexed entities" to "Destroy all humans" but I'm sure it involved an illegal substance.
    
    Parent Share
    twitter facebook
    - Re:Silly Logic (Score:4, Funny)
      
      by Jurily ( 900488 ) writes: <jurily&gmail,com> on Friday July 16, 2010 @05:52PM (#32932734)
      
      I'm not sure how you got from "Machines automatically creating and ranking indexed entities" to "Destroy all humans" but I'm sure it involved an illegal substance.
      It's the AI that was supposed to index 4chan.
      
      Parent Share
      twitter facebook
    - Re: (Score:2)
      
      by nobodylocalhost ( 1343981 ) writes:
      
      Simple, currently humans are integrated in this system. But we will eventually be phased out in favor of better performing and more efficient machines that don't generate as much repeated junk data (aka social media). It's a natural upgrade process.
      - Re: (Score:2)
        
        by Monkeedude1212 ( 1560403 ) writes:
        
        But the system is FOR humans - there'd be no need for a system without humans - I think thats the part of your theory that fails entirely. AI cannot be self serving.
- Re: (Score:2)
  
  by Itninja ( 937614 ) writes:
  
  I always thought "Ontological" was a synonym for a useless philosophy degree.
  - Re: (Score:3, Informative)
    
    by infinitelink ( 963279 ) writes:
    
    "Ontological" is an essential adjective for describing different aspects of knowledge (science); ontology for ordering it.
  - Re: (Score:2)
    
    by thoughtsatthemoment ( 1687848 ) writes:
    
    Well, a thousand years ago computer science would also have been a synonym for a useless philosophy degree.
- Re: (Score:2)
  
  by Fnkmaster ( 89084 ) writes:
  
  The problem is that nobody wants to express information through RDF tuples and ontologies. Instead, they express information in human-readable text, with structural and visual markup. Search technology has come very far in terms of figuring out what information we actually want, with things like personalization, disambiguation (see DuckDuckGo for example), shopping/product search, and so on. All this stuff can be teased out of traditional web content with far less effort than trying to get every company
  - Re:"Ontological" is a synonym for failure. (Score:5, Insightful)
    
    by LarryRiedel ( 141315 ) writes: on Friday July 16, 2010 @05:12PM (#32932264)
    
    stuff that requires hard AI or tons of human labor and thus won't be happening any time soon.
    
    Wikipedia.
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by Fnkmaster ( 89084 ) writes:
      
      ... which is exactly what DuckDuckGo uses as its data source to handle disambiguation. But Wikipedia is structured for humans and features a large volume of knowledge in human language form with some basic markup. It's not a bunch of information encoded in RDF tuples. Thus my point. Trying to get everybody on the web to re-encode the vast body of knowledge out there in RDF, explicitly referencing ontologies is a setup for failure. Sure, you might use some sort of tuple format to internally store inform
    - Re: (Score:2)
      
      by Hurricane78 ( 562437 ) writes:
      
      Aaahhh Wikipedia... the idea of collecting “facts” by determining how many idiots did not disagree.
      Or in other words: Argumentum ad populum hard at work.
  - Re: (Score:2)
    
    by mandelbr0t ( 1015855 ) writes:
    
    This is why "ontologies" have become synonymous with fail.
    So you're saying that Google bought a failure to save the rest of the world from it? It's the "tons of human labor" part that becomes the issue; it's bad enough trying to teach a human about semantics, let alone a pedantic automaton. Wake me up when an AI can disambiguate without me spending 45 minutes explaining the basics of English language.
  - Re: (Score:2)
    
    by yppiz ( 574466 ) * writes:
    
    Early on, we knew we'd have to make a UI so that users could have as close to a free-text experience as possible while still contributing structured data. Freebase lets you create a topic that is generic, and then co-type it with multiple specific types later. It allows ontology geeks to do their thing, and regular users to just work where they are comfortable. It's a tough balance to strike, but Metaweb's Freebase was populated by a small team of data wranglers using a mix of automated methods and coordina
Didn't see it coming. (Score:5, Funny)

by pushing-robot ( 1037830 ) writes: on Friday July 16, 2010 @04:37PM (#32931740)

Everyone was thinking Google would take over the Web, and here they skip right past it and acquire the Metaweb.
Well played, Google, well played.

Share
twitter facebook
- Re: (Score:2)
  
  by Monkeedude1212 ( 1560403 ) writes:
  
  Before you know it, they'll move on from the web and acquire the mesh!
- Re:Didn't see it coming. (Score:5, Interesting)
  
  by IAmGarethAdams ( 990037 ) writes: on Friday July 16, 2010 @04:44PM (#32931876)
  
  I've been using Freebase integrations on a couple of sites, and the possibilities Freebase already offers for rich metadata integration is HUGE.
  For example, a couple of their simple API samples are a list of Police songs from the Synchronicity album, ordered by track length [freebase.com], or Graduates of Stanford born since 1960 who are board members of companies [freebase.com]
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by Spazntwich ( 208070 ) writes:
    
    Holy shit that is cool.
    I wonder if investigative reporters will be able to utilize this. Datamining for the little guy.
  - Re: (Score:3, Insightful)
    
    by Joe Tie. ( 567096 ) writes:
    
    Reading over all of this, I've been wondering what the hell metaweb is. Your couple sentences explained it better than the pages of text leading up to it by others. Showing, in this case, seems far better to telling in order to properly describe it. And holy shit is that awesome!
    - Re: (Score:2)
      
      by IAmGarethAdams ( 990037 ) writes:
      
      Well, Freebase is just an application of the Metaweb technologies. However, the storage and organisation of data (which is what the core of Metaweb is geared around) is useless without any means of retrieval
      - Re: (Score:2)
        
        by yppiz ( 574466 ) * writes:
        
        Freebase makes their data available as free CC-licensed data dumps. You can import this into any database you want. There's no requirement for Metaweb's technology to use the data. It's just a very convenient way to do so!
  - Re: (Score:2, Funny)
    
    by Anne_Nonymous ( 313852 ) writes:
    
    Once you start freebasing, you just can't stop.
- Re: (Score:2)
  
  by Frankie70 ( 803801 ) writes:
  
  Everyone was thinking Google would take over the Web, and here they skip right past it and acquire the Metaweb.
  Well played, Google, well played.
  Well, they could have got the web for free - http://www.free-web.org/ [free-web.org]
I never metaweb (Score:2, Funny)

by abbynormal brain ( 1637419 ) writes:

i didn't' like.
Rehab (Score:5, Funny)

by masterwit ( 1800118 ) * writes: on Friday July 16, 2010 @04:45PM (#32931896) Journal

Will Google be able to make Metaweb work on their omniscient scale, or was this just Google making sure a startup doesn't become yet another player in search?
Wrong and wrong, you see Google is freebasing [freebase.com] now:

The web isn’t merely words[, or water-soluble,] it’s information about things in the real world, and understanding the relationships between real-world entities...
Sometimes you have to give it a good ole "smoke-test" to see the possibilities...Google should be careful though, the path they have chosen is a slippery slope!

Share
twitter facebook
For a web 2.0 company (Score:2)

by TyFoN ( 12980 ) writes:

They sure have an ugly web page.
- Re: (Score:2)
  
  by Tumbleweed ( 3706 ) * writes:
  
  For a web 2.0 company ... They sure have an ugly web page.
  Okay, two jokes come to mind right away:
  1) That's why _Google_ bought them!
  2) You already said 'web 2.0'; you don't need to say 'ugly' when you've said that.
Palantir (Score:1)

by Squib ( 124309 ) writes:

Looks like this may be a way to make a play for competition in homeland security and business support, like Palantir [palantirtech.com] has done plus medical data tracking, and other possible extrapolations
I'm fairly sure it's not going to be used for just generating websites.
Expanding reach (Score:2, Interesting)

by ceraphis ( 1611217 ) writes:

Slowly but surely google continues to acquire startups and expand their business. Not that I mind it that much in Google's case but isn't this the type of thing that Microsoft or AT&T eventually got hammered for?

Legitimately wondering if Microsoft and AT&T did it much more dastardly or if there's no significant comparison whatsoever.
- Re: (Score:3, Interesting)
  
  by thoughtsatthemoment ( 1687848 ) writes:
  
  In this case, Google is trying to enhance their core business that is search. The way we search on the internet is still quite primitive and it's also some kind of brute force. I bet all search engine providers are working on making their engines more intelligent and the result will ultimately decide which one will be the last one standing.
I hope they keep working on Gridworks (Score:2)

by xenocide2 ( 231786 ) writes:

One of the challenges with generating and using data sets is cleaning them up. Data entry errors, OCR failures, conflicts between multiple sources, etc. make it a pain to search and summarize data. Gridworks [google.com] helps me hunt down bad records and normalize fields. If it keeps improving, people might start using it before publishing their crap data.
Freebase (Score:2)

by Haffner ( 1349071 ) writes:

Was anyone else amused by this? (RTFA)
Something Alta Vista had Google does not... (Score:5, Interesting)

by wowbagger ( 69688 ) writes: on Friday July 16, 2010 @05:32PM (#32932496) Homepage Journal
In a way, I miss Alta Vista, in that they had a few things that Google does not:
- NEAR operator (require the phases occur close in the page, which helped to eliminate the "pile of unrelated stuff" pages)
- proper Boolean operators in the search, with arbitrary complexity (e.g. "((pre-emergent OR preemergent) AND herbicide AND liquid) AND NOT gluten")
- and the thing that makes this post on-topic: Alta Vista had a search mode where-in you could refine your search by it presenting a set of additional search terms that helped qualify the meaning of what you searched for.
Say you searched for "wine", and activated that mode. It would present you with some possible extra terms you could search on, such as "white", "red", "tannic", "windows", "microsoft", "emulator".
Were you to be searching for the fermented beverage, you could select "red", "white", "tannic" and so on.
Were you searching for the ABI adapter package, you could select "windows", "Microsoft", and "emulator" (which yes, Wine is NOT...)
I'd love to see Google add that sort of refinement, ideally "learning" what sorts of terms go with what (Wine + tannic = beverage, wine + OLE = software).
Share
twitter facebook
- Re: (Score:2)
  
  by morcego ( 260031 ) writes:
  
  I wish they would just allow us to use regular expressions and be done with it ...
  - Re: (Score:1)
    
    by IAmGarethAdams ( 990037 ) writes:
    
    Well, that might work if indexes were stored as full text representations of a string
  - Re:Something Alta Vista had Google does not... (Score:4, Interesting)
    
    by gilleain ( 1310105 ) writes: on Friday July 16, 2010 @06:00PM (#32932828)
    
    I wish they would just allow us to use regular expressions and be done with it ...
    There's a good reason why not - because of regex DDOS [wikipedia.org] with people inputting "N(o|oo)" to match "Nooooooo....ooooo!" (or similar).
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by martin-boundary ( 547041 ) writes:
      
      That's easy to fix by putting in a hard limit on the size of the returned text (yes, that breaks the principle of having a general regex). The real difficulty is efficiently building and maintaining an index that allows actual regex searching, such as (variations on) a suffix array [wikipedia.org]. Even for a small document collection, that sort of thing eats up a lot of resources.
    - Re: (Score:2)
      
      by Hurricane78 ( 562437 ) writes:
      
      If it is known, it can be prevented. If it can be prevented, your argument is invalid.
- Re: (Score:2)
  
  by omar.sahal ( 687649 ) writes:
  
  Alta Vista had a search mode where-in you could refine your search by it presenting a set of additional search terms that helped qualify the meaning of what you searched for.
  This is in Google! I typed wine and if gave me a whole bunch of choices, like wine tasting and so on. Turn on JavaScript and it should work
  - Re: (Score:3, Insightful)
    
    by wowbagger ( 69688 ) writes:
    
    It's not quite the same. The Alta Vista approach grouped the tags - it would have grouped "tasting" with "red" and "white", while grouping "OLE" and "DirectX" in a separate grouping. Moreover, it was smart enough to use that grouping to allow you to select the whole group.
    Thus, Alta Vista was better able to detect that sometime "wine" means a beverage, and sometimes software, and that the two concepts are different.
    Google still has trouble understanding that the fermented liquid and the software aren't the
- Re: (Score:3, Informative)
  
  by Joe Snipe ( 224958 ) writes:
  
  http://www.google.com/advanced_search?hl=en [google.com]
- Re: (Score:2)
  
  by Jimmy King ( 828214 ) writes:
  
  Ah hah! Thank you! Every couple of months I find myself trying to remember which search engine used to offer the proper, complex boolean search functionality. Occasionally I even make the rounds through all of the old search engines (the ones that haven't just become aggregators of other search engines, anyway) and give it a try in case I can stumble upon it. I guess I'm out of luck on that.
From the early days to acquisition! (Score:2, Interesting)

by yppiz ( 574466 ) * writes:

I was on the founding team at Metaweb when we spun out of Applied Minds. I can answer some questions here, but first I wanted to congratulate the team that brought this company all the way to acquisition.
So, from the beginning we knew that semantic this and ontology that would be a non-starter for most contributors from Planet Earth. While Freebase is a complex system under the hood, the user interface makes contributing data to an existing type (schema) pretty easy. You can add content from a browser windo
- Re: (Score:2)
  
  by beakerMeep ( 716990 ) writes:
  
  Pinging ganymede.cs.brandeis.edu [129.64.2.21] with 32 bytes of data: Reply from 129.64.2.21: bytes=32 time=24ms TTL=47 Reply from 129.64.2.21: bytes=32 time=24ms TTL=47 Reply from 129.64.2.21: bytes=32 time=26ms TTL=47 Reply from 129.64.2.21: bytes=32 time=25ms TTL=47 Ping statistics for 129.64.2.21: Packets: Sent = 4, Received = 4, Lost = 0 (0% loss), Approximate round trip times in milli-seconds: Minimum = 24ms, Maximum = 26ms, Average = 24ms
  /Sorry, dont actually need but this is slashdot I
  - Re: (Score:2)
    
    by yppiz ( 574466 ) * writes:
    
    Pong!
Alternative Headline (Score:1)

by bestadvocate ( 816742 ) writes:

Look forward to Freebasing with Google!
Google has it. (Score:1)

by NeoXon ( 1718618 ) writes:

All your (free)base are belong to us.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Silly Logic (Score:5, Insightful)

Re: (Score:1, Insightful)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re:Silly Logic (Score:4, Funny)

Re:Silly Logic (Score:4, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3, Informative)

Re: (Score:2)

Re: (Score:2)

Re:"Ontological" is a synonym for failure. (Score:5, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Didn't see it coming. (Score:5, Funny)

Re: (Score:2)

Re:Didn't see it coming. (Score:5, Interesting)

Re: (Score:2)

Re: (Score:3, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2, Funny)

Re: (Score:2)

I never metaweb (Score:2, Funny)

Rehab (Score:5, Funny)

For a web 2.0 company (Score:2)

Re: (Score:2)

Palantir (Score:1)

Expanding reach (Score:2, Interesting)

Re: (Score:3, Interesting)

I hope they keep working on Gridworks (Score:2)

Freebase (Score:2)

Something Alta Vista had Google does not... (Score:5, Interesting)

Re: (Score:2)

Re: (Score:1)

Re:Something Alta Vista had Google does not... (Score:4, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3, Insightful)

Re: (Score:3, Informative)

Re: (Score:2)

From the early days to acquisition! (Score:2, Interesting)

Re: (Score:2)

Re: (Score:2)

Alternative Headline (Score:1)

Google has it. (Score:1)

Related Links Top of the: day, week, month.

Slashdot Top Deals