Integrating Wikipedia With a Local Intranet Wiki

Become a fan of Slashdot on Facebook

Integrating Wikipedia With a Local Intranet Wiki 121

Posted by samzenpus on Thursday July 16, 2009 @02:40AM from the mix-and-match dept.

An anonymous reader writes "I work for a large company taking a preliminary look at developing an honest-to-goodness wiki. We have tried to launch a company-wide wiki before, but with little success. The technical domains of each part of the company are different, thus each article needs a good deal of background to be useful. Of course, due the proprietary nature of our work we cannot share our articles outside of the intranet. What we would like to do is leverage existing wikis by augmenting our internal wiki with an external wiki. When a user accesses Wikipedia from inside our intranet, they receive the wikipedia content, plus the local domain specific information. For example, links to company-specific wiki pages would be available in Wikipedia pages. Has anyone else tried to do something like this? I know it sounds like a logistical nightmare; are there any thoughts on how to make this successful?"

This discussion has been archived. No new comments can be posted.

Integrating Wikipedia With a Local Intranet Wiki

Load All Comments

Search 121 Comments Log In/Create an Account

Comments Filter:

URLs (Score:2, Funny)

by BadAnalogyGuy ( 945258 ) writes:

URLs. Look into it.
- Re: (Score:3, Insightful)
  
  by smallfries ( 601545 ) writes:
  
  Noise.
  It's a good place to bury the signal.
- Re: (Score:1, Offtopic)
  
  by CarpetShark ( 865376 ) writes:
  
  Hahhah, 100% accurate, and nicely put :D
- Re: (Score:3, Interesting)
  
  by S77IM ( 1371931 ) writes:
  
  Said in a crude way; but to the OP: This guy is right. The most brain-dead simple way to make this work is to just set up your own wiki, and pepper it liberally with links to relevant Wikipedia pages. As someone below points out, there's even a feature in MediaWiki to make this linking easier (look up "InterWiki" in the MediaWiki help).
  You may even be able to set up #REDIRECTS using InterWiki links so that people can still see the page names you want in your search and category listing, and then be taken
  - Re: (Score:2)
    
    by TheRaven64 ( 641858 ) writes:
    
    If you want a simpler solution and have a few tens of GBs of space to spare, then you can just download a snapshot of Wikipedia and use that as the base for your wiki. You won't get any future articles, but you'll get the current ones.
    On the other hand, I don't really see the point. Is it really hard to read both the wikipedia page and the local page?
    - cron + rsync + tar (Score:2)
      
      by itomato ( 91092 ) writes:
      
      Every organization needs their own, up to date version of . [wikipedia.org]
      But seriously, process the SQL dump when you retreive a monthly (quarterly?) update. Generate a set of strings that are relevant to your organization, and strip articles that don't match.
      Someone can always visit the upstream site, or you can use the interwiki facilities, as mentioned elsewhere.
      - Re: (Score:2)
        
        by broeman ( 638571 ) writes:
        
        dammit, now I just HAD to know the history of hot dogs and wasted an whole hour in studying adjoining links.
      - Re: (Score:2)
        
        by collinstocks ( 1295204 ) writes:
        
        You could probably pretty easily write an extension for mediawiki that attaches to the 'ArticleAfterFetchContent' hook and augments the page with content fetched on the fly from Wikipedia. That would be easy enough to do. Just make sure that when the user is editing the page, the function you attach to the hook does not activate (otherwise you will end up saving the wikipedia content into your page, and it will be there twice when a user visits the page).
  - Re: (Score:2)
    
    by Nefarious Wheel ( 628136 ) writes:
    
    Said in a crude way; but to the OP: This guy is right
    Agree with S77IM in whole. I've put together several Wikis for corporate use. URL's are magic. Aggregators aren't quite that simple, and the ones we tend to see from casual Google searches are almost universally held in contempt. Don't go there.
    The company I work for settled on Confluence because we insisted on attribution and integration with our global AD (by "Global" I mean "about 40 countries"). It isn't all that bad. Stylistically and for tracking I prefer Wikimedia, but in an engineering and SI f
bad idea (Score:5, Interesting)

by uepuejq ( 1095319 ) writes: on Thursday July 16, 2009 @02:47AM (#28713343) Homepage

create a firefox addon that downloads a master list of wikipedia urls to add a link to the intranet site to. you can use regular expressions to parse the wikipedia source so that your link is consistently placed. the master list can be updated at will, and could probably be filled the first time with a simple database request. or something.

Share
twitter facebook
- Re:bad idea (Score:5, Informative)
  
  by jayminer ( 692836 ) writes: on Thursday July 16, 2009 @02:51AM (#28713361) Homepage
  
  Good idea. You can even use an existing add-on, Greasemonkey to do this.
  
  https://addons.mozilla.org/en-US/firefox/addon/748 [mozilla.org]
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by Max Romantschuk ( 132276 ) writes:
  
  I wholeheartedly agree with the parent. Your best bet at doing this well is doing this as dynamically as possible. Scraping web pages is a huge pain. Building an extension to detect when you're visising wikipedia and inject something into the page is a hell of a lot simpler.
  Another poster suggested greasemonkey. I haven't used it myself, but I suspect it would make sense to develop a prototype with greasemonkey first. It might well be that a custom extension is not needed at all.
  Also, Firebug is your friend
  - Re: (Score:2)
    
    by Canazza ( 1428553 ) writes:
    
    A well written Javascript Bookmarklet will do the job too. You likely don't even need Greasemonkey, and it can be made cross-browser
  - Re: (Score:2)
    
    by Jane Q. Public ( 1010737 ) writes:
    
    Scraping web pages is not so bad... I have been doing it for years. But in this case it is entirely unnecessary.
    
    I know of at least two ways this could be done, neither of which is nearly as much work as this would seem at first. First, did you know that the entirety of the content of Wikipedia is downloadable, in different formats? You can get everything, or just the current articles without the history (much smaller), and there are other options as well. While there is a lot of data, it is really not th
    - Re: (Score:2)
      
      by Canazza ( 1428553 ) writes:
      
      If we're talking about redirects, it would be quite easy to generate a 404 page that would redirect you to the Wikipedia page, either through a link or as a straight redirect. Or, if you can, use .htaccess and set up redirect rules there (it's the way wikipedia works anyway AFAIK, it just means adding more rules to your existing one)
      - Re: (Score:2)
        
        by Jane Q. Public ( 1010737 ) writes:
        
        That's true, it just seemed to me it be made to look more seamless if it were done via the API. You could put your own look on it.
    - Re: (Score:1)
      
      by uepuejq ( 1095319 ) writes:
      
      i think you really get into too much development and design effort when such a simple task can be accomplished by using resources that are already available. wikipedia already exists, and they already have servers to handle the search requests. i still say developing a small application (whether it's some javascript, a new add-on, or something that already exists) to add links to wikipedia entries that have alternate internal wiki entries, since according to the parent question the employees are going to
      - Re: (Score:1)
        
        by uepuejq ( 1095319 ) writes:
        
        jesus i hate when i start sentences, find something else to think about, then never oh look a flashing light
        
        Re:bad idea (Score:4, Funny)
        
        by dyingtolive ( 1393037 ) writes: <[gro.erihrofton] [ta] [ttenra.darb]> on Thursday July 16, 2009 @09:37AM (#28715777)
        
        i'd mod you funny
        if
        you didnt read like
        e.e. cummings
        
        Parent Share
        twitter facebook
    - Re: (Score:1)
      
      by JuzzFunky ( 796384 ) writes:
      
      Check out the Media Wiki API [mediawiki.org]
      Here's a detailed howto on creating a Bot [wikipedia.org].
  - Re: (Score:1)
    
    by fedxone-v86 ( 1080801 ) writes:
    
    Also, Firebug is your friend.
    Have you just woken up from cryostasis*, too?
    And you know the craziest thing I've heard about Firebug? Allegedly, people also use it to debug web applications written in JavaScript! Applications... On the web... In JavaScript...
    What's next? Apple running on Intel? Bill Gates becoming a humanitarian?
    ----
    *) I was frozen just before GWB started WWIII and thawed after the Blacks won :D
    **) Thank you, thank you. I'll be here all night.
Download it (Score:2, Informative)

by Anonymous Coward writes:

http://en.wikipedia.org/wiki/Wikipedia_database
Download their database, put it into your system, and you're set.
- Re: (Score:1)
  
  by paulatz ( 744216 ) writes:
  
  It does not include images, and all the integration with Wikimedia Commons, the Wiktionary and other projects. Last but not least, it does not update as wikiepdia is edited.
Solution (Score:5, Informative)

by Z34107 ( 925136 ) writes: on Thursday July 16, 2009 @02:57AM (#28713399)

Perhaps the easiest thing to do would be start with a complete dump of Wikipedia and add your own stuff to it. Their database dump page is here [wikipedia.org].
It is 2.8TB, however. They allude to a "Wikipedia API" for working on a "random subset" of Wikipedia; maybe that would be helpful too.

Share
twitter facebook
- - Re: (Score:2, Informative)
    
    by negge ( 1392513 ) writes:
    
    Why use a dump from early last year when you can have yesterdays (http://download.wikimedia.org/enwiki/latest/)?
    - Re: (Score:2, Funny)
      
      by BadAnalogyGuy ( 945258 ) writes:
      
      Have you *seen* the latest?
      I'd much rather have something that's been vetted a couple
      YOU'RE A FAG LOL
      - Re: (Score:1, Insightful)
        
        by MaskedSlacker ( 911878 ) writes:
        
        Your Karma must be shit BadAnalogyGuy.
        Why would anyone one commit be less vetted than any other commit? The old commits don't get new edits merged into them. A commit from year ago is no less likely to have vandalism present than the commit from yesterday. It will just be different vandalism.
        
        WHOOOOOOOOOSH (Score:1)
        
        by CrashandDie ( 1114135 ) writes:
        
        n/t
  - Re: (Score:2)
    
    by MaskedSlacker ( 911878 ) writes:
    
    That's the compressed version. The meta-history file (compressed:17GB) decompresses to 2.8TB on its own. Assuming the same compression ratio (likely not a valid assumption) the articles file would decompress to 500GB, give or take.
  - Re: (Score:2)
    
    by sonamchauhan ( 587356 ) writes:
    
    > It's not 2TB, it's only 3.2gb. You need enwiki-20080103-pages-articles.xml.bz2,
    > from http://www.archive.org/details/enwiki-20080103 [archive.org]
    i recall reading somewhere the unzipped size of wikipedia was 1-2 TB... not sure about this file though
- Re:Solution (Score:5, Interesting)
  
  by mcrbids ( 148650 ) writes: on Thursday July 16, 2009 @03:25AM (#28713563) Journal
  
  Dumps go stale, Wikipedia is updated all the time. I'd suggest something a bit more dynamic.
  I did something similar (conceptually) as a dynamic help system for our web-based application, and had content in a wiki based on the URL of the page where the help message was to apply. In my case, clicking the "help" button on a page would make a proxy call to a private wiki to get the help menu content. If none was found, an email was sent to support desk and the end-user was given a web-chat prompt to tech support (with the URL prepended so that tech support could jump in, answer the questions, and write the help menu in one fell swoop)
  In your case, start with your local wiki. Presumably you have some stuff in there already. Rename the articles as necessary to match URLs from Wikipedia.
  Then, build a simple proxy server that rewrites wikipedia content to include a header of your local content. Probably 100 lines (or so) of glue code, and anywhere from a few man-hours to a few man-days coding.
  The rest is all training.
  
  Parent Share
  twitter facebook
  - Re: (Score:1)
    
    by raistlinwolf ( 1365893 ) writes:
    
    Dumps go stale
    what about periodic dumps of an LTS wikipedia release.
  - Re: (Score:2)
    
    by ibbey ( 27873 ) writes:
    
    I guess I don't see the problem... I'm a lousy programmer, and I could work up a proof-of-concept for this in about 10 minutes in PHP. Put your internal information in one frame and the Wikipedia information in another. Simply load the data from Wikipedia, either using the Wikipedia API, or just put the wikipedia page in a frame. This is not really any different than about a million other websites that aggregate information from multiple sources. This doesn't actually integrate the data (for example inserti
IFRAME? Intelligent proxy/page modification? (Score:3, Insightful)

by seifried ( 12921 ) writes: on Thursday July 16, 2009 @03:05AM (#28713437) Homepage

I assume you want up to date content and to have it clearly seperated from what is yours. Why not enclose the content within an IFRAME? Seriously, it's stupid and simple but might be all you need. Alternatively you coudl use some form of an intelligent proxy/page modifier, either as a mediawiki plugin or whatever floats your boat (i.e. every time a page is loaded also try to get the wikipedia stuff).

Share
twitter facebook
- Re: (Score:2)
  
  by foniksonik ( 573572 ) writes:
  
  If you want to get fancy, use AJAX to grab the Wikipedia content, stuff it into a hidden div, then DOM select the contents of the article and set a visble div's html to the wiki content:
  [code]
  var wikiSource = JQuery.get("http://wikipedia.com/somearticle/", function (wikiHtml){ setContent(wikiHtml); })
  function setContent(wikiHtml){
  JQuery("#hiddenDiv.html(wikiHtml);
  var wikiContent = JQuery("#hiddenDiv #content").html();
  JQuery("#visibleDiv").html(wikiContent);
  }
  [/code]
  - Re: (Score:2)
    
    by truthsearch ( 249536 ) writes:
    
    Except you can't currently make off-domain AJAX calls. It's blocked for security reasons. There's a proposed standard for whitelisting domains, but it doesn't appear to be implemented in any browsers yet.
    - Re: (Score:2)
      
      by billcopc ( 196330 ) writes:
      
      And writing a 3-line AJAX proxy script is too difficult ?
      CURL page
      strip garbage
      output to client
      How hard was that ?
Business Talk is Stupid Talk (Score:4, Insightful)

by rm999 ( 775449 ) writes: on Thursday July 16, 2009 @03:11AM (#28713475)

"What we would like to do is leverage existing wikis by augmenting our internal wiki with an external wiki"
What does that even mean? If you want to design something, you'll have to use more precise language. And for god's sake, stop using the word leverage without thinking about it. You used it backwards - if you are augmenting your internal wiki with external wikis, you are leveraging your internal wiki with the external wikis. You leverage a boulder with a lever, but you don't leverage a lever with a boulder.

Share
twitter facebook
- Re:Business Talk is Stupid Talk (Score:5, Funny)
  
  by MrMr ( 219533 ) writes: on Thursday July 16, 2009 @06:13AM (#28714379)
  
  As a non native speaker I find a dictionary quite convenient in these cases. so I'll do some back and forth translation for you here:
  
  leverage (v.) -> opkrikken -> fuck up
  augment -> duurder maken -> make more expensive
  internal wiki -> krabbel zonder net -> off-line blurb
  external wiki -> krabbel met net -> on-line blurb
  existing -> nog bestaand -> not yet deleted
  
  So the English to English translation is: "What we would like to do is fuck up non yet deleted blurbs by making our off-line blurbs more expensive with on-line blurbs".
  Now that I can understand.
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by PMBjornerud ( 947233 ) writes:
  
  "What we would like to do is leverage existing wikis by augmenting our internal wiki with an external wiki"
  What does that even mean? If you want to design something, you'll have to use more precise language.
  His example is much clearer:
  For example, links to company-specific wiki pages would be available in Wikipedia pages.
  One solution could be a Firefox greasemonkey script, as someone above already suggested.
- Re: (Score:2)
  
  by Hognoxious ( 631665 ) writes:
  
  I have no idea what he means, but when he ran it up the flagpole I saluted.
- Re: (Score:2)
  
  by Lumpy ( 12016 ) writes:
  
  give him some slack, he's been in meetings all week with PHB's and Executives that throw the terms around like candy and they don't even know what it means.
  "It will bring us a whole new dynamic by leveraging our skill-set when applied to the future latitude and positions."
  Everyone knows the suits in the corner offices talk only to hear themselves talk. It's either that or Business Administration degrees have a "ramble on like you are educated" class requirement.
- Re: (Score:2)
  
  by psmears ( 629712 ) writes:
  
  You leverage a boulder with a lever, but you don't leverage a lever with a boulder.
  Actually I lever [cambridge.org] the boulder with a lever. Where I come from, "leverage" is a noun, to which the corresponding verb is "lever" :-)
Browser overlay (Score:2)

by Phroggy ( 441 ) writes:

It seems to me I've seen a browser extension somewhere that lets users add their own comments to any arbitrary web page, and those comments can be made public so anyone else running the same browser extension will see them when they load the same page. I bet you could use something like that, with all your users having a browser plugin that pulls URL-based content from an internal server.
Friendly MITM attack (Score:2)

by RiotingPacifist ( 1228016 ) writes:

Sounds like a weird setup, so you'll probably need to do most of it yourself. Perhaps the easiest way is
1) setup a normal local wiki, with care to name pages the same as the relevant wikipedia page [I'm guessing you know how to do this]
2) use DNS redirects or similar tricks to get all wikipedia requests to go to a proxy
3a) do html injection on the page and stick your stuff at the bottom [MITM attack using ettercap or something like that]. This is probably a pretty bad solution, but is going to be the easies
- Re: (Score:1)
  
  by readthemall ( 1531267 ) writes:
  
  Consider also that Wiki pages are mainly intended to be updated. You should decide if your users should be able to modify the content they are viewing or not. If yes, make sure they can modify only the local content and not the content borrowed from Wikipedia.
  - Re: (Score:2)
    
    by JWSmythe ( 446288 ) writes:
    
    That wouldn't work so well, if it were time to update from Wikipedia. I would assume they'd update frequently from Wikipedia (say once every month or so), but is it really necessary to suck down their whole database, when in reality if it's a small network (say less than 10,000 users), there will only be a handful of pages read.
    Ah, what happened to the good ol' days, when the whole Internet fit on that one AOL disk. :)
Doinitwrong (Score:5, Insightful)

by Anonymous Coward writes: on Thursday July 16, 2009 @03:35AM (#28713613)

Agreed. Appending to wikipedia is the ass backwards way to do it. Everyone suggesting greasemonkey and other addons are just enabling your backassery.
What you do is create an internal wiki, and wherever relevent you link to the wikipedia article. Or an external doc. Or nothing at all and expect your employees to look it up on their own.

Share
twitter facebook
- Re: (Score:2)
  
  by karstux ( 681641 ) writes:
  
  That's what I'd have suggested as well. Least amount of work, efficient, usable, no questionable hacks. It's common sense.
  - Re: (Score:2)
    
    by tomhudson ( 43916 ) writes:
    
    Of course it's common sense - which is why it won't be done that way. It's the "OMG you expect users to figure this out?" shit.
    Sounds like someone never heard of 'target="new"' to force the external link to open in a new tab so that the user doesn't go "where did my f*ing internal wiki page go to?"
    ... which explains why they never succeeded before - dumb users, and dump "implementors", and not even the basic understanding of how things work.
    - Re: (Score:2)
      
      by Lumpy ( 12016 ) writes:
      
      which explains why they never succeeded before - dumb users, and dump "implementors", and not even the basic understanding of how things work.
      Welcome to corporate America. Like what we did to the economy?
      - Re: (Score:2)
        
        by geminidomino ( 614729 ) * writes:
        
        Welcome to corporate America. Like what we did to the economy?
        Should have gone with Art Deco. The whole "Early Mongolian Clusterfuck" theme clashes.
interwiki (Score:5, Interesting)

by MadFarmAnimalz ( 460972 ) * writes: on Thursday July 16, 2009 @03:35AM (#28713619) Homepage

You probably want interwiki [wikipedia.org].

Share
twitter facebook
google wave (Score:1, Offtopic)

by linhares ( 1241614 ) writes:

sorry folks, it's all over and google has won. Google wave, coupled with an internal dump of wikipedia, seems to me perfect for your needs.
watch 1.20hs here and see for yourself [google.com]. This monster will change email, chat, wikis and forums. I'd be worried if I was a slashdot overlord. In fact, an idea for an extension to google wave would be to implement slashdot's moderation system into it.
Maybe I drank too much of the kool-aid, but I think wikis and forums will all have to rapidly adapt, or adopt the co
- Re: (Score:2)
  
  by oatworm ( 969674 ) writes:
  
  Mountain View?! Eww... I'll take the crab juice!
Don't (Score:5, Interesting)

by pfafrich ( 647460 ) writes: <rich@@@singsurf...org> on Thursday July 16, 2009 @03:43AM (#28713675) Homepage
Merging wikipedia with you company wiki is a bad idea:
- The wikipedia content will always be out of date
- Changes made to wikipedia content don't get fed back into wikipedia
- Creates confusion as to what is and is not company information
- Trying to load the wikipeida DB locally is a headache due to its shear size
Share
twitter facebook
- Re: (Score:1, Informative)
  
  by Anonymous Coward writes:
  
  sheer
- Re: (Score:3, Interesting)
  
  by korpique ( 807933 ) writes:
  
  I agree (would mod up but gave up modding way back). However this is an interesting and probably reoccurring problem: extending the wealth of public net wisdom with precision data from local context (organisational or task-centric rather than geolocational).
  A proxy adding local content into pages loaded from outside as suggested in Re:Solution by mcrbids [slashdot.org] would solve some of the problems you mention:
  * The wikipedia content will always be out of date
- Re: (Score:1)
  
  by FlyingBishop ( 1293238 ) writes:
  
  Also, strictly speaking, what the poster wants to do is illegal according to the CC-BY-SA and the GFDL.
  See http://en.wikipedia.org/wiki/Wikipedia:License#Re-use_of_text [wikipedia.org]
  Copyleft/Share Alike
  If you make modifications or additions to the page you re-use, you must license them under the Creative Commons Attribution-Share-Alike License 3.0 or later.
  I'm not sure he's planning on modifying, but it still sounds like a pretty clear-cut copyright violation.
  - Re: (Score:2)
    
    by pfafrich ( 647460 ) writes:
    
    It does depend on quite what distribute means
    "Distribute" means to make available to the public the original and copies of the Work or Adaptation, as appropriate, through sale or other transfer of ownership.
    If the modified version is only used in-house then it not made available to the public so clauses about redistribution do not apply.
Hyperlinks? (Score:1)

by Malibee ( 1215790 ) writes:

Maybe I'm missing something, but why not just have an external links section on your internal wiki, or a "Required Reading" section? Seems like the solution you're proposing is a little bit heavyweight for the described problem.
Legitimate use for this hack (Score:2, Interesting)

by biduxe ( 541904 ) writes:

Am I the only one which cannot see any legitimate uses for this hack.
Why lure your users into thinking the content is on wikipedia if it is on your network?
Can't your users use wikipedia _and_ your wiki.
Sincerely I think that the goal for this hack is luring users to think they're reading/editing wikipedia for someone's profit.
- Re: (Score:2)
  
  by tomhudson ( 43916 ) writes:
  
  Why lure your users into thinking the content is on wikipedia if it is on your network?
  Can't your users use wikipedia _and_ your wiki.
  
  Obvious answer: If they're as retarded as the person posting the question ...
  Seriously, if the user can't figure out how to open 2 sites in 2 tabs, a "merged wiki" should be low on you list of priorities.
your content, how proprietory is it? (Score:1)

by tumbleweedsi ( 904869 ) writes:

You need to make sure that there is a clear demarcation between your content and the wikipedia content and this will limit your integration. The last thing you want is for one of your users to upload confidential information onto wikipedia in the mistaken belief they are putting it on the in house wiki.
Maybe... (Score:2)

by denmarkw00t ( 892627 ) writes:

Open page in intranet for...say, capcitor.
Script grabs wikipedia article, strips out header, sidebar, etc and fill in remaining links/images with proper URLs to wikipedia (so they work)
Stores in a database for diff'ing and updating later, dumps remaining content from Wikipedia at the bottom with a good 'ol <hr> and you're off!
What? (Score:3, Interesting)

by madcow_ucsb ( 222054 ) writes: <slashdot2@sanksEULER.net minus math_god> on Thursday July 16, 2009 @04:04AM (#28713787)

Why? Can't you just link to wikipedia pages where appropriate? OK, my company has an internal server we link through to sanitize referrer info so our internal wiki titles don't get all over teh interwebs. But if the wiki users can't figure out "hey, this article is too specific - maybe wikipedia has more general information that would help me," you've got bigger problems than your wiki management.

Share
twitter facebook
part of an Intelligent Book (Score:4, Interesting)

by williamhb ( 758070 ) writes: on Thursday July 16, 2009 @04:16AM (#28713837) Journal

A very small part of My PhD [cam.ac.uk] looked at this (but with "collaborative textbooks" rather than wikis) -- see Chapter 4. Adding a very simple metadata-based navigation layer over the top of the wiki is pretty easy, clean (doesn't confuse users), and seems to do the trick. The wiki itself shows in an embedded frame. Of course, I had to go further and let students do difficult number theory proofs backed by machine reasoning systems within the book, but you won't have to solve that problem!
I'm (gradually) putting this fairly simple but useful part of the software into an online resource at www.theintelligentbook.com [theintelligentbook.com], though it's in my spare time and the system is down at the moment. I'll put my contact details back up there shortly in case the question-asker wants to discuss it technically.

Share
twitter facebook
Simple. Two Tabs (Score:2)

by Phoe6 ( 705194 ) writes:

One Tab for your Internal Wiki. Another one for wikipedia.
You can also highlight a particular word in your internal wiki, do a right click and search wikipedia (if your search is set so). The search term automatically open the wikipedia content in a new tab. How amazing. Isn't it?
Is it only me wondering how did this article ever made it to /. ?
Learn from mistakes. (Score:1)

by JamesR404 ( 1546869 ) writes:

I think this is a very interesting story. Aside from the technical question raised, I am wondering why the first corporate Wiki wasn't successful. If it failed the first time because the culture isn't right or there wasn't any management support, a second wiki tool - no matter how seamlessly integrated - won't succeed either. Even if you have a company with many different technical domains it's even more reasonable to be able to share information. And an article shouldn't try to be totally comprehensible.
Not without merit. (Score:1)

by asdfndsagse ( 1528701 ) writes:

This is something the Google Wave protocol and platform [youtube.com] completely anticipates.
Its based on a tree structure and source code management. People who edit from the synergized wiki could add to either the private or public versions, and patches to public versions or additional documents could be changed and maintained internally.
- Re: (Score:1)
  
  by asdfndsagse ( 1528701 ) writes:
  
  That would essentially be the way it would happen. You would hot pull down the mediawiki source, apply local changes, and locally render to pages with active diffs. You would add have pages that only exist locally. Due to limitations in the platform you would have to custom design any way to have changes that people make go either to public or private system, this would be difficult under the current system constraints, where the documents structure is not kept track of.
The simplest solution I can think of is.. (Score:1)

by kikito ( 971480 ) writes:

1. On your personal wiki server, have a copy of each page of the wikipedia you want to apply modifications to, and add whatever you want on those.
2. Have a modified http proxy on the intranet that detects queries to the wikipedia about items that you have on the server and re-route them.
For example, let's say you want custom information on http://en.wikipedia.org/wiki/Socks [wikipedia.org]. You copy it to http//yourintranetserver/wiki/Socks, and make your changes.
Then, if someone from inside your network tries to g
Freebase.com (Score:1)

by TwistedPants ( 847858 ) writes:

Go and look at Freebase: http://www.freebase.com/ [freebase.com]
They provide an API to obtain articles and structured data from them. They handle all of the wikipedia import.
Additionally, you can do much more with the structured data there
For instance - Olympic Cyclists and the Way They Died.
http://www.freebase.com/view/user/doconnor/default_domain/views/olympic_cyclists_and_they_way_they_died [freebase.com] Try doing that with Wikipedia.
Done this before (Score:1)

by MarkH ( 8415 ) writes:

1) Install Wikipedia software locally and use this for any locally created articles
2) The web server running this simply proxies out to en.wikipedia.org for that request if not available in the local version. The easiest way to do this is with Apache + rewrite rules
This means that users can get to articles locally and on wikipedia from the same command
You then need to consider the following
1) The search request needs to go to the local version of wikipedia then the external one and concatinate the results t
Why not (Score:1)

by Vahokif ( 1292866 ) writes:

Why not run MediaWiki on your intranet and use InterWiki links to Wikipedia in your own articles?
howabout about.com? (Score:1)

by bball99 ( 232214 ) writes:

they were pretty good at page-hijacking, IIRC :-)
seriously though, perhaps i mis-read the question? are you looking for automated tools to do the hyper-links?
There's lots of value in a compound wiki (Score:4, Interesting)

by davide marney ( 231845 ) * writes: on Thursday July 16, 2009 @06:32AM (#28714479) Journal

Ignore the nay sayers. Of course there is a lot of value in aggregating content and creating a compound page that blends your internal content with other sources.
From a usuability and authority-of-source perspective, however, I think it would be best to list each source in a separate section on the page, starting with your internal content at the top. You can get to the other content either by embedding links into your internal content, or by collecting the links in a separate section.
Wikipedia itself uses the embedded technique. When composing or editing an article, the author can embed markup for external references. On display, this markup is turned into a footnote link at the point of embedding, and a footnote at the bottom of the page. I don't see why you couldn't do something similar. In this case, however, you would be embedding references to Wikipedia articles.
I don't see why you couldn't do something similar. In your internal wiki templates, have a custom markup for embedding wikipedia queries related to the article. On display, turn this markup queries either into embedded links to footnotes, resolve the queries and deposit them at the bottom of the page, or toss them into iframes and let the user sort it out.
The other technique is to have a custom form in your internal wiki template where you collect the cross-references. On display, turn these queries into links or resolve them into content.
In any event, why limit yourself to Wikipedia? Include cross-references to patent search engines and other domain-specific sources.
A big word of caution, of course, is owed to the legal angle. Make sure you follow the law whenever reusing anyone else's content, even if it's just a link. Have your legal department sign off on your reuse policy. Don't distract them with technical aspects of what you want to do. They're lawyers; they only care about the law. Ask them a specific legal question, such as, "what is our legal exposure if we republish (links to or actual content from) Wikipedia on our internal wiki?".

Share
twitter facebook
WikiSlurp... (Score:1)

by bagsta ( 1562275 ) writes:

Accidentally I saw this site [thecodetrain.co.uk]. I haven't tested and I don't know the results. I think it's in the early stages of development.
Is this really worthwhile? (Score:1)

by gertam ( 1019200 ) writes:

I don't get it. Are people in your company using Wikipedia so much in their daily work that this would really be useful. Just set up your internal wiki. It is your focal point. Why try and integrate the two beyond just making a link to Wikipedia? Using Mediawiki, you can even use Interwiki links to easily link outside of your internal wiki.
Try the other way round (Score:1)

by RogL ( 608926 ) writes:

Why not try the other way round:
Create your wiki, add pages, add links from your wiki pages (which you have full control over) to relevant wikipedia pages?
Much simpler, and should still produce the desired effect.
The Real issue is Social. (Score:2)

by Electrawn ( 321224 ) writes:

You are trying to force a technical solution on a social problem. It's probably not going to work. Your best bet for success is to try and install a WYSIWYG editor for mediawiki. There are several out there. wiki, underneath, is just a programming language. It requires training people - no matter how much it is designed to be "easy." Make it easier.
Consider Sharepoint. As much as /. is Anti-Microsoft, if your users are used to Exchange and Windows then Sharepoint is worth paying for.
I've worked for Larry Sa
Extensions (Score:2)

by Jjeff1 ( 636051 ) writes:

I wrote a very simple extension for my own mediawiki site that pulled in external pages as an iframe within a wiki page. I'd imagine you can do the same, Build your own wiki, with the wikipedia pages included below your own content.
Tearline Wiki (Score:2)

by ijones ( 83977 ) writes:

The experimental Tearline Wiki [galois.com] system we've developed at Galois [galois.com] might suit your needs. Inside the firewall, you use MediaWiki with the Tearline system, and get a combined view of your internal wiki(s), possibly different wikis on different sub-nets, and you can integrate it with Wikipedia or other internet-based wikis to get the global context of the article.
As others have said, integrating your content with other people's content can be a legal issue.
Contact me if you want more information on Tearline :)
pe
Wikipedia is X rated (Score:1)

by cellurl ( 906920 ) * writes:

Just keep them separate.
I work for a huge corporation and we have our own thing called etipedia.
Also, don't forget, wikipedia is X rated. [wikipedia.org]
MediaWiki interwiki links (Score:2)

by BitZtream ( 692029 ) writes:

Use interwiki links. I use them to link our intranet, mediawiki, our external developer wiki, and our external support wiki.
You will probably be unable to use them since using them requires the ability to get off your lazy ass and read the MediaWiki documentation or google for it, which results in plenty of information.
Also the fact that you're going to have to be able to insert a row in a database is probably going to be over your head.
READ THE DOCUMENTATION YOU LAZY FUCK.
Squid Proxy (Score:1)

by psychcf ( 1248680 ) writes:

Use squid proxy to inject the extra content, that way it's centralized.
IFrame + JavaScript = robust and simple (Score:2)

by thasmudyan ( 460603 ) writes:

If Wikipedia is indeed a good base for a lot of your company knowledge, you can do something dead simple: build a single PHP (or whatever language you prefer) page with an IFrame in it. Inside the IFrame you let users browse Wikipedia or any other web resource. Outside, in the parent document, there is a script that looks at the current IFrame URL and checks a local database for additional information. This could be additional text or even a stream of internal comments on this URL. The beauty of this idea
Semantic MediaWiki and SMW+ (Score:1)

by javester ( 260116 ) writes:

You may want to check the Semantic MediaWiki (semantic-mediawiki.org) or SMW+ (wiki.ontoprise.de).
Both are built on top of MediaWiki (which powers Wikipedia) so you can tap the very rich pools of extensions (numbering in the hundreds).
SMW+ is actually built on top of SMW, and it focuses on increasing usability and it preinstall pre-configured extensions out of the box to make it easier to deploy.
With SMW/SMW+, you can put in semantic annotations for an article describing just about anything you want to ass
seems like a very easy hack (Score:2)

by portscan ( 140282 ) writes:

unless i am completely misunderstanding you, this seems like a pretty easy hack on any wiki engine. just query the page's title at other wikis and append the content to the bottom. for example: you have a page called Server Farm -- detailing your companies server farm. whenever that page is loaded in a browser, the dynamic content generator in the website downloads the page with the same name from wikipedia, strips out their formatting, and sticks it at the bottom of your page. your users can only edit y

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

URLs (Score:2, Funny)

Re: (Score:3, Insightful)

Re: (Score:1, Offtopic)

Re: (Score:3, Interesting)

Re: (Score:2)

cron + rsync + tar (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

bad idea (Score:5, Interesting)

Re:bad idea (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re:bad idea (Score:4, Funny)

Re: (Score:1)

Re: (Score:1)

Download it (Score:2, Informative)

Re: (Score:1)

Solution (Score:5, Informative)

Re: (Score:2, Informative)

Re: (Score:2, Funny)

Re: (Score:1, Insightful)

WHOOOOOOOOOSH (Score:1)

Re: (Score:2)

Re: (Score:2)

Re:Solution (Score:5, Interesting)

Re: (Score:1)

Re: (Score:2)

IFRAME? Intelligent proxy/page modification? (Score:3, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Business Talk is Stupid Talk (Score:4, Insightful)

Re:Business Talk is Stupid Talk (Score:5, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Browser overlay (Score:2)

Friendly MITM attack (Score:2)

Re: (Score:1)

Re: (Score:2)

Doinitwrong (Score:5, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

interwiki (Score:5, Interesting)

google wave (Score:1, Offtopic)

Re: (Score:2)

Don't (Score:5, Interesting)

Re: (Score:1, Informative)

Re: (Score:3, Interesting)

Re: (Score:1)

Re: (Score:2)

Hyperlinks? (Score:1)

Legitimate use for this hack (Score:2, Interesting)

Re: (Score:2)

your content, how proprietory is it? (Score:1)

Maybe... (Score:2)

What? (Score:3, Interesting)

part of an Intelligent Book (Score:4, Interesting)

Simple. Two Tabs (Score:2)

Learn from mistakes. (Score:1)

Not without merit. (Score:1)

Re: (Score:1)

The simplest solution I can think of is.. (Score:1)

Freebase.com (Score:1)

Done this before (Score:1)

Why not (Score:1)

howabout about.com? (Score:1)

There's lots of value in a compound wiki (Score:4, Interesting)

WikiSlurp... (Score:1)

Is this really worthwhile? (Score:1)

Try the other way round (Score:1)