×

Welcome to the Slashdot Beta site -- learn more here. Use the link in the footer or click here to return to the Classic version of Slashdot.

Thank you!

Before you choose to head back to the Classic look of the site, we'd appreciate it if you share your thoughts on the Beta; your feedback is what drives our ongoing development.

Beta is different and we value you taking the time to try it out. Please take a look at the changes we've made in Beta and  learn more about it. Thanks for reading, and for making the site better!

'Culturomics' Spreads From Google Books To Scientific Preprints

Soulskill posted about 2 years ago | from the no-making-up-words dept.

Google 12

ananyo writes "Cultural Observatory at Harvard University in Cambridge, Massachusetts is to index the whole of the ArXiv pre-print database of papers from the physical sciences, breaking down the full text of the articles into component phrases to see how often a particular word or phrase appears relative to others — a measure of how 'meme-like' a term is. The team has already applied a similar approach to 5 million books in the Google Books database to produce their n-gram viewer. But the Google Books database carries with it a major limitation: because many of the works are under copyright, users cannot be pointed to the actual source material. Applying the tool to ArXiv means it could be used to chart trends in high-energy physics, for example: a quickening pulse of papers citing the Higgs boson, for example, or a peak in papers about supersymmetry, a theory which may soon be waning."

cancel ×
This is a preview of your comment

No Comment Title Entered

Anonymous Coward 1 minute ago

No Comment Entered

12 comments

News for Nerds? (-1, Offtopic)

Jorl17 (1716772) | about 2 years ago | (#39153455)

Stuff that matters?

Re:News for Nerds? (1)

ackthpt (218170) | about 2 years ago | (#39153529)

Stuff that matters?

In Soviet Russia books scan YOU!

Re:News for Nerds? (0)

Anonymous Coward | about 2 years ago | (#39153689)

Hmm, I wonder what an Ngram viewer using Slashdot as the corpus for "first post" and "in Soviet Russia" would look like...

Re:News for Nerds? (1)

Anonymous Coward | about 2 years ago | (#39153647)

all stuff has matter.

this story has:
1. stuff about researchers
2. stuff about scientific papers and journals
3. nerd-ness

you, sir, are an imposter. kindly hand over your counterfeit /. papers and be on your way.

Re:News for Nerds? (0)

Anonymous Coward | about 2 years ago | (#39153935)

-sniffle- Why can't Jorl17 be the Slashdot censor?

Re:News for Nerds? (1)

Jorl17 (1716772) | about 2 years ago | (#39154075)

Don't know, probably because he doesn't represent a big enough portion of the community in spite of having a mind of his own?

Strongest meme-like correlation found: (0)

Anonymous Coward | about 2 years ago | (#39153619)

"I used to look for the Higgs Boson, but then I took an arrow in the knee."

SUSY hardly waning... (1)

Anonymous Coward | more than 2 years ago | (#39154633)

Did I miss something? All that's been cast into doubt is minimal supersymmetry. One might as well say that unified theories died with Kaluza-Klein. Supersymmetry is still the best solution to the heirarchy problem, so if it is incompatible with nature then we are royally fucked and have little to no clue what the physical principles on this scale are.

What kind of article is this? (1)

TaoPhoenix (980487) | more than 2 years ago | (#39154753)

Since there is only EIGHT comments and I just lost my mod points, here goes without reading The A.

Of course in Scientific Circles there are Memes, but they're NOT the same ones that go Viral among Biz. Masters.

Science has to break new ground, so it's Anti-Meme.

The Memes circulate a level down, somewhere in the Consultant range.

Wrong link to the n-gram viewer - CaSe mAttErS. (4, Informative)

tibit (1762298) | more than 2 years ago | (#39154963)

The link to the n-gram viewer in the submission is wrong. The Ngram Viewer is case-sensitive. The link goes to the uncapitalized sarch using terms "spock, skywalker". If you correctly capitalize the terms [google.com] , you get results higher by 2 orders of magnitude.

Preliminary data - the top 10 words and phrases (1)

RDW (41497) | more than 2 years ago | (#39157347)

1. However
2. Moreover
3. Furthermore
4. Indeed
5. Subsequently
6. Utilized
7. Methodology
8. Data not shown
9. Further research is warranted
10. Vajazzle

Re:Preliminary data - the top 10 words and phrases (0)

Anonymous Coward | more than 2 years ago | (#39158195)

Vajazzle: to bedazzle [urbandictionary.com] a vagina.
Check for New Comments
Slashdot Account

Need an Account?

Forgot your password?

Don't worry, we never post anything without your permission.

Submission Text Formatting Tips

We support a small subset of HTML, namely these tags:

  • b
  • i
  • p
  • br
  • a
  • ol
  • ul
  • li
  • dl
  • dt
  • dd
  • em
  • strong
  • tt
  • blockquote
  • div
  • quote
  • ecode

"ecode" can be used for code snippets, for example:

<ecode>    while(1) { do_something(); } </ecode>
Sign up for Slashdot Newsletters
Create a Slashdot Account

Loading...