Google Tags Content Creators 67
bizwriter writes "Google announced that it will support authorship HTML tags, a way to associate Web content with the individuals who create it. Suddenly, search engines know when one person was responsible for a body of work, no matter where content appears on the Web. If Google incorporates this into page relevance and ranking, as it is considering, the result could change the balance of power between those who create and those who publish."
Where's the incentive for publishers? (Score:1)
Re: (Score:1)
Because anything that helps put Gawker Media out of business is OK by me.
More seriously, because if I'm reading your blog's link to an article, it's because I want your commentary on the article. I might want the Fark thread about it, but I certainly don't want Gawker's take on BoingBoing's post about that dude on Reddit who read a NASA press release. If you're ju
Re: (Score:1)
Article Explained (Score:5, Informative)
It is made to sound more uncontrolled that it is. This is what really happens:
The markup uses existing standards such as HTML5 (rel=”author”) and XFN (rel=”me”) to enable search engines and other web services to identify works by the same author across the web.
This is handy, allowing search engines to find content by a specific author. It's not like Google will automatically decide what content links to which author.
We can't expect Google to give purely weighted search results based on this either. More like they will keep their existing page rankings, and include this extra author meta-data in specialized searches.
We know that great content comes from great authors, and we’re looking closely at ways this markup could help us highlight authors and rank search results.
The bnet article seems to over dramatize it, possibly due to a lack of understanding what this means for content creators.
Or do I also have the wrong idea?
Re: (Score:2)
Re: (Score:3)
I agree that they probably won't use it in search rankings, otherwise everyone will just copy the current number 1 "best author" in their tags..
Re: (Score:2)
Re: (Score:2)
Yes, but does that apply to the source code or to the displayed content? Copyright law doesn't seem to support HTML tags, whereas a direct statement "Copyright 2011 by Firstname Lastname" passes muster.
(Note than in the USA we all know you don't need a copyright statement to have the copyright. That's not what this is about.)
Re: (Score:1)
Yes, but does that apply to the source code or to the displayed content?
I just checked, and the answer is in the link provided to you. But I'm not going to tell you what the answer is, because that would be enabling your asshat behavior.
Re: (Score:2)
Yes, but does that apply to the source code or to the displayed content?
I just checked, and the answer is in the link provided to you. But I'm not going to tell you what the answer is, because that would be enabling your asshat behavior.
By my reading of the law... it makes no distinction between source or displayed content, but I see nothing in the law that would prohibit a copyright holder from claiming that someone else was the author. Perhaps some other law would, particularly if the claim could be construed as defamation, but I don't see anything in copyright law that addresses this issue.
Re: (Score:1)
What about plagiarism? (Score:2)
Will this help or hurt? A little before the turn of the century I researched Quake and Quake II console commands, tested them all, and wrote short descriptions of how to use them and what they did. It was copied on dozens of other web sites, word for word, usually with no attribution and usually with someone else's name on it.
Meta tags were badly misused to spam search engines. And what if you're putting content [slashdot.org] on someone else's site and have no control over the meta tags?
Re: (Score:2)
Will this help or hurt? A little before the turn of the century I researched Quake and Quake II console commands, tested them all, and wrote short descriptions of how to use them and what they did. It was copied on dozens of other web sites, word for word, usually with no attribution and usually with someone else's name on it.
I'm not sure that would even be covered by copyright law. You aren't allowed to copyright "facts" or "factual data". Maybe if your "short descriptions" were long enough, or expounded on the command beyond being a simple summary, it could be considered an original work. But for the most part, a simple compilation or list of factual information is not considered a copyrightable work.
Re: (Score:2)
I, on the other hand, believe it would be.
Here's the original authorship:
wrote short descriptions of how to use them and what they did
Or are you saying that technical help documentation cannot be copyrighted?
I imagine there are a few other people who would disagree with that as well.
Note - this is entierly seperate from a discussion on what *should* be able to be copyrighted, much less what goals we wish with the laws and whether they accomplish those goals.
Regards.
Re: (Score:2)
The data can't be copyrighted, but its presentation is. If you write a book about chemistry I can read it, learn from it, and write my own chemistry book using the facts from your book as long as I present those facts in my own words. The plagarists copied the entire thing whole cloth, even using the same IP address I used in one of the examples. Although my question here is about plagarism rather than copyright infringement (I had no problem with someone republishing it provided they gave me credit and a l
Re: (Score:2)
Well then let me thank you for those lists, muchly appreciated! :: Q1 fan
Authorship Tag (Score:1)
Re: (Score:3)
probably nothing.. as well as another site copying your site can just remove your tag and replace it with theirs, claiming they're the original author..
Re: (Score:2)
Replying to myself: seems it has to be reciprocal to work.So that's stopping someone from linking to an official author.
You need rel=me on both sites linking to eachother.
http://www.google.com/support/webmasters/bin/answer.py?answer=1229920 [google.com]
Now I wonder - it's an html5 tag. Should I already implement it on my own website which isn't html5 or would google then just ignore it ? .. but if it's going to be ignored then I won't bother..
I can already put it on my own site, blog, facebook,
Re: (Score:2)
Google's engine does not distinguish between the various versions of HTML. As long as Google successfully detects the page as html (and it is quite good at determining that), you can use any feature from any version and Google could not care.
For what it is worth, this markup is also valid HTML 4, but HTML 4 simply does does not define the meaning of the "me" or "author" values of the rel attribute, while HTML 5 does define the meaning (although I have not actually verified that).
Fraudulent authorship notices (Score:2)
what's to stop me from "borrowing" someone's author tag
Federal law, as I pointed out in another comment [slashdot.org].
That's because the UK has its own counterpart (Score:3)
Didn't know the federal law had jurisdiction in the UK.
That didn't stop your Parliament from enacting its own counterpart to this legislation in 2003, as section 296ZG of British copyright law [legislation.gov.uk].
Re: (Score:2)
The full power of the Copyright SWAT team. Or Slander & Libel.
Summarizing you, you're talking about putting Respected_Author tags on 4chan posts.
Re: (Score:1)
Which is exactly what will happen. Current link farms will cross pollinate each other and it will be nearly impossible to tell who really wrote anything. Least likely will be the person who did write the original content.
What could possibly go wrong? (Score:3)
Oh dear me, am I missing something?
So you can totally spoof random people's names into any webpage? So searches for author=Obama come up with doctored pics of Osama-Obama slash or something?
Re: (Score:2)
Oh dear me, am I missing something?
So you can totally spoof random people's names into any webpage? So searches for author=Obama come up with doctored pics of Osama-Obama slash or something?
Thanks for the imagery, but what is it that makes you think you can't _already_ claim any random person wrote something? Do you think the normal non-tag text in an HTML document is under a magic spell that present misattribution?
Re:Claim (Score:2)
Because this is an Author Tag! (Cue the Serious Stern Face.)
Of course twerps can claim stuff. So far people can just laugh stuff off.
Now the obvious use of the tag is for the copyright police... they're gonna try to make the author tag a statement almost akin to under oath. So all those tv show clips on youtube that don't have the network=author tag are instant slam-bait.
But now the more dangerous case is when Da Gov wants to do False Flag cases, and posts pics of Democrats sharing lingerie, and they put "A
not even that obvious (Score:2)
I pick a respected author, perhaps academic, who writes about similar things as me. I publish my crap whitepaper claiming to be him. It's likely that no human will notice the deception. Depending on my goals, the human-readable text of the whitepaper will claim the author to be him or me.
Re: (Score:2)
Oh, of course.
I used a little humor. But yes, you absolutely have a clear case - you submit something in an intelligent style, and the first pass no one notices, until it accidentally gets picked up and then they slam the original creator.
What for example if that math paper that got hosed last week was *spoofed*? It's bad enough if the original author goofed, but since he got pulverized for "not checking", what if it was a classy defamation attack?
Re: (Score:2)
Re: (Score:2)
A tag in the HTML source? It can be ripped... (Score:2)
If this is implemented via tags in the HTML itself, it can be easily detected and stripped by content thieves, can't it?
If I copy the entire body of work of, say, the War Nerd, and set up a copycat blog ("the war geek"), how can these tags (which I've already modified) tell this is a blatant rip-off?
Re: (Score:2)
That's probably true. But if I understood this right, the point is to make the authors more visible on the internet - for example if I find a blog I like, I can easily find more writings by the same author, no matter what site they're on.
Re: (Score:2)
Unless the author has a common name like John Doe...
The only way a tag like this *might* work would be to make the tag value a public-key signature of the content enclosed inside the tag. Which would allow you to see that content A was signed by key XYZ, as was conten
Re: (Score:2)
Judging from the Google blog this doesn't sound much like a rip protection, but more as a way to allow searches like "Show me everything else the author of this particle has written". That said, rip protection should be possible, when they would mark the first page that they find with content as special and then everything with the same content as copy.
Re: (Score:1)
Re: (Score:3)
It seems like this falls into the category of 'potentially useful incremental change'. It isn't resistant to rip-offs(but neither was the status quo) and it makes it somewhat easier for good-faith actors to make a pertinent piece of metadata easily accessible. The metadata dreams
Re: (Score:2)
If you include the host domain in the digital signature, you'd be able to prevent people from re-hosting the work (or at least detect it and ignore copies). You'd still need the priority system you suggested to identify THE author (otherwise, as you say, somebody could rip and re-sign the content for a new host).
It's probably too much work for the benefit you'd get, but it might be worth the experiment, and Google is exactly the people to do that experiment. It means a vast amount of crunching, possibly t
Looks abusable to me (Score:1)
If somehow it's discovered that a particular author earned a high pagerank, what exactly would prevent linkfarms from tagging that author on every one of their pages?
Re: (Score:1)
Google is not that dumb, the article is just wrong.
From google [google.com]
This tells search engines: "The linked person is an author of this linking page." The rel="author" link must point to an author page on the same site as the content page. For example, the page http://example.com/content/webmaster_tips could have a link to the author page at http://example.com/authors/mattcutts. Google uses a variety of algorithms to determine whether two URLs are part of the same site. For example, http://example.com/content, http://www.example.com/content, and http://news.example.com can all be considered as part of the same site, even though the hostnames are not identical.
publisher or re-publisher? (Score:2)
Most people add their HTML to a server in one way or another. Isn't that publishing? It isn't like there are private web sites with articles that where written by an author then transferred to HTML to be posted to the web. Oh wait. No. AOL isn't that way any longer.
John Smith (Score:2)
Lots of people have common names. You could be a Michael or a Mary or a Mohammed or a Jennifer or a William.
Links to URL, not name (Score:1)
See details here [google.com], where it is explained that all works authored by someone in a domain should be linked to a unique author page at that domain, and that authors can associate/link their author pages between various domains using reciprocal linking.
Boy. and how (Score:2)
Re: (Score:2)
Who says it's meant to prevent it?
Locke and Demosthenes (Score:1)
FINALLY! (Score:2)
We'll get to find out who Goatse REALLY is.
to those of you saying (Score:2)
that it will be easy to randomize/ spoof/ rip off, and a stupid tag doesn't change anything:
FIRST APPEARANCE of author tag means something. and no, it doesn't mean i can change the publish date on the file to June 1st, 1896 and always be the first author: when did SEARCH ENGINES first see content XYZ with author tag ABC?
that's case closed, right there. you can't spoof this system, unless you have a time machine, or you can hack google
now, if anyone rips off your content, you will be able to point to google'
Re: (Score:2)
Re: (Score:2)
you're talking about some pretty fringe time cases
besides, the problem is easily corrected: if you write something valuable to you that you fear someone will rip off, you ACTIVELY submit the page to the search engines, rather than waiting for them to be passively scanned
Re: (Score:2)
we're talking about a whole new system here, that google just put in place
so either google is really concerned about properly attributing sources, and guarantees the timestamp on a submission
or google just added support for the author="" attribute, and all their work means nothing
besides, you really believe there's no timestamp record on their addurl page?
google, the people who track everyone and everything?
Re: (Score:2)
hey, asshole: it's a new system, give it time. i'm glad you've decided everything already for all of us. don't be such a blowhard
Gaming the system? (Score:1)
If this were used for ranking, then I would expect web masters to attribute articles to Big Names.
I would hope that Google would have a policy of fingerprinting the articles. Most people's writing style is sufficiently unique that claiming that someone else wrote Foo is fairly obvious on analysis.
I hope also that there is a search tool so that I can find all articles attributed to me.
And suppose that Slashdot and phpBB support this tag so that I can find all the posts by a given author.