ISSUE-112: negative keywords-not meta tags

meta-keywords-not

negative keywords-not meta tags

State:
CLOSED
Product:
HTML 5 spec
Raised by:
Maciej Stachowiak
Opened on:
2010-04-28
Description:
Escalated from: http://www.w3.org/Bugs/Public/show_bug.cgi?id=6609
Escalation requested by: Nick Levinson

Rumor to the contrary notwithstanding, keyword meta elements do work, albeit
within limits. I did a test and also found confirmatory recent discussion
online about major search engines.

Insofar as they work, what's needed is a way to clarify relevance to one theme
by distinguishing it from another. Negative keywords would thus be helpful. For
example, a page about "virus" could be about computer viri or biological viri
but usually won't be about both. While major search engines may be intelligent
enough to distinguish in that well-known case, new subjects may not be well
known to search engine managers, and thus an author may prefer to control how
their theme is understood from the date of going live. A negative keyword could
quickly clarify the theme of the page.

Using body text may not be adequate. Consider a doctor writing a carefully
exhaustive article about aspirin's less-well-known uses and thus without
discussing headaches, since almost everyone already knows about that use. Being
careful, the doctor writes in the introduction that "the article will not
discuss headaches." Someone does a search for "aspirin NOT headache". They
should get that paper but they do not. A negative metatag may aid a search
engine in understanding the doctor's thematic intention and thus in supplying
what a searcher is seeking. Search engine designers would have to do some
careful work to handle the aspirin case as intended but they could do that far
more easily if we page authors have an HTML facility that would give search
engines something to work with.

Antonyms are usually a waste of time in this area, so the keywords-not
attribute need not be invoked just to provide an antonymy. Rather, this is for
cases where the same word serves very different meanings, such as _virus_,
including opposite meanings by the same word, such as _sanction_. Thus, writing
keywords-not would be infrequent, although the sheer scale of the Web and of
HTML usage means the attribute would be still used enough to warrant
recognition in a standard and adaptation by search engines.

Search engines give more weight to thematic words written directly into page
content. However, some thematic words may be difficult for authors to work into
text without going to some length to explain important complications, and that
might make the whole page too cumbersome, losing readers. If the main text is
to be short, leaving those secondary keywords out may be smarter writing of
content. This is often true when stating principles, which may be more easily
understood if stated in just a few words, leaving redundant particulars out.
But searchers may still use various common particulars to find this principle
via search engines. To support search, the keywords that represent the
particulars and are not in the visible text should be put into meta tags. Some
would go into meta elements with the keywords attribute. But, for some of them,
keywords-not may be the more relevant attribute. And that would keep the
positive keywords metatag from getting enormously long.

Keyword metatags long ago lost favor after their widespread abuse. However,
they are used by search engines; and I don't see how negative keywords are any
more susceptible to abuse than positive ones. Further, a page author could use
either positive or negative keywords without having to offer both so there'd be
no unwanted increase in the designer's workload. Optimizers could use
essentially the same tools to generate either kind of keyword. The only risk, I
think, is putting a word in both, but I think that would only be an author's
error, so each search engine could prepare for that eventuality any way they
see fit and editing software and validators could choose to alert an author to
the apparent conflict without requiring an author to change an element. Thus,
if a page author uses the same word in both but with differing case because one
represents a common product and the other a brand name the page author would
take the risk of being misunderstood by a search engine while a search engine
might observe the case distinction and consider how to handle it. The page
author could also use longer phrases either positively or negatively and thus
ease distinguishing themes.

Because of the relevance of Boolean NOT searches and for relative brevity and
to avoid an abbreviation that may not be familiar to speakers of other
languages, I propose calling it "keywords-not". I'm preparing to include
keywords-not in a website I'm designing, but I don't know when the site will go
live. My method will probably be to use a separate meta tag following the
metatag for keywords used positively, since they can't be combined into one
element, but I see no reason to require any position other than that both go
into the head, as one tag already must. E.g.,

<head>
. . . . .
<meta name="keywords" content="aspirin,heart,blood" />
<meta name="keywords-not" content="headache" />
. . . . .
</head>
<body>
<h1>Aspirin Except For Headaches</h1>
<p>. . . .</p>
</body>
Related Actions Items:
No related actions
Related emails:
  1. Re: ISSUE-112 (meta-keywords-not): Chairs Solicit Proposals (from rubys@intertwingly.net on 2010-06-03)
  2. [minutes] HTML WG 20100603 (from plh@w3.org on 2010-06-03)
  3. {agenda} HTML WG telecon 2010-06-03 (from rubys@intertwingly.net on 2010-06-02)
  4. RE: {agenda} HTML WG telecon 2010-05-27 (from adrianba@microsoft.com on 2010-05-27)
  5. Re: {agenda} HTML WG telecon 2010-05-27 (from faulkner.steve@gmail.com on 2010-05-27)
  6. Re: {agenda} HTML WG telecon 2010-05-27 (from laura.lee.carlson@gmail.com on 2010-05-27)
  7. {agenda} HTML WG telecon 2010-05-27 (from rubys@intertwingly.net on 2010-05-26)
  8. {agenda} HTML WG telecon 2010-05-20: Surveys close, Publishing new Working Drafts (from mjs@apple.com on 2010-05-19)
  9. [Bug 6609] negative keywords-not meta tags (from bugzilla@jessica.w3.org on 2010-05-12)
  10. Re: ISSUE-112 (meta-keywords-not): Chairs Solicit Proposals (from mjs@apple.com on 2010-04-29)
  11. RE: {agenda} HTML WG telcon 2010-04-29: Action items, new issues, Task Force reports - minutes of the meeting (from adrianba@microsoft.com on 2010-04-29)
  12. Re: ISSUE-112 (meta-keywords-not): Chairs Solicit Proposals (from julian.reschke@gmx.de on 2010-04-29)
  13. ISSUE-112 (meta-keywords-not): Chairs Solicit Proposals (from mjs@apple.com on 2010-04-28)
  14. {agenda} HTML WG telcon 2010-04-29: Action items, new issues, Task Force reports (from mjs@apple.com on 2010-04-28)
  15. ISSUE-112 (meta-keywords-not): negative keywords-not meta tags [HTML 5 spec] (from sysbot+tracker@w3.org on 2010-04-28)

Related notes:

Closed without prejudice: http://lists.w3.org/Archives/Public/public-html/2010Jun/0056.html

Sam Ruby, 3 Jun 2010, 17:16:33

Display change log ATOM feed


Paul Cotton <Paul.Cotton@microsoft.com>, Maciej Stachowiak <mjs@apple.com>, Sam Ruby <rubys@intertwingly.net>, Chairs, Michael[tm] Smith <mike@w3.org>, Staff Contact
Tracker: documentation, (configuration for this group), originally developed by Dean Jackson, is developed and maintained by the Systems Team <w3t-sys@w3.org>.
$Id: index.php,v 1.323 2013-12-19 14:47:09 dom Exp $