Date Code Web Pages | |||||||||||||||||
Manually add a date code to Web pages for Google use. Date range searching using the Google search engine is not effective because the date stamp attached to each Web page has little value. The date stamp is attached to each page by Google and is simply the indexing date. This date is often renewed automatically several times a year by the Google indexer software and in each case, the date is updated. If each web page owner would periodically include an official, Google-published, date code on the header of each page on his web site each month, this would serve as a valid search term for date range searching. If this were done, in many cases, owners would look at each page they date stamp and decide if the information is currently valid and if publication of the page should continue. Once this practice starts to gain popularity, owners of Web pages would begin to realize that most people are only searching for and reaching pages with this new date stamp and then page owners would have the incentive to date stamp each of their pages according to this new system. To prove the ineffectiveness of the current Google date range search situation, do the following: 1) Open Google and then do an Advance Search. In the pane opposite "this exact phrase" enter "liederkranz cheese". Then click on the Google Search button. If you had done this test on April 11, 2004 you would have gotten 129 hits. Now:2) Repeat the above Advance Search with this additional selection criteria: In the pane opposite the phrase "Return web pages updated in the" select "past 3 months." If you had done this test on April 11, 2004 you would also have gotten the same number of hits--129. It should be noted here that liederkranz cheese is an extinct product and virtually no news has developed on this product in several years. Here's how the date coding could work. Google would publish the date code for the month in some sort of nonsense string like "2wjjq" in some area of their home page and clearly indicate that this is the date code for the month. Google would also configure the date coding so that a computer could not simply read it each month and install it in the Web pages. They would do this by using varied placement of the code and by varying the language in which it is stated. Refer to the book "Google Hacks" published by O'Reilly, pages 34 to 37, for additional information on date range searches. Maybe Google can change the design of their indexer so that date range searches will be more effective. Until and unless that time occurs, my proposed system above would make date range searching effective.
dawesley, Apr 12 2004
What do you think of this idea or comment? | |||||||||||||||||
Users who liked this idea also liked: | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
Add your comment
I like the idea but think that Google should give the date based on a set of more general considerations, from worst to best.
1) Creating date -- If nothing else, when the page first "appeared"
2) IE Properties -- In IE's properties, I get a Modified date, if that is readable by IE, then lets use that as better than "1)"
3) Creator's date statement -- Personally, I'd permit the page creator to set the date of his page, and to have this override the automatic dates. 1)" or "2)"
I suspect that I and the idea's originator will disagree on whether the creator should be able to automatically update this date, but I would leave this up to the page's author unless a vewer could use 1,2,3 or specify that they only wanted to search for 4.
Note: I'd also like all pages that Google shows me to be able to have their date stamps shown. I might not want to search on the date, but often want to drill down to new pages.
Thanks -- Good idea.
How many different file systems exist? 4 or 5? Google could add code to their indexer to get both the created date and modified date from the metadata of any stored file system.
The problem exists with pages which are created on the fly where different bits of content are taken from different database fields or files. Which field/file is considered to be the created/modified date.
I think that relying on content authors to create this information is unrealistic, there has to be a method to automate this for all types of content.
I think one of the biggest problems with the Web is that web pages don't have visible (and searchable) date stamps by default. The people who invented the Web were smart; why didn't they fix this problem from day one? The fact that Google can't do a date search that means anything is a side effect of this design fault with the Web. The web is full of out of date material. Several times a month I see web pages that say something is "upcoming" when the date is more than 6 months old. Or worse, they say something is "upcoming" and don't state the year, so you think it's a year later than it actually is. When the web first came out, I saw "Last updated" dates on most web pages. I am seeing fewer of these now. Could it be that webmasters don't want users to know how out of date their web pages are?