Updated: 10/23/2002; 11:55:12 PM.

Howard's Musings
Wherein we learn of Howard's mind


daily link  Monday, August 05, 2002


Paul Andrews: Microsoft to give PCs a little Google

Longhorn promises to give Microsoft a powerful edge in search and all that it signifies for advertising, subscriptions and user allegiance. But it may open up an opportunity for Google as well. Leveraging the lingua franca of XML in Longhorn, Google could develop a "bot" or utility that, downloaded onto your PC, would provide Google's features for your hard disk.

To me, this begs the question: how do you rank the hits? The most important feature of Google is its accuracy. Accuracy comes from page rank, which comes from a complex soup of update frequency, incoming links, and a potpourri of other factors.

Now shift gears to your hard drive. To be sure, there's a lot of great stuff in there. In a day, I could probably write code to index all of the text and pdf files on a chosen hard drive. In a week, I could probably write something to index MS Office files (piggybacking on Microsoft's Indexing service built into IIS). It would be brute force and ugly, but it would work.

Searching the data soup would constitute another relatively trivial task, at least for implied and searches, where all search terms had to be there, similarly for implied or searches, where one of the search terms had to be there.

Now that I think about it, Zope has a cross-platform catalog/index/search function built in. I'd read through that code and "leverage" it.

Sorting presents a major problem. Most recently updated or created first? Most instances of the search terms? Hyperlinking does not exist in the world of your documents, so it can't help. You could build its intelligence by pre-ranking some of your files and folders as more important than others, but we don't want to go there needlessly.

Historical Note: When I worked at MSFT (97-99), we all knew about this issue. Too bad they went for an Apple solution (i.e., waiting for perfection), rather than doing it quick-and-dirty.   


1:21:41 PM  comment []  permalink  

testing testing   

1:00:06 PM  comment []  permalink  

 
August 2002
Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
Jul   Sep
 
Categories:
 
Blogs:
 
Reference:
 
Sites I Manage:
OH
MT
Subscribe to "Howard's Musings" in Radio UserLand.

Click to see the XML version of this web page.

Click here to send an email to the editor of this weblog.
Click here to visit the Radio UserLand website.

jenett.radio.simplicity.1.3R

Howard/Male/36-40. Lives in United States/Seattle/Greenlake and speaks English. Spends 60% of daytime online. Uses a Fast (128k-512k) connection.
Google! DayPop! This is my blogchalk: English, United States, Seattle, Greenlake, Howard, Male, 36-40!


Now Playing:



Copyright 2002 © Howard Hansen.
Last update: 10/23/2002; 11:55:12 PM.