Guido Casper's Radio Weblog :
Updated: 03.01.2004; 11:47:29.

 



















Tuesday, December 30, 2003
Distributed Internet search architecture

The number of pages on the Internet is still growing exponentially, and more and more people are realizing that knowledge (and the ability to learn) is the ultimate business asset, that data != information, and that information != knowledge. As a result, search is becoming the most prominent problem domain of the Internet. Even Microsoft seems to have become interested lately in building its own Internet-scale search engine.

Several approaches to the semantic web claim to know how to build the ideal metadata-based search infrastructure. As Tim Bray's excellent series on search taught me, there is always a trade-off between having the desired set of metadata and actually getting people to collect it. (In case you didn't know: being able to search is the only reason to collect document metadata, given that navigation is a special discipline of searching :-).)

Google's major achievement is producing useful page ranks without requiring anyone to collect explicit metadata.

However, having a single giant index for the whole Internet seems flawed from the start, so I am happy to see people stepping forward and suggesting distributed search architectures for the Internet. What I don't like about this proposal is that it uses SOAP. Tim Bray's proposal immediately came to mind; it somehow appears to be the most suitable for that kind of architecture.
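
To make the idea of a distributed index a bit more concrete, here is a minimal Python sketch of a query being fanned out to several independently owned indexes and the hits being merged afterwards. It is only an illustration under my own assumptions: it is neither the SOAP-based proposal nor Tim Bray's design, and `LocalIndex`, `federated_search`, and the raw term-count scoring are hypothetical names and choices made up for this example.

```python
from collections import Counter, defaultdict


class LocalIndex:
    """A tiny inverted index owned by a single site (hypothetical)."""

    def __init__(self, name):
        self.name = name
        # term -> Counter mapping each URL to how often the term occurs in it
        self.postings = defaultdict(Counter)

    def add(self, url, text):
        """Index a document by splitting its text on whitespace."""
        for term in text.lower().split():
            self.postings[term][url] += 1

    def search(self, query):
        """Return a Counter of URL -> summed term counts for the query."""
        scores = Counter()
        for term in query.lower().split():
            for url, count in self.postings.get(term, {}).items():
                scores[url] += count
        return scores


def federated_search(indexes, query, limit=10):
    """Ask every index separately, then merge the hits into one list.

    No index ever sees the documents of another one; only the
    (score, index name, URL) triples travel back to the caller.
    """
    merged = []
    for index in indexes:
        for url, score in index.search(query).items():
            merged.append((score, index.name, url))
    # Highest score first; ties are broken by URL for a stable order.
    merged.sort(key=lambda hit: (-hit[0], hit[2]))
    return merged[:limit]
```

Note that the sketch simply compares raw term counts across indexes. In a real setting the scores of independently maintained indexes would probably not be directly comparable, which is one of the harder parts of merging distributed results.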


7:39:59 PM    

© Copyright 2004 Guido Casper.


