Jon Phipps' NSDL Weblog
Good stuff that NSDLers might find interesting, and an experiment in using weblogs for community building and knowledge transfer.
"Despite some well-meaning attempts to smash everything of interest into the library catalog, the fact remains that most integrated library systems expect MARC records and MARC records only. This means that whatever we want to put into the catalog must be described using MARC and AACR2 (see "Marc Must Die ," LJ 10/15/02, p. 26ff.). This is a barrier to dramatically increasing the scope of a catalog system, even if we decided to do it. How would you, for example, use the Open Archives Initiative Harvesting Protocol to crawl the bibliographic records of remote repositories and make them searchable within your library catalog? It can't be done, and it shouldn't. The library catalog should be a record of a given library's holdings. Period."
5:45:15 PM comment []
"Google and Other Tools Searching the 'blogosphere' has become a pressing need particularly for bloggers who are seeking to find other blogger commentary on a particular topic. Microdoc News discusses a number of different ways to search the blogosphere and provides a good method to achieve this using Google..."
5:30:21 PM comment []
"Teenager, Michael Fagan, completing his last year in High School in Ontario, Canada, has one of the most comprehensive Search Link Compilations on the Internet today. Even though it is listed in the Google Directory as a Link Compilation, it also contains query interfaces that can direct your search query directly to the best full search engines as well as many of the specialist engines and directories. This could well be at great page to make as your home page. We interviewed Michael to find out how this great search page started and where it is going..."
5:28:48 PM comment []
"Our research seeks to bring modern information management and retrieval technologies to the average computer user in order to make computers a more compelling place for users to interact with their information. Haystack looks into the use of artificial intelligence techniques for analyzing unstructured information and providing more accurate retrieval. We also deal with the modeling, management, and display of user data in more natural and useful ways. "
1:41:42 PM comment []
Google Robot Timetable. News on Google Quite frankly, Google times its visits to your site according to the type of site you have and how likely your site is to have changes since its last visit. If you have a site that hardly ever changes, then do not expect to see Googlebot very often. Webloggers, on the other hand, can expect to see Googlebot about as frequently as you update your site. However, since the last update in early April, Googlebot has been a little less active, although still visiting some sites as frequently as everyday. [Elwyn Jenkins: MicrodocHeadlines]
A dynamic reharvest algorithm based on adjusting the frequency of reharvests accordinig to the number of records contained in the last reharvest, combined with a manual "(re)harvest my metadata now" trigger might be a better way to assign reharvest frequency than askiing collections how often they think they should be reharvested.
And why is that? Because most businesses aren't yet smart enough to look beyond the exciting COTS vision and assess whether the system will do the job..."