Have been spending some time researching on using Lucene, the search engine from Jakarta project. One of the things that I have not been particularly successful at achieving was to index into MS Word documents. Good thing that I subscribed to the lucene users mailing list, which led me to this page:
http://www.textmining.org/modules.php?op=modload&name=Downloads&file=index&req=viewdownload&cid=2
Downloaded but have yet to try the API. Will update when it is ready
1:24:46 AM
|