|
Tuesday, May 13, 2003
|
|
|
"While the Web is widely recognized as an amazing source of unstructured data, much of this data is rather difficult to search and navigate. In fact, the Web was organized along a model meant for human consumption, not for optimizing machine searches. HTML (hypertext markup language), the means by which hypertextual information is organized for the Web, is a presentation language. It concerns itself with the appearance of data rather than its underlying structure. This makes the process of extracting content from the Web a daunting task for automatic processors. For this reason, there's a widespread sense that unstructured data on the Web represents a great untapped value..."
6:07:13 PM
|
|
|
|
© Copyright
2003
Jon Phipps.
Last update:
6/10/2003; 8:51:43 PM.
|
|
|