Mittwoch, 5. Februar 2003


Screen-Scraping. Simon Willison has pointed to a number of web screen-scraping tools in his weblog. These are useful for two reasons: they can be used to automate web activities that the websites themselves don't easily support; they can also be used as a testing bench for web development.
  • Screen-scraping with WWW::Mechanize. An article by Chris Ball describing a Perl library for screen-scraping.
  • Snoopy is a PHP class that simulates a web browser. It automates the task of retrieving web page content and posting forms, for example.
  • PyWebPerf is a performance measuring tool written in Python which simulates a web browser fetching a page (handles cookies, multiple threads, image and css download). Command-line and cgi interfaces are provided, with a scripting facility coming soon.

[Der Schockwellenreiter]
11:46:58 PM     comment []   [Macro error: Can't evaluate the expression because the name "trackbackLink" hasn't been defined.]