Mittwoch, 5. Februar 2003


Screen-Scraping. Simon Willison has pointed to a number of web screen-scraping tools in his weblog. These are useful for two reasons: they can be used to automate web activities that the websites themselves don't easily support; they can also be used as a testing bench for web development.
  • Screen-scraping with WWW::Mechanize. An article by Chris Ball describing a Perl library for screen-scraping.
  • Snoopy is a PHP class that simulates a web browser. It automates the task of retrieving web page content and posting forms, for example.
  • PyWebPerf is a performance measuring tool written in Python which simulates a web browser fetching a page (handles cookies, multiple threads, image and css download). Command-line and cgi interfaces are provided, with a scripting facility coming soon.

[Der Schockwellenreiter]
11:46:58 PM     comment []     

 


Zensur. Jürgen Büssow, der Provinz-Politiker mit den Sperrungsverfügungen an Internet-Provider, wurde letztens beim WDR in einer Kolumne von Jörg Schieb kritisch beleuchtet. Schieb verwies, meines Erachtens zu Recht, darauf, daß diese Sperrungsverfügungen Unsinn sind. Wenig später war der Artikel offline. Odem.org hat in dieser Sache noch einige Hintergründe parat. [Der Schockwellenreiter]
11:44:40 PM     comment []     
 


Treasures of the World | Guernica [Daypop Top News Stories]
8:21:56 PM     comment []