August 2003
Sun Mon Tue Wed Thu Fri Sat
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31            
Jul   Sep


Archives

Blogroll


Subscribe to "Dev" in Radio UserLand.

Click to see the XML version of this web page.



Click here to send an email to the editor of this weblog.
 
 Saturday, August 23, 2003
Scraping HTML with curl, tidy, and XSL
Continuing with making it easier for "Big Pubs" to create RSS feeds. I'm assuming that they have a publishing system, but it wasn't built with RSS in mind, but they want on the bandwagon.

Using curl, tidy, and XSL to scrape content from HTML pages into an RSS feed. This is basically what I do now with a half-baked Java app using JTidy, XPath, and BeanShell. I keep meaning to release it, but it’s too embarassing to share so far. Yet, it’s been working well enough to scrape what sites I’m interested in such that I haven’t been too motivated to tidy it up and tarball it. One thing I like better about Bill Humphries’ approach, though, is that it doesn’t use Java :)
[0xDECAFBAD
2:14:32 PM      comment []   trackback []  



Bindows
Erik Arvidsson, the DHTML guru behind WebBoard and WebFX, revealed what he had been working on since last year: Bindows.  Bindows is a DHTML framework that emulates Swing/WinForms UI, similar to what Convea and Oddpost.  I am not sure yet, but Bindows seems to use XML to define its GUI.  It seems pretty slow though.  I suspect that most, but not all, of the slow speed is due to the server-side misdesigns.
[Don Park's Daily Habit
1:35:17 AM      comment []   trackback []  



Practical mod_perl: Chapter 6: Coding with mod_perl in Mind. Pt. 4
The following is the conclusion of our series of excerpts from Chapter 6 of the O'Reilly title, Practical mod_perl. (O'Reilly)
[WebReference News
12:41:21 AM      comment []   trackback []  



Object Sniffing New Browsers, Part 3: Opera
In this series we've looked at how you can use object detection to distinguish between various browsers and tailor your Web code to take advantage of their individual features (or lack thereof). In this last article, we look at the other major browser out there, namely Opera. (By Keith Schengili-Roberts)
[WebReference News
12:40:29 AM      comment []   trackback []  



XML machine the successor to von Neumann
Really bring data and programs together.
(The Register) [via Der Schockwellenreiter
12:31:05 AM      comment []   trackback []  



Understanding Web Services [via Der Schockwellenreiter
12:04:56 AM      comment []   trackback []  



Animated Population Pyramid for England and Germany
SVG, JavaScript and SMIL [via Der Schockwellenreiter]

When the inimitable Schockwellenreiter raves about a cool implementation of anything, it pays to take look...

Now if only it wouldn't crash Safari, Firebird, Camino, Mozilla, IE, iCab... (only OmniWeb doesn't barf - but it doesn't work with it either) :-( 
12:00:17 AM      comment []   trackback []