blogchalking
Spidering weblogs, looking for chalk.

 






Blogchalk:
Portsmouth, NH, US





 

 

  Wednesday, July 24, 2002

Some Blogchalk Data

The phenomenon seems to be growing...

In the database currently:

  • there are 2821 total weblogs (this counts unique titles)
  • 1251 of those are as-yet unspidered

Of the 1570 that have been spidered:

  • 28 had parse failures (I'm using Python's htmllib)
  • 501 contain a META keywords tag
  • 122 contain a META keywords tag containing "blogchalk".
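
For the curious, the META keywords check can be sketched roughly as follows. This is not the spider's actual code: the spider uses htmllib, which only exists in Python 2, so this sketch swaps in html.parser from the Python 3 standard library. The names MetaKeywordsParser and classify are made up for illustration, and the case-insensitive substring match on "blogchalk" is an assumption about how the counting works.

```python
from html.parser import HTMLParser


class MetaKeywordsParser(HTMLParser):
    """Collect the content of every <meta name="keywords"> tag on a page."""

    def __init__(self):
        super().__init__()
        self.keywords = []  # one entry per META keywords tag found

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag != "meta":
            return
        # Attribute values may be None (e.g. <meta name>), so default to "".
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() == "keywords":
            self.keywords.append(attr.get("content", ""))


def classify(html_text):
    """Return (has_keywords_tag, has_blogchalk) for one page of HTML.

    Hypothetical helper: the substring test is an assumed matching rule.
    """
    parser = MetaKeywordsParser()
    parser.feed(html_text)
    parser.close()
    has_keywords = bool(parser.keywords)
    has_blogchalk = any("blogchalk" in k.lower() for k in parser.keywords)
    return has_keywords, has_blogchalk


if __name__ == "__main__":
    page = '<head><META NAME="keywords" CONTENT="Blogchalk: Portsmouth, NH, US"></head>'
    print(classify(page))
```

One difference worth knowing: html.parser in current Python 3 is lenient and does not raise on malformed markup, so a "parse failures" bucket like the one above would need a different definition with this approach.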

I exchanged some email today with Daniel Padua about a few things. Coming soon:

  • A search function.
  • An out-of-page data format (XML). This will make the database more capable. www.blogchalking.tk will probably generate the necessary code for you, as it does now.

If anyone has experience writing search functionality and would like to lend a hand, please let me know. Otherwise, I'll just hack together something that works, no matter how ugly.


6:55:00 PM    
categories: blogchalking



© Copyright 2002 Brian St. Pierre.
Last update: 8/6/2002; 2:47:58 PM.
