Updated: 2003-03-05; 12:48:21 p.m.
Python Community Server: Noteblog
Comments and opinions from the coder behind the Python Community Server. Techy types might want to check out the development blog for more software-related talk.

Check out CommunityServerWiki for discussion on PyCS and other community servers!
        

Tuesday, 4 February 2003

Pete Cole responds to Don Park:

Finding Hub Blogs. Blogspace needs a directory of hub blogs organized by topics.  Recognizing a hub blog is easy enough.  A hub blog has a large number of subscribers and links to other blogs.  A hub blog's topics can be either inferenced by content and links or specified explicitly. [Don Park's Blog]

Maybe I've missed something but it does surprise me that the blog crawlers (e.g. Technorati) out there don't seem to do anything like this. The 'citation analysis' algorithm proposed by Jon Kleinberg would be highly usable here, though it would require a fair number of tweaks to sort out outbound links and inbound links to a 'blog' rather than an individual blog page - but results from this algorithm can be impressive.

This is a cool idea - something I haven't even considered doing for the ecosystem, but it would be very sensible.  If anyone wants to have a crack at it (all the ecosystem data is available for research etc), do get in touch and tell me how you went.
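
For anyone who does want to have a crack at it, here is a rough sketch of the sort of thing I imagine, assuming the algorithm Pete means is Kleinberg's hubs-and-authorities scheme (often called HITS). The `hits` and `normalise` functions, the `links` dictionary and the blog names are all made up for illustration; this is not the ecosystem's actual data format, and it assumes page-level links have already been collapsed to one entry per blog, which is the tweak Pete mentions.

```python
from math import sqrt


def normalise(scores):
    """Scale scores so their squares sum to 1 (leave all-zero scores alone)."""
    norm = sqrt(sum(v * v for v in scores.values())) or 1.0
    return {name: v / norm for name, v in scores.items()}


def hits(links, iterations=50):
    """Return (hub, authority) score dicts for a blog-level link graph.

    links is a hypothetical structure: a dict mapping each blog name to the
    set of blogs it links to.
    """
    blogs = set(links)
    for targets in links.values():
        blogs.update(targets)

    hub = {b: 1.0 for b in blogs}
    auth = {b: 1.0 for b in blogs}

    for _ in range(iterations):
        # A blog's authority is the sum of the hub scores of blogs linking to it.
        new_auth = {b: 0.0 for b in blogs}
        for src, targets in links.items():
            for dst in targets:
                new_auth[dst] += hub[src]

        # A blog's hub score is the sum of the authority of the blogs it links to.
        new_hub = {b: sum(new_auth[d] for d in links.get(b, ())) for b in blogs}

        auth = normalise(new_auth)
        hub = normalise(new_hub)

    return hub, auth


# Made-up example: one blog that links to everyone, and one that everyone links to.
example_links = {
    "linkyblog": {"alice", "bob", "carol"},
    "alice": {"bob"},
    "bob": set(),
    "carol": {"bob"},
}

if __name__ == "__main__":
    hub, auth = hits(example_links)
    for name in sorted(hub, key=hub.get, reverse=True):
        print(name, round(hub[name], 3), round(auth[name], 3))
```

The blogs with the highest hub scores would be the candidates for a hub directory; sorting them into topics is a separate problem that this sketch doesn't touch.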


9:58:54 AM

Sender Traumwind: Cherokee is a tiny, ultrafast, lightweight Web server. It is implemented entirely in C, and has no dependencies beyond a standard C library. It provides only the most basic HTTP functionality, but is extremely fast and small.

9:56:29 AM

© Copyright 2003 Phillip Pearson.
 