Researchers Using Spinn3r « Kevin Burton's NEW FeedBlog

Data and Features. The data set we used for this paper is the spinn3r.com blog data set from Nov. 2007 until Nov. 2008. This data set includes practically all the blog posts published on the webin this period (approximately 1.5 TB of compressed XML ). … We periodically pinged the online api for the current dataset of all the rss feeds . Although we had different domains that were provided to us, we chose the political domain for consistency with our other results. …

Read the rest here:
Researchers Using Spinn3r « Kevin Burton's NEW FeedBlog

Leave a Reply