Supporting Research Data Collection from YouTube with TubeKit
Abstract
We present TubeKit, a query-based YouTube crawling toolkit. This software is a collection of tools that allows one to build one's own crawler that can crawl YouTube based on a set of seed queries and collect up to 17 di erent attributes. TubeKit assists in the phases of this process starting with database creation to nally giving access to the collected data with browsing and searching interfaces. We further demonstrate how we used this toolkit to collect elections related data from YouTube for nearly two years. Some analysis of the collected data relating to the elections is also given.
0
Your rating: None (24 votes)

Comments

I've already started tracking a campaign in contextminer.org!

This is SO INTERESTING! What a great tool. I wish I would have had it when completing my dissertation work!

i hear this a lot. i think i need to do a better job getting a word out about such tools. thanks for your interest. it helps to get encouraging words, especially when you're not getting paid for making such things! ;)