{"id":228,"date":"2011-05-07T14:13:16","date_gmt":"2011-05-07T21:13:16","guid":{"rendered":"http:\/\/themanwhosoldtheweb.com\/blog\/?p=228"},"modified":"2011-05-08T12:38:57","modified_gmt":"2011-05-08T19:38:57","slug":"autoscaling-with-wikipedia-and-dbpedia","status":"publish","type":"post","link":"http:\/\/themanwhosoldtheweb.com\/blog\/2011\/05\/autoscaling-with-wikipedia-and-dbpedia\/","title":{"rendered":"Learn to autoscale with the best source of information, Wikipedia."},"content":{"rendered":"<p>Wikipedia is such a great source of information.\u00a0 Though originally controversial as a source of legitimate information, it has becoming increasingly accepted as a reliable source.\u00a0 In fact, in my day job of business consulting for companies (including Fortune 50 organizations), Wikipedia is one of the first places I check when conducting research.<\/p>\n<p>However, if you have a website, and would like to automate the process of pulling data from Wikipedia, it&#8217;s not a simple task.\u00a0 You will need a very sophisticated scraper.\u00a0 Wouldn&#8217;t it be convenient if you could just query Wikipedia like a database?<\/p>\n<p>Well, it seems like you actually can&#8230; with the help of <a href=\"http:\/\/wiki.dbpedia.org\/\">DBpedia<\/a>.<strong><\/strong><!--more--><\/p>\n<p><strong>What is DBpedia?<\/strong><\/p>\n<p>As explained on its site (<a href=\"http:\/\/wiki.dbpedia.org\/\">http:\/\/wiki.dbpedia.org\/<\/a>), DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. We hope this will make it easier for the amazing amount of information in Wikipedia to be used in new and interesting ways, and that it might inspire new mechanisms for navigating, linking and improving the encyclopaedia itself.<\/p>\n<p>As of January 2010, the DBpedia data set contains <span style=\"text-decoration: underline;\">3.5 million &#8220;things&#8221; with over half a billion &#8220;facts<\/span>.&#8221;\u00a0 That&#8217;s certainly not a bad source for your autoscaling needs!<\/p>\n<p>You can download the data sets from DBpedia here:<br \/>\n<a href=\"http:\/\/wiki.dbpedia.org\/Downloads36\">http:\/\/wiki.dbpedia.org\/Downloads36<\/a><\/p>\n<p>As a disclaimer, I have not personally used DBpedia.\u00a0 But, after just spending a few moments browsing the site, I&#8217;m conjuring up a number of ideas for some autoscale, autopilot, value-add sites.<\/p>\n<p>Whats thoughts have you conjured up?<\/p>\n<p><em><strong>dave<\/strong><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Wikipedia is such a great source of information.\u00a0 Though originally controversial as a source of legitimate information, it has becoming increasingly accepted as a reliable source.\u00a0 In fact, in my day job of business consulting for companies (including Fortune 50 organizations), Wikipedia is one of the first places I check when conducting research. However, if [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[27,90,50],"class_list":["post-228","post","type-post","status-publish","format-standard","hentry","category-autoscale","tag-autoscale-2","tag-dbpedia","tag-wikipedia"],"_links":{"self":[{"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/posts\/228","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/comments?post=228"}],"version-history":[{"count":4,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/posts\/228\/revisions"}],"predecessor-version":[{"id":233,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/posts\/228\/revisions\/233"}],"wp:attachment":[{"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/media?parent=228"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/categories?post=228"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/themanwhosoldtheweb.com\/blog\/wp-json\/wp\/v2\/tags?post=228"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}