This script looks for the wikipedia network around a certain topic. It first finds the outlinks from and inlinks to that page in the wikipedia.org domain. It will use the common list as startingpoints and calculate the nr of pages that link to each page thus found. You will notice this has a long tail distribution. For all pages with an nr of links > the 'long tail cut-of point' their outlinks are retrieved and the shared links are calculated again for that set of links. This proces is repeated for crawlDepth -1 times.
Warning: crawling is exponential! More and more links are found each time. A page is fetched once each second. Even a small network takes a looooong time.
Tips: exclude dates and categories. ctrl-f on iteration to get to the results quickly
Note: Notice that only for getting the first startingpoints 'what links here' is used. We might wanna use it for the other pages as well - instead of just counting shared (out)links. In doing so we would get shared (out)links of which we are sure they also have an inlink to a previous iteration. The implications of both methods need to be thought through.