If you have come here, then you probably already know that statistics can (and often do) lie. This is not the first web site to examine this topic. I took a more than a little inspiration from the excellent Spurious Correlations website, but I wanted to use data which was a bit more relevant to the British public, and anyway, I just fancied seeing if I could do it.
A few notes on the data itself: The data is organised by category, and data is only correlated against data from other categories to reduce the number of boring correlations. So far, I have got all of the data from freely available sources on the web. In a lot of cases, I found out that less is more. I started with a few hundred timeseries of Visitor Attraction data and lots of Weather data broken down by region, but due to the magic of statistics, the correlations between these data sets overwhelmed the more interesting ones, so I took them out. I would like to include more "Quacks and Quackery" data, so if you know of any reliable sources, please do let me know (contact details are at the bottom of the page). Also a note of thanks to Alan Henness at Zeno's Blog, whose work I shamelessly reused in order to compile the homeopathy datasets. Final point about the graphs: It's probably best not to link directly to them for now. Every now and then I reload all of the data and the Urls may change.
Some new notes on the data as a result of feedback given to me by various friends and colleagues: Thanks to Derek for pointing out the difference between negatively correlated (-1) and uncorrelated (0). Duly noted and corrected! Also thanks those who pointed out that the weather data doesn't show the absolute maximums or minimums, but the average maximums or minimums. You can read a lot more about all of that in the Met Office Website Archives if you are interested.
The site is a typical 'lamp' setup, coded in PHP and MySQL and runs on a Linux server. The client side graphs were produced using JQPlot and the server side graphs (used in Facebook shaing) were produced using pChart. At the moment, there is no advertising, but if it gets a lot of traffic, I might add some banners, just to pay for the hosting.
You can contact me via firstname.lastname@example.org, sorry but I can't promise to answer every email, but will try.
A list of the main data sources used and where to find them is below.
|Office for National Statistics||https://www.ons.gov.uk/|
|National Records of Scotland||https://www.nrscotland.gov.uk/|
|Northern Ireland Statistics and Research Agency||https://www.nisra.gov.uk/|
|Association of Leading Visitor Attractions||http://www.alva.org.uk/|
|British Beer and Pub Association||http://www.beerandpub.com/|
|UK Gambling Commission||http://www.gamblingcommission.gov.uk/|
|British Bottled Water Producers (Zenith International)||http://www.britishbottledwater.org/|
|Department of Energy & Climate Change||http://www.gov.uk/government/organisations/department-of-energy-climate-change|
|Office of National Statistics||https://www.ons.gov.uk/|
|Mark Pack - Pollbase||https://www.markpack.org.uk/opinion-polls/|
|National Records of Scotland and Scottish Government (post 2011)||https://www.gov.scot|
|British Film Institute||https://www.bfi.org.uk/education-research/film-industry-statistics-research|
|General Chiropractic Council||http://www.gcc-uk.org/|
|Health and Social Care Information Centre||http://www.hscic.gov.uk/|
|British Acupuncture Council||https://www.acupuncture.org.uk|