Recently I’ve been working through the various courses on DataCamp. It soon became apparent that this was something that I should have done long ago. It has soon become apparent that it would have saved me a lot of time reading through various blogs and Stack Overflow questions. Read More
In my first land registry post I imported a month’s worth of land registry data, named the rows and had a go at using the ggplot2 package to produce a number of nice looking charts. This time I want to progress a little further. My aims are, using the same dataset to:
- Look at the distribution of prices
- Look at the prices by different factors
- Initially just using factors in the land registry data
One thing I keep on wanting to get around to looking at is what publicly available data is available on the trains in the UK and the answer to a few questions?
- What proportions of the trains in the UK are run by foreign governments?
- At a rough guess it is in the region of 20% based on not very much
- How would this be measured? Passenger miles?
- Why can the French, Dutch and German governments run UK rail franchises but not the UK government? It makes no sense!
- What subsidies are given to the different franchises?
- Are the original tenders released publicly?
- Do the train companies live up to these documents?