El hermano, who is a data monkey by day, has been helping me pull aggregate summaries for different types of crimes, which i've then thrown into a speadsheet to analyze. This post will be a bit of "housekeeping" and methodology, and future posts will be all about what I've found in the data.
For starters, there is a lot of data. It is a bit too much data to be manageable without isolating particular types of crimes or areas of the city. I have decided to focus first on what I hear about the most from friends and in the papers:
- home break-ins
- car break-ins.
I'm breaking the data up by NPUs. The APD zones are too large and there are too many neighborhoods or police beats to be useful. Atlanta neighborhoods are also vastly different sizes, so they aren't great for comparing data against each other. Using NPUs should break the city into manageable chunks of roughly similar sizes, but with small enough areas that we can see meaningful patterns.
Cassie Branum, a grad student at Georgia Tech who I worked with on the ULI Competition earlier this year, is helping out by putting the data into GIS. She is fantastic to work with, and like el hermano is helping me out for free. We'll have some pretty neat maps showing which parts of the city are "hot spots" for various crimes, as well as where which areas have seen large increases or decreases in activity.
Finally, there has been quite a bit of discussion with el hermano about what sort of summary measurements we should be looking at, and how best to present the data. I'd love to hear from you, my readers, about what you'd be most interested in finding out while we are crunching the data.
I foresee this being a weekly or bi-weekly feature, as we work our way through various crime categories. Classes start on Monday, so things will need to be staggered by necessity.