CrimeStat is a Windows-based spatial statistic software program that conducts spatial and statistical analysis and is designed to interface with a Geographic Information System. The program is developed by Ned Levine & Associates, with funding by the National Institute of Justice (NIJ). The program and manual are distributed for free by NIJ.
CrimeStat performs spatial analysis on objects located in a GIS. The objects can be points (e.g., events, locations), zones (e.g., blocks, traffic analysis zones, cities) or lines (e.g., street segments). The program can analyze the distribution of the objects, identify hot spots, indicate spatial autocorrelation, monitor the interaction of events in space and time, and model travel behavior. There is a regression module for non-linear spatial modeling. Some of its tools are specific to crime analysis. Others can by applied in many fields. There are 55 statistical routines in the program.
CrimeStat has been developed since the mid-1990s. The first prototype was a Unix-based C++ program called Pointstat that was developed to analyze motor vehicle crashes in Honolulu. In 1996, the National Institute of Justice funded the first version of CrimeStat and the early Pointstat routines were folded into the program.
- The first version (1.0) was released in August 1999.
- Release 1.1 (July 2000)
- Version 2.0 (May 2002)
- Version 3.0 (November 2004)
- Release 3.1 (March 2007)
- Release 3.2 (October 2009)
- Release 3.3 (July 2010)
The program is divided into three main parts:
- Data Setup
- Statistical Routines
CrimeStat can input data both attribute and GIS files but requires that all datasets have geographical coordinates assigned for the objects. The basic file format is dBase (dbf) but shape (shp), and Ascii text files can also be read. The program requires a Primary File but many routines also use a Secondary File. CrimeStat uses three coordinate systems:
- Spherical (longitude, latitude)
- Directional (angles).
Distance can be measured as direct, indirect (Manhattan) or on a network (which also allows travel time or speed to be used). Distance units are decimal degrees for spherical coordinates and feet, meters, miles, kilometers, or nautical miles for projected coordinates. The program can create reference grids. Several routines also use the area of the geographical region for their calculations.
The statistical routines are grouped into three categories
- Spatial description
- Spatial modeling
- Crime travel demand.
The spatial description routines include:
- Spatial distribution statistics (mean center, standard deviation ellipse, center of minimum distance, median center, directional mean, convex hull)
- Spatial autocorrelation statistics for zonal data (Moran’s “I”, Geary’s “C”, Getis-Ord Global “G”, Moran correlogram, Geary correlogram, Getis-Ord correlogram),
- Distance-based statistics among points (nearest neighbor analysis, Ripley’s “K”, the allocation and summation of Primary File points to Secondary File points, and various distance calculation matrices), and
- Hot spot statistics for points, zones or lines. CrimeStat has eight routines for hot spot identification:
- Mode (points and lines)
- Fuzzy mode (points and lines)
- Nearest neighbor hierarchical clustering (Nnh; points and lines)
- Risk-adjusted nearest neighbor hierarchical clustering (Rnnh; points and lines)
- The Spatial and Temporal Analysis of Crime routine (STAC)) (points and lines)
- K-means clustering (points and lines)
- Anselin’s Local Moran (zones)
- Local Getis-Ord “G” (zones).
The spatial modeling routines include:
- Single kernel density interpolation for examining variation over a region of a single variable
- Dual kernel density interpolation of two variables (e.g., a set of events in relation to a population ‘at risk’)
- Head Bang routine for smoothing zonal data
- ”Interpolated Head Bang surface that interpolates the Head Bang estimates to a grid
- ”Knox and Mantel indexes that identify the interaction between space and time in events
- Correlated Walk Analysis, based on random walk theory, for modeling the sequential behavior of a serial offender in space and time and makes a prediction about the next event
- Journey-to-crime analysis for modeling the likely origin of a serial offender based on the location of prior events committed by the offender (geographic profiling)
- Bayesian Journey-to-crime which is an empirical Bayes method that integrates the Journey-to-crime estimate with information on the residence location of other serial offenders who committed crimes in the same places to produce an updated estimate. The diagnostic routine compares this estimate with its components in predicting the residence location for multiple serial offenders
- Bayesian Journey-to-crime estimation which applies the Bayesian Journey-to-crime method to estimate the location of one serial offender
- Spatial regression. The models include Ordinary Least Squares and Poisson-based, the latter of which are more appropriate for skewed data such as crime. Currently, there are Poisson models for MLE Poisson, MLE Poisson with Linear Correction (NB1), MLE Poisson with Gamma dispersion (negative binomial), MCMC Poisson-Gamma, and MCMC Poisson-Gamma with a conditional spatial autoregression (CAR) adjustment.
The Crime Travel Demand module models crime travel over a metropolitan area. It is an application of travel demand modeling to crime or other rare events.. The purpose is to calibrate the travel behavior of a large number of offenders in committing crimes as a basis for modeling alternative interventions by law enforcement The data required must include a large number of events where both the crime location and the origin (residence) location of the offenders are known and are allocated to zones (e.g., blocks, traffic analysis zones).
All the routines in CrimeStat can be estimated either with distance (direct or indirect/Manhattan) or on a network using actual impedance estimates (e.g., travel time or speed).
CrimeStat has three different types of output:
- Screen output that displays the results once the calculations are finished. These can be saved to a text file.
- Non-graphical output for many routines in either dBase DBF or Ascii text format.
- Graphical output for many routines to allow the calculated objects to be displayed in a GIS. Currently, the graphical output formats include Esri SHP, MapInfo MIF/MID, Surfer for Windows DAT, and Ascii text formats.
All the routines are documented and illustrated in a manual. The current version (3.3) includes 17 chapters from version 3.0 and two update chapters. As with the program, the manual is distributed for free .
Unlike some other spatial statistics programs, CrimeStat has no mapping capabilities and must be used with GIS software. Some users have found that the GUI interface is difficult to understand and inconsistent between routines. Because CrimeStat analyzes points in most routines, its results are not always consistent with those of software that analyzes areas (Eg. GeoDa). Finally, while the manual is comprehensive and well written, its size may be daunting to new users of spatial statistics.
Ancillary CrimeStat Development
In addition to the development of the CrimeStat program, all the routines through version 2.0 plus the spatial autocorrelation routines have been converted into .NET libraries for use in third-party applications. Version 1.0 of the CrimeStat Libraries was released in August 2010 and is available on the CrimeStat web page.
NIJ has also run CrimeStat training courses for crime analysts. These are done on a periodic basis. At the NIJ Crime Mapping Research Conferences, held approximately every year and a half, workshops are conducted on various CrimeStat topics.
In addition to the CrimeStat program, NIJ has sponsored the development of a CrimeStat III User Workbook for crime analysts  and is currently developing a CrimeStat Analyst program that implements the most basic CrimeStat routines.
Reviews and Examples
Use of CrimeStat by Baltimore County Police Analysts
Baltimore County Police analysts use CrimeStat to perform various spatial analytics. The primary responsibility of police analysts in Baltimore County is to identify and address existing or anticipated crime problems. Police analysts use “hot spot analysis” in CrimeStat to identify areas within the county having high concentrations of crime. Another example demonstrating the use of CrimeStat involves the department’s Data Driven Approaches to Crime and Traffic Safety (DDACTS). Police analysts used Nearest Neighbor Hierarchical Spatial clustering to identify areas having high concentrations of crime and traffic accidents. Analysts found that the two cluster groups, crime and accidents, did tend to overlap in many areas of the county. The County’s DDACTS program was initiated to increase police presence in the target areas. Preliminary results have been encouraging, with most targeted crimes and traffic accidents dropping in DDACTS areas. The Department’s DDACTS program has since become a model nationwide with the support of the National Highway Traffic Safety Administration. Finally, police analysts have used CrimeStat’s Journey to Crime and Bayesian Journey to Crime Estimation models to successfully identify a serial offender’s activity space. Once an offender’s activity space has been identified, police analysts will examine information captured from other police sources such as traffic stops, Field Interview Reports, and License Plate Readers to determine if a contact was made with a potential offender. Police have also used CrimeStat’s Crime Travel Demand model to identify road networks used by drivers under the influence (DUI). Roadways identified by the Crime Travel Demand model were targeted for interdiction programs by the department’s DUI Enforcement Team. Similar weighted road networks have been used in conjunction with Journey to Crime models to improve identification of an offender’s activity space.
- ^ Levine, N. "Spatial statistics and GIS: software tools to quantify spatial patterns", Journal of the American Planning Association. 1996. 62 (3), 381-392.
- ^ Levine, N., Kim, K. E., & Nitz, L. H. (1995). "Spatial analysis of Honolulu motor vehicle crashes: I. Spatial patterns". Accident Analysis & Prevention, 27 (5), 663-674.
- ^ Block, C. R. (1994). "STAC hot spot areas: a statistical tool for law enforcement decisions". In Proceedings of the Workshop on Crime Analysis Through Computer Mapping. Illinois Criminal Justice Information Authority: Chicago, IL.
- ^ Block, R. and Block, C. R. (1995). "Space, place and cime: hot spot areas and hot places of liquor-related Crime". In Eck, J. E. & Weisburd D. (eds.), Crime and Place. Crime Prevention Studies, Volume 4. Criminal Justice Press: Monsey, NY. 147-185.
- ^ Levine, N. (2008). “The ‘hottest’ part of a crime hotspot: Comments on “The utility of hotspot mapping for predicting spatial patterns of crime” by Chainey, Tompson, L. and Uhlig, S.”. Security Journal, 21, 295-302
- ^ Mungiole, M. Pickle, L. W., & Simonson, K. H. (2002). Application of a weighted Head-Banging algorithm to Mortality data maps, Statistics in Medicine, 18, 3201-3209.
- ^ Mungiole, M. & Pickle, L. W. (1999). Determining the optimal degree of smoothing using the weighted head banging algorithm on mapped mortality data, In ASC '99 Leading Survey & Statistical Computing into the New Millennium, Proceedings of the ASC International Conference, September. 
- ^ Levine, N. & Block, R. (2011). “Bayesian Journey to crime estimation: an improvement in geographic profiling methodology”. The Professional Geographer, 63(2), 1–17.
- ^ JIP-OP (2009). Articles by Levine, N., Canter, D., Block, R., Bernasco, W., Leitner, M., Kent, J., Lee, P., and O’Leary, M., Special issue on Bayesian Journey-to-crime modeling. Journal of Investigative Psychology and Offender Profiling, 6 (3).
- ^ Cameron, C. A. & Trivedi, P. K. (1998). Regression Analysis of Count Data. Cambridge University Press: Cambridge, U.K.
- ^ Hilbe, J. M. (2008). Negative Binomial Regression (with corrections). Cambridge University Press: Cambridge.
- ^ Hensher, D. A. & Button, K. J. (2002). Handbook of Transport Modeling. Elsevier Science: Cambridge, UK.
- ^ Ortuzar, Juan de Dios and Luis G. Willumsen (2001). Modeling Transport (3rd edition). J. Wiley & Sons: New York.
- ^ Levine, N. and Canter, P. (2011). “). “Linking origins with destinations for DWI Motor Vehicle Crashes: An application of crime travel demand modeling”. Crime Mapping. In press.
- ^ Levine, N. (2007), “Crime travel demand and bank robberies: Using CrimeStat III to model bank robbery trips”. Social Science Computer Review, 25(2), 239-258.
- ^ Brodsky, H. (2002). “CrimeStat II on the geostatistical scence”. Geospatial Solultions, November. 49-53
- ^ Paulsen, D. & Robinson, M. (2008).Spatial Aspects of Crime: Theory and Practice (2nd edition). Allyn & Bacon.
- ^ Chainey, S. & Ratcliffe, J. (2005). GIS and Crime Mapping. John Wiley & Sons, Ltd.
- ^ Lai PC, Low CT, Wong M, Wong WC, & Chan MH. (2009). “Spatial analysis of falls in an urban community of Hong Kong”, International Journal of Health Geography, 17:8-14
- ^ de Smith, M. J., Goodchild, M. F., & Longley, P. A. (2007). Geospatial Analysis (second edition). The Winchelsea Press: Leicester, U.K.
- ^ Anne van der Veen, A. (2005). “Economic hotspots: Visualizing Vulnerability to Flooding”, Natural Hazards, 36(1-2), 65-80.
- ^ Anselin, L. (2003). “An Introduction to Point Pattern Analysis using CrimeStat”, GeoDa Center, Arizona State University: Tempe. http://geodacenter.asu.edu/system/files/points.pdf
- ^ Clevenger, A. P., Chruszcz, B. & Gunson, K. E. (2001). “Highway mitigation fencing reduces wildlife-vehicle collisions”, Wildlife Society Bulletin, 29(2),646-653.
- Levine, N. (2008). “CrimeStat: a spatial statistical program for the analysis of crime incidents”. Shekhar, S. and Xiong, H. (eds), Encyclopedia of Geographic Information Science. Springer. 187-193.
- Levine, N. (2006). “Crime mapping and the CrimeStat program. Geographical Analysis. 38 (1), 41-55.
Wikimedia Foundation. 2010.
Look at other dictionaries:
Geographic profiling — is a criminal investigative methodology that analyzes the locations of a connected series of crimes to determine the most probable area of offender residence. Typically used in cases of serial murder or rape (but also arson, bombing, robbery, and … Wikipedia
Kernel density estimation — of 100 normally distributed random numbers using different smoothing bandwidths. In statistics, kernel density estimation is a non parametric way of estimating the probability density function of a random variable. Kernel density estimation is a… … Wikipedia
k-means clustering — In statistics and data mining, k means clustering is a method of cluster analysis which aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean. This results into a partitioning of… … Wikipedia