This paper focuses on fraud detection in surveys using Socio-Economic Panel (SOEP) data as an example for testing newly methods proposed here. A statistical theorem referred to as Benford’s Law states that in many sets of numerical data, the significant digits are not uniformly distributed, as one might expect, but adhere to a certain logarithmic probability function. In order to detect fraud, we derive several requirements that should, according to this law, be fulfilled in the case of survey data.We show that in several SOEP subsamples, Benford’s Law holds for the available continuous data. For this analysis, we developed a measure that reflects the plausibility of the digit distribution in interviewer clusters. We are thus able to demonstrate that several interviews that were known to have been fabricated and therefore deleted in the original user data set can now be detected using this method. Furthermore, in one subsample, we use this method to identify a case of an interviewer falsifying ten interviews not previously detected by the fieldwork organization.
© 2011 by Lucius & Lucius, Stuttgart