Resource title

Identification, Characteristics and Impact of Faked Interviews in Surveys : An analysis by means of genuine fakes in the raw data of SOEP

Resource image

image for OpenScout resource :: Identification, Characteristics and Impact of Faked Interviews in Surveys : An analysis by means of genuine fakes in the raw data of SOEP

Resource description

To the best of our knowledge, most of the few methodological studies which analyze the impact of faked interviews on survey results are based on ?artificial fakes? generated by project students in a ?laboratory environment?. In contrast, panel data provide a unique opportunity to identify data which are actually faked by interviewers. By comparing data of two waves, unequivocal fakes are easily identifiable. However, in most surveys there is no second wave because they have a pure cross-sectional nature. In search of a method which does not need two waves of data we test an unconventional benchmark called Benford?s Law, which is used by several accountants to discover frauds. Our preliminary results let us conclude that Benford´s Law might be not an efficient method for detecting faked data, but it might be a new instrument for quality control of the interviewing process The raw data of the German Socio-Economic Panel Study (SOEP) provide a rich source of faked interviews because it is built on several sub-samples. However, because interviewers know that panel respondents will be interviewed again over the course of time, clever interviewers will not fake panel interviews. In fact, in raw data of SOEP the share is about only 0,5 percent of all records. The fakes are used for an analysis of the potential impact of non detected fakes on survey results. The major result is that the faked records has no impact on the mean and the proportions. But in very rare, exceptional cases there may be a bias in estimates of correlations and regression coefficients if fakes would not be detected. One should note that – except for some fakes in the first two waves of sample E – faked data were never disseminated within the widely-used SOEP. The fakes were detected before the data were released.

Resource author

Gert G. Wagner, Joerg-Peter Schraepler

Resource publisher

Resource publish date

Resource language

eng

Resource content type

text/html

Resource resource URL

http://hdl.handle.net/10419/18154

Resource license

Adapt according to the presented license agreement and reference the original author.