This dataset was generated in a joint effort by the Electronic Frontier Foundation (EFF) and the Multimedia Analysis and Data Mining (MADM) Group at the German Research Center for Artificial Intelligence (DFKI). The purpose of this dataset is to provide researchers a wide variety of different machine identification codes for development and evaluation purposes. The documents were collected by the EFF and scanned and ground-truthed at DFKI.
An overview of the printer samples is given here
Two versions are available:
Contact: Faisal Shafait, Joost van Beusekom