SCANDATA

A book of images scraped from the Internet Archive website. Used for calibrating large book scanners, they automate the process of color correction and the removal of hot spots created by the scanner’s lights, and are meant to be read by the calibration software rather than humans.

When present in an Internet Archive entry, the images are stored in an odd file format (.ppm) inside a zip archive called "scandata.zip," from which the title for this book comes. Using a script written in Python, URLs for Internet Archive entries containing these files were pulled from the Bing search engine API, their contents downloaded, and the images converted to JPGs. This collection represents a small fraction of the thousands of such files on Internet Archive’s site.

Download a PDF of the book, or buy it here.

All work on this site is licensed under a Creative Commons BY-NC-SA License. Feel free to use, but please let me know.