Case of the 35 Gigabyte Digital Record: OCR and Digital Workflows

by Kelley F. Rowan, Florida International University This presentation was given at the Panhandle Library Access Network’s (PLAN) Innovation Conference: Digitization- Preserving the Past for the Future Conference on August 14th, 2015. The presentation uses a specific collection of directories as a case study of the complications librarians and archivists face in digitizing older materials…

Digitization Workflows: Scanning, OCR, and Audio Transcription

Converting documents, text, images, and sound files to digital and/or machine-readable formats is a prerequisite for many digital humanities projects. Digitization is the process of capturing analog materials as digital images. Optical Character Recognition (OCR) programs “read” these images and convert them to text documents which can be easily searched, copied, edited, or used for…