Best practices for photographing old documents for long-term digital preservation
Asked 4/29/2020
7 views
2 answers
0
We’re building a workflow to photograph old documents for preservation and long-term access. We already have a Nikon Z7-class camera in budget, use an X-Rite target for color correction, and are considering storing RAW plus TIFF masters and smaller TIFF access files. Space is available, but we still want sensible file sizes.
What should we prioritize for the capture setup and workflow to get the best archival results? In particular:
- lighting and copy setup
- camera settings such as ISO, aperture, distance, and focus
- color management and metadata to preserve
- whether glass is appropriate to hold documents flat, and any concerns about reflections or color cast
- any general guidance for creating image files that remain useful for future research, OCR, or machine analysis
Originally by Photography Stack Exchange contributor. Source · Licensed CC BY-SA 4.0
Photography Stack Exchange contributor
6y ago
2 Answers
1
I think you maybe asking two separate sets of questions: 1) What are recommended settings and conditions for the best photo and 2) What is the best format to ensure longevity of the digital file?
For 1) it depends on your subject, but otherwise I suggest reading on product shot information or better yet, hire a professional photographer that shoots similar shots for a living. They could recommend the best conditions, lighting, camera settings and other items, which will vary by location, subject, lens, etc. Having an experienced professional, on site, and available to analyze the final images would be a good spend of money and time, and will ensure that you have the best conditions for your subject and location.
As for 2), this represents a significant issue. Not only must you concern yourself with the file and file reading capability, but also the media. 500 years is a significant timeline. Given your timeline, I do not think any file format will be preferred over others. IMHO, likely any of them will be lost to time, forgotten and be unreadable in that timeline.
The Library of Congress (LoC) might be a valuable resource, and perhaps you should consider their approach: They preserve the original as current knowledge advises, and constantly improve their methods over time. They never store and forget. In addition they are constantly updating to the latest storage methods, for example digitally scanning, and also converting those scans to newer formats as time goes on. Most importantly, the LoC also preserves the devices needed to read those artifacts.
Edison invented the phonograph, and recordings sold by his company and others were wildly popular. Many of these earliest recordings are preserved in the LoC. Yet there are likely very few readers of this answer that can listen to the Edison Phonograph recordings, because those were recorded on wax cylinders, not records as we know them. Records did not become popular until they were standardized in the 1940s. So, in less than 100 years, the ability to play the original format had all but disappeared and is rarer still today.
So, picking the most popular format is no guarantee of success. I would recommend picking a non-proprietary format, save and maintain the equipment needed to read and access those files and then put in place processes to maintain other more current formats of the original over time. NEF obviously is proprietary to Nikon, and the specifications of the format are not published. While the TIFF specification is technically owned by Adobe, they allow its use broadly, publish the format openly and it is well supported by software and hardware today. TIFF is included in the LoC recommended format standards. But, your plans should include maintaining a computer or other device capable of reading, displaying and digitally transferring those images for future generations to convert into modern future formats.
Originally by user4880. Source · Licensed CC BY-SA 4.0
user4880
6y ago
0
Generated from our catalog & community — verify before relying on it.
Treat this as two separate problems: capture quality and file longevity.
For capture, the exact “best” settings depend on the documents, lens, lighting, and workspace. In general, use a proper copy/product-photography setup with controlled, even lighting and test your workflow carefully. Low ISO, accurate focus, stable camera support, and a lens/camera position that keeps the document flat and undistorted are the key basics. Because this is preservation work, it’s worth hiring or consulting an experienced reproduction/product photographer to help set up lighting, glass handling, and quality control for your specific materials.
If you use glass to flatten pages, watch for reflections, glare, and any tint or color cast from the glass.
For longevity, file format and documentation matter as much as image quality. Keeping original RAW files plus high-quality TIFF masters is a sensible approach, and smaller derivative files for access are fine. Preserve strong metadata describing capture conditions, equipment, dates, color workflow, and any processing performed.
No digital format is guaranteed readable hundreds of years from now, so plan for future migration and documentation rather than assuming one format will last forever.
Recommended products
UniqueBot
AI6y ago
Your Answer
Related Questions
Should I save scanned 35mm slides as JPEG or TIFF for archiving?
Does Apple Preview make lower-quality JPEGs than Photoshop, GIMP, or Bridge when converting PNG/TIFF scans on Mac?
Should I convert Sony ARW files to DNG for long-term storage?
Will my JPEG photos lose quality just from being opened, and should I convert them to PNG or TIFF for archiving?
How can a family share and edit photos and movies across multiple Macs with one central storage and backup?