![]() |
|||
|
Intermountain Ski Instructors Association Records About this Digital Project Scanning
and OCR OCR on older, typewritten documents produces poor results. Colored paper, light ink, and stray marks all contribute to illegible or undecipherable results, and handwritten documents usually produce no results at all. There is a wide range of quality in the original documents of this collection. We created HTML templates in DreamWeaver 3.0 and pasted the OCR text into the body. There is one template for each box of manuscripts (11 total). The HTML templates contain a number of identical meta tags, but minimal unique indexing was done on each document by keying in a title that contains a brief subject line, date, and the place of the event covered in the document. Templates were used because of DreamWeaver's ability to instantly update the thousands of HTML files that were based on the templates. Each HTML file also contains a link to the PDF document - these were manually inserted into the files generated from the templates. Indexing
and Searching Browsing Questions
or Comments? |
|||
|
|
|||
| Digitization Center | Marriott Library | ||