Draft

Step-by-Step Procedures

1) Scan with the Visioneer scanner into PaperPort (version 6.1 or later), or directly into OmniPage.

2) If using PaperPort, create a virtual reality folder for the scans, scanning no more than one letter of the alphabet per folder:

megafile\msa\govpub\dc100\dc65\000001\000001 (for 'a'; 2 for 'b', ...)\tif

3) Maintain quality control in PaperPort or OmniPage, saving working files to a max or opd subdirectory and exporting files to a tif directory. Label the exported files by the series number plus two digits for the letter of the alphabet (in this case, s141101 for the letter 'a').

4) Import the tifs into OmniPage and correct them there, saving the work in progress to an opd directory.

5) Export the opd file as simple txt to a txt directory.

6) If necessary, use Quick File Rename to make the tif names conform to the labeling of the .txt files, or vice versa (e.g., to eliminate the leading zeros in the tif numbering, converting s141101-001.tif to s141101-1.tif).

7) Run the Perl program against the megafile\msa\govpub\dc100\dc65\ directory, making certain that the necessary image directory, etc., are present at the right directory level.

8) Run scripted global search and replace passes, using the Funduc Search and Replace program, to standardize the text in the HTML files as shown in the examples.

9) Possibly move the alpha folders (000001-000026) to their appropriate location under megafile\stagser\s1400\s1411\
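The naming conventions in steps 2, 3, and 6 can be sketched in Python as below. This is only an illustration, not part of the procedure: the function names are hypothetical, and the series prefix s1411 is an assumption inferred from the example label s141101.

```python
import re
import string


def letter_number(letter: str) -> int:
    """Return 1 for 'a', 2 for 'b', ... 26 for 'z'."""
    letter = letter.lower()
    if len(letter) != 1 or letter not in string.ascii_lowercase:
        raise ValueError(f"expected a single letter a-z, got {letter!r}")
    return string.ascii_lowercase.index(letter) + 1


def folder_name(letter: str) -> str:
    """Six-digit folder name for a letter, e.g. 'a' -> '000001' (step 2)."""
    return f"{letter_number(letter):06d}"


def file_prefix(letter: str, series: str = "s1411") -> str:
    """Series number plus two digits for the letter, e.g. 's141101' (step 3).

    The default series value is an assumption taken from the example label.
    """
    return f"{series}{letter_number(letter):02d}"


def strip_leading_zeros(filename: str) -> str:
    """Drop leading zeros from the page number before the extension (step 6).

    Names without a zero-padded number are returned unchanged.
    """
    return re.sub(r"-0+(\d+)(\.\w+)$", r"-\1\2", filename)
```

For example, strip_leading_zeros("s141101-001.tif") gives "s141101-1.tif", matching the conversion described in step 6, and folder_name("z") gives "000026".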
 

Repeat steps 1-8 until all letters of the alphabet are complete.
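Before treating a letter as complete, it may help to confirm that every tif has a matching txt file and vice versa. The sketch below is one possible check, not an existing tool: the function names are hypothetical, and it assumes the tif and txt directories sit side by side inside each letter folder.

```python
from pathlib import Path


def stems(names):
    """Lower-cased file names without their extensions."""
    return {Path(name).stem.lower() for name in names}


def unmatched(tif_names, txt_names):
    """Return (tifs with no txt, txts with no tif), each as a sorted list."""
    tif, txt = stems(tif_names), stems(txt_names)
    return sorted(tif - txt), sorted(txt - tif)


def check_letter_folder(letter_dir):
    """Report missing subdirectories and unmatched files for one letter folder.

    Assumes a layout of <letter_dir>/tif and <letter_dir>/txt.
    """
    letter_dir = Path(letter_dir)
    tif_dir, txt_dir = letter_dir / "tif", letter_dir / "txt"
    missing_dirs = [d.name for d in (tif_dir, txt_dir) if not d.is_dir()]
    if missing_dirs:
        return {"missing_dirs": missing_dirs, "tif_only": [], "txt_only": []}
    tif_only, txt_only = unmatched(
        (p.name for p in tif_dir.glob("*.tif")),
        (p.name for p in txt_dir.glob("*.txt")),
    )
    return {"missing_dirs": [], "tif_only": tif_only, "txt_only": txt_only}
```

A letter folder whose report has three empty lists has the expected directories and a one-to-one match between tif and txt names. A mismatch such as s141101-002 against s141101-2 shows up in both lists, which points back to the renaming in step 6.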

Then:

1) Prepare images of the relevant series units accessed by the index.

2) Run the images through the cdc55.pl program.

3) Use global search and replace on the text in the HTML so that it conforms to the correct series unit descriptions.

4) Possibly convert the tifs to larger GIFs than those automatically produced by the Perl program.

5) Move the resulting files to their proper stagser, etc., location.

6) Use global search and replace to create the links from each index HTML entry to the appropriate image file HTML (hopefully this can be done by passing the page information to a CGI script that hyperlinks to the right page; if not, link at least to the first page of the series unit).
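The link described in step 6 might be generated along the following lines. This is a sketch under stated assumptions, not the actual script: the CGI path /cgi-bin/showpage.cgi and the parameter names unit and page are hypothetical placeholders, since the procedure does not yet name them.

```python
from html import escape
from urllib.parse import urlencode

# Hypothetical CGI script location; substitute the real one once it exists.
CGI_URL = "/cgi-bin/showpage.cgi"


def page_link(label, unit, page=None):
    """Build an HTML link from an index entry to a page of a series unit.

    When no page number is known, fall back to page 1, i.e. the first
    page of the series unit.
    """
    params = {"unit": unit, "page": page if page is not None else 1}
    href = f"{CGI_URL}?{urlencode(params)}"
    # Escape both the URL and the visible text for safe use inside HTML.
    return f'<a href="{escape(href, quote=True)}">{escape(label)}</a>'
```

Calling page_link("Adams, John", "unit1", 12) produces a link whose query string carries unit=unit1 and page=12; leaving out the page argument yields a link to the first page instead.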