Subj : Paper page to view file
To : All
From : Mike Luther
Date : Wed Jan 30 2008 02:54 pm
Thoughts please?
I have a client who 'inherited' about 4000 client files averaging about 50
pages of paper per file. These are all text based records and do not seem to
favor any kind of OCR scanning to convert them to text/data files of any kind.
It appears that all must be scanned visually, then saved in black and white as
images of the pages. The page images look like they will have to go to hard
drive storage; file naming, indexing and organization are not key issues at
the moment.
I'm not familiar with this in OS/2. Outside people are suggesting that the
final storage 'format' for these images should be either .JPG or .PDF files.
For research and study, I have a fully paid-for PMView tool set, an up-to-date
Lotus Smart Suite for OS/2 with all the latest fixpacks applied, and the free
public release version of TrueSpec Graphis Pro. I also have access to the
SANE/TAME products, which I have never installed to date.
Although I'm not the party to be saddled with this project (PROJECT!), I still
need to know a couple of things. For test purposes I have both an HP 1120 and
one other HP combination printer and scanner, the drivers for which I think I
have as part of the latest OS/2 MCP2 device drivers. But even with that done,
with USB installed and supported, I guess, by SANE/TAME or whatever, some other
questions need answers before I even start to learn more about this.
As best I can tell, a black and white image of handwritten and/or typed notes
on each page, at 300 DPI resolution, would amount to about a megabyte per page
as a raw image. In a format such as .JPG, from what I think I see, it might be
about 240K per page as a file of that image. By comparison, looking at a .PDF
file of a page of text and minor imaging, I see about 9K to 10K per page as a
file of that type.
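As a rough check on the "about a megabyte per page" figure, here is a minimal
Python sketch. It assumes a US Letter page (8.5 x 11 inches) stored at one bit
per pixel with each row padded to a whole byte. The page size and the function
name are my assumptions for illustration; they are not from the files
themselves.

```python
import math

def bilevel_page_bytes(width_in, height_in, dpi):
    """Uncompressed size in bytes of a 1-bit-per-pixel page image."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    # Eight pixels per byte, each row padded up to a whole byte.
    row_bytes = math.ceil(width_px / 8)
    return row_bytes * height_px

# Assumed page size: US Letter, 8.5 x 11 inches, scanned at 300 DPI.
size = bilevel_page_bytes(8.5, 11, 300)
print(f"{size:,} bytes, about {size / (1024 * 1024):.2f} MB")
# prints: 1,052,700 bytes, about 1.00 MB
```

So the raw figure holds up for a pure black and white scan. A grayscale or
color scan of the same page would be 8 or 24 times larger before compression.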
But what, on average, might I be looking at for file storage of captured
images of, say, 200,000 such pages? Unless my mental math is wrong, at even
30,000K per page that is some 6 terabytes of disk space.
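To put the per-page figures side by side, here is a minimal Python sketch of
the totals for 4000 files of about 50 pages each. It assumes decimal units
(1 GB = 1,000,000 K), and the per-page sizes are only the rough numbers quoted
in this message, not measurements.

```python
PAGES = 4000 * 50  # 4000 files, about 50 pages each = 200,000 pages

# Per-page sizes in K, taken from the rough figures in this message.
per_page_kb = {
    "raw image (about 1 MB)": 1000,
    ".JPG (about 240K)": 240,
    ".PDF (about 10K)": 10,
    "30,000K per page": 30000,
}

def total_gb(pages, kb_per_page):
    """Total storage in decimal gigabytes (1 GB = 1,000,000 K)."""
    return pages * kb_per_page / 1_000_000

for label, kb in per_page_kb.items():
    print(f"{label:24s} {total_gb(PAGES, kb):8.1f} GB")
```

On those assumptions the 6 terabyte total only applies at 30,000K per page. At
the 240K .JPG figure the same 200,000 pages come to about 48 GB, and at 10K
per page about 2 GB.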
Any suggestions as to how one approaches this kind of thing in OS/2, given the
programs already available as cited above?
Thanks!
--> Sleep well; OS/2's still awake! ;)
Mike @ 1:117/3001
--- Maximus/2 3.01
* Origin: Ziplog Public Port (1:117/3001)