Scott Moore wrote:
>>>In the meantime, the scan format is online, the PUG newsletters
>>>where widely and publically distributed, and you guys are smart,
>>>I think you can figgure out what to do.
>>
>>
>>No, recapturing things from those scans is far too hard and error
>>prone. I took a look already.
>>
>
>
> I agree that it is hard. The biggest difficulty with it is that the
format
> is completely messed up, and that is because I, nor anyone else seems to
> be able to find an OCR program that will output fixed format text in the
> same format as it was on the paper.
>
> However, Pascal does not care about the format the input is in, and it
> can be "prettyprinted" in any case.
>
A followup, the program I was using, readiris (came with scanner) was
pretty
painful to use with the scans, so I tried a different program yesterday,
Omnipage.
This program has its own issues, but the two big plusses for it are that
its
accuracy is much higher, and it knows how to preserve formatting even to a
text
file.
The disadvantage of Omnipage is that it knows how to decolummnate, but NOT
to TEXT files! I did a convertion of the Pascal formatter from PUG #13
page 49
by scanning each collumn on each page individually, then pasting them
together into the correct order, a real pain, but much better than
readiris.
I'll be putting this and other programs online. I don't have a decision
yet
on the BSI suite, the authors have not returned my email. Again, the
recommendation
is that you scan this yourself.
--
Samiam is Scott A. Moore
Personal web site: http:/www.moorecad.com/scott
My electronics engineering consulting site: http://www.moorecad.com
ISO 7185 Standard Pascal web site: http://www.moorecad.com/standardpascal
Classic Basic Games web site: http://www.moorecad.com/classicbasic
The IP Pascal web site, a high performance, highly ****table ISO 7185
Pascal
compiler system: http://www.moorecad.com/ippas
Good does not always win. But good is more patient.


|