Search This Blog

Monday, June 15, 2009

New File Format for On-line Articles - Feedback Requested

One of the constant challenges when placing material on-line is to produce readable text that can be spidered by search engines, produced quickly and is of a small enough file size to be accessed by those in the developing world using dial-up connections. OK - that's a whole bundle of challenges rolled into one!

I was recently introduced to a new file format - Deja Vu (.djvu) which produces files which combine an image with an OCRed text overlay. Deja Vu is superior to the Adobe equivalent in that the files are much smaller and can be produced from images scanned at only 200 dpi.

I have been testing the new format on articles from the Bulletin of the Evangelical Theological Society. So far I have been able to upload 2 entire volumes in the same time it would have taken to upload about 4 articles using my usual method of Scan & OCR -> MS Word format -> Print -> Proofread -> PDF.

I will probably add PDF's versions in the course of time, but at least something will be available quickly. You will probably need to download a browser plugin to read the articles - a link is included on the page.

Take a look and let me know what you think.

No comments:

Post a Comment

Related Posts Plugin for WordPress, Blogger...