Updated: May 16, 2025
When the original is in poor condition, as with very old books, the text is keyed in manually, word by word. Some volunteers also prefer to type short texts, or works they particularly like. But most books are scanned, "OCRized" and proofread. Digitization in "text format" has many advantages: a book can be copied, indexed, searched, analyzed and compared with other books.
Using scanned books as they are, converted to text format by OCR software with no proofreading, gives a much lower quality result. After running OCR software, the text is 99% reliable in the best of cases, so some errors always remain until the text is proofread. For this reason, Project Gutenberg's perspective is rather different from that of the Internet Archive. In the Internet Archive's Text Archive, books are scanned and "OCRized", but they are not proofread.
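To give a sense of what "99% reliable" can mean in practice, here is a rough back-of-the-envelope sketch. It assumes the figure is a per-character accuracy, which the text does not specify, and the book size (300 pages, about 2,000 characters per page) and the function name `expected_ocr_errors` are hypothetical values chosen only for illustration.

```python
def expected_ocr_errors(total_units: int, accuracy: float) -> int:
    """Estimate how many units (characters or words) OCR is likely to get wrong.

    `accuracy` is the fraction of units recognized correctly (0.99 for 99%).
    Treating the figure as per-character accuracy is an assumption.
    """
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be between 0 and 1")
    return round(total_units * (1.0 - accuracy))


# Hypothetical book: 300 pages at about 2,000 characters per page.
pages = 300
chars_per_page = 2_000
total_chars = pages * chars_per_page

errors = expected_ocr_errors(total_chars, 0.99)
print(f"{total_chars} characters, about {errors} likely errors "
      f"({errors / pages:.0f} per page)")
```

Under these assumptions, even the best-case OCR output would leave roughly 6,000 errors in the book, or about 20 per page, which is why proofreading makes such a difference to the final text.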