Quote:
Originally Posted by Quoth
My experience is that the Internet Archive "ebooks" are worthless. They are generated automatically from unproofed OCR text that is only marginally good for searching.
So I deleted them all and only download PDFs (after checking they are really PD) and read them on a tablet.
You are better off doing your own OCR of the PDF and proofing it. Do put page breaks at chapters, sections or other natural breaks in your word processor. Later those will start new files in the epub. A new file is the only reliable page break and works for epub converted to mobi, azw3/KF8, dual mobi and KFX.
Just for the record--and to offset the quoted view--my experience from reading many Internet Archive "epubs" OCR'd from PDFs over many years is that about two-thirds of them are perfectly OK and about 10% are completely unusable. Note that this is just for casual reading.
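For anyone who does go the route the quoted post describes, here is a rough sketch of the "one file per page break" idea. It assumes the proofed text was exported as plain text with a form feed character at each page break (that convention, the function names and the chapterNNN.xhtml file names are all made up for illustration). It only produces the per-chapter XHTML content, not a complete epub with manifest and packaging.

```python
from html import escape

PAGE_BREAK = "\f"  # assumption: each page break was exported as a form feed


def split_at_page_breaks(text):
    """Return the non-empty chunks of text between page-break markers."""
    return [chunk.strip() for chunk in text.split(PAGE_BREAK) if chunk.strip()]


def to_xhtml(chunk, title):
    """Wrap one chunk in a minimal XHTML document.

    Paragraphs are assumed to be separated by blank lines; line wraps
    inside a paragraph are collapsed to single spaces.
    """
    paragraphs = [p.strip() for p in chunk.split("\n\n") if p.strip()]
    body = "\n".join(
        f"    <p>{escape(' '.join(p.split()))}</p>" for p in paragraphs
    )
    return (
        '<?xml version="1.0" encoding="utf-8"?>\n'
        '<html xmlns="http://www.w3.org/1999/xhtml">\n'
        f"  <head><title>{escape(title)}</title></head>\n"
        "  <body>\n"
        f"{body}\n"
        "  </body>\n"
        "</html>\n"
    )


def build_chapter_files(text):
    """Map hypothetical file names (chapter001.xhtml, ...) to XHTML content."""
    return {
        f"chapter{i:03d}.xhtml": to_xhtml(chunk, f"Chapter {i}")
        for i, chunk in enumerate(split_at_page_breaks(text), start=1)
    }
```

Each entry in the returned dictionary would become its own file inside the epub, so every chapter starts on a fresh page regardless of which format the book is later converted to.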