gPDFText


gPDFText is a GTK+ text editor that opens PDF documents intended for ebook readers, converts the text content into plain ASCII text, restores the original paragraphs, and removes unwanted line breaks so that the text is easier to zoom on the reader.


Current release: 0.0.1.


Many PDF files downloaded for ebook readers still use the A4 paper size (or letter, which is similar in size). When such a PDF is displayed on the ebook reader, the zoom level needed to fit the entire page makes the text too small. Simply exporting the PDF to text causes problems with line wrapping, and the various ways that ebook PDFs indicate page headers and footers make it hard to automate the conversion.

gPDFText loads the PDF, extracts the text, reformats the paragraphs into single long lines and then puts the text into a standard GTK+ editor where you can make other adjustments.
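
The paragraph reformatting step described above can be illustrated with a minimal Python sketch. This is not gPDFText's own code; it only shows the general idea under two assumptions: that paragraphs in the extracted text are separated by blank lines, and that a line ending in a hyphen followed by a lowercase letter is a word split across lines. The function name `reflow_paragraphs` and the `join_hyphens` parameter are hypothetical.

```python
import re


def reflow_paragraphs(text: str, join_hyphens: bool = True) -> str:
    """Join wrapped lines so that each paragraph becomes one long line.

    Assumption: blank lines mark paragraph boundaries in the extracted text.
    """
    # Normalise line endings so the splitting below behaves consistently.
    text = text.replace("\r\n", "\n").replace("\r", "\n")

    paragraphs = []
    # One or more blank (or whitespace-only) lines separate paragraphs.
    for block in re.split(r"\n\s*\n", text):
        lines = [line.strip() for line in block.split("\n") if line.strip()]
        if not lines:
            continue
        merged = lines[0]
        for line in lines[1:]:
            if join_hyphens and merged.endswith("-") and line[:1].islower():
                # Assumed to be a word hyphenated across a line break.
                merged = merged[:-1] + line
            else:
                merged += " " + line
        paragraphs.append(merged)

    # Keep one blank line between the reconstructed paragraphs.
    return "\n\n".join(paragraphs)


if __name__ == "__main__":
    sample = (
        "The quick brown fox jum-\n"
        "ped over the\n"
        "lazy dog.\n"
        "\n"
        "Second para-\n"
        "graph here.\n"
    )
    print(reflow_paragraphs(sample))
```

The hyphen heuristic can be wrong for genuinely hyphenated compounds that happen to wrap, which is one reason a sketch like this exposes the behaviour as a switch and leaves the final adjustments to manual editing.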

On the ebook reader, the plain text file then has no unwanted line breaks and can be zoomed to whatever text size you prefer.

Each reformatting option can be turned off using the gPDFText preferences window.

Spelling support also helps identify areas where the text has not been fully reconstructed.

 

Distributions

If you use Debian GNU/Linux, gpdftext will soon be in the unstable distribution.



The copyright licensing notice below applies to this text.

Copyright © 2009 Neil Williams

Permission is granted to copy, distribute, and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts. A copy of this license is included in the file copying.txt.