Mike Gamble - 2011-10-22 23:10:42
This is just what I needed.
Formatting the text into HTML after extracting it took a little effort, but you provided all the tools I needed to get the job done. The job was fairly easy because I'm working with PDFs produced by a single application, so they are all structured the same. However, for those who don't know, writing a general purpose formatter would be very challenging because all PDFs are not created equal. Lines of text are sometimes arbitrarily broken (even in the middle of words) and numerically positioned either relative to a matrix or to the last element. In some cases, it could be difficult to determine where a broken sentence continues in the document.
Anyway, great job and thanks!