13 Commits

Author SHA1 Message Date
Gunnsteinn Hall
6056cf828d remove dot import and address golint in contentstream 2018-08-03 10:17:06 +00:00
Gunnsteinn Hall
5b1b9bd504 Change PdfObjectArray to a struct and add receivers Elements, Get, Set, Len. Fix the resulting broken code. 2018-07-15 17:52:53 +00:00
Gunnsteinn Hall
14ee80e1fe Preserve and allow output of hexadecimal strings
Refactored PdfObjectString into a struct with bool flag for hex.  Fixed any code broken by the change.
Unexported non-essential functions for crypto (not used by model).  Can unexport more later or refactor to internal package.
2018-07-14 02:25:29 +00:00
Gunnsteinn Hall
817ea404b9 Extractor package with powerful text extraction capabilities and CMap handling. Closes #17 2018-03-22 13:01:04 +00:00
Kevin Burke
1726a4708c
ExtractText: handle text matrix (Tm) boxes
Previously Tm boxes would have their text extracted, but with no
formatting information between boxes. Now, if a box is lower on the
page than has previously been recorded, we insert a newline, and if
a box is further to the right on the same line than a previous box, we
insert a tab character.

This works well enough for parsing Clipper Card transaction histories.
You can see an example at https://clipper-csv.appspot.com.

Fixes #141.
2018-03-17 12:45:12 -07:00
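
The newline/tab heuristic described in commit 1726a4708c can be sketched roughly as follows. This is a minimal illustration, not the code from that commit: the `textBox` type and the `joinBoxes` function are hypothetical names, and the sketch assumes the default PDF coordinate system, where a smaller Y value means lower on the page.

```go
package main

import (
	"fmt"
	"strings"
)

// textBox is a hypothetical record of one text run together with the
// translation components of the Tm operator that positioned it.
type textBox struct {
	X, Y float64
	Text string
}

// joinBoxes concatenates the text of the boxes in order. It writes a newline
// before a box that is lower on the page than any box seen so far, and a tab
// before a box that sits on the current lowest line and is further right
// than the previous box. Assumption: lower on the page means smaller Y.
func joinBoxes(boxes []textBox) string {
	var sb strings.Builder
	var minY, lastX float64
	for i, b := range boxes {
		if i > 0 {
			switch {
			case b.Y < minY:
				sb.WriteByte('\n')
			case b.Y == minY && b.X > lastX:
				sb.WriteByte('\t')
			}
		}
		if i == 0 || b.Y < minY {
			minY = b.Y
		}
		lastX = b.X
		sb.WriteString(b.Text)
	}
	return sb.String()
}

func main() {
	boxes := []textBox{
		{X: 10, Y: 700, Text: "Date"},
		{X: 200, Y: 700, Text: "Amount"},
		{X: 10, Y: 680, Text: "Jan 1"},
		{X: 200, Y: 680, Text: "$2.50"},
	}
	fmt.Printf("%q\n", joinBoxes(boxes))
}
```

Tab-separated cells and newline-separated rows make table-like pages, such as the transaction histories mentioned in the commit message, straightforward to convert to CSV.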
Nat Wilson
30ff60a44e Insert a space when TJ string is offset by more than a threshold
That threshold is hard-coded as -100 (is this font specific?)
2017-10-29 10:55:21 -07:00
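
The TJ spacing rule from commit 30ff60a44e might look roughly like the sketch below. The names `spaceThreshold` and `textFromTJ` are hypothetical, the TJ operand is modeled as a plain slice of strings and numbers instead of the library's own object types, and the strict comparison against -100 is an assumption about how the threshold is applied.

```go
package main

import (
	"fmt"
	"strings"
)

// spaceThreshold mirrors the hard-coded -100 from the commit message.
// TJ adjustments are expressed in thousandths of a text space unit, and a
// negative value moves the following glyphs further along the line.
const spaceThreshold = -100.0

// textFromTJ walks a simplified TJ operand array. Strings are appended as-is;
// a numeric adjustment below the threshold is treated as a word gap and
// emitted as a single space. Smaller adjustments (kerning) are ignored.
func textFromTJ(elems []interface{}) string {
	var sb strings.Builder
	for _, e := range elems {
		switch v := e.(type) {
		case string:
			sb.WriteString(v)
		case float64:
			if v < spaceThreshold {
				sb.WriteByte(' ')
			}
		}
	}
	return sb.String()
}

func main() {
	tj := []interface{}{"Hello", -250.0, "World", -20.0, "!"}
	fmt.Println(textFromTJ(tj))
}
```

As the commit message itself asks, a fixed threshold may not suit every font, since the width of a space glyph varies; deriving the threshold from the font's metrics would be one alternative.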
Gunnsteinn Hall
badce2938f Address gofmt simplify items 2017-07-31 14:54:14 +00:00
Gunnsteinn Hall
be629aecbf Add convenience functions in Content stream creator 2017-07-01 21:57:31 +00:00
Gunnsteinn Hall
3586509908 Add functions to wrap content state (q..Q). Fix TJ.
Changes function syntax to use *ContentStreamOperations so that the functions can
change state.
2017-06-28 15:13:37 +00:00
Gunnsteinn Hall
e3c90b85b7 Cleaning up comments etc. 2017-04-05 18:05:38 +00:00
Gunnsteinn Hall
8ae4f6a63a Initial implementation of a simple graphics state handler / content stream processor. Multiple changes and fixes in colorspace handling. And more. 2017-03-14 13:04:51 +00:00
Gunnsteinn Hall
c599f8decc Fix package comments 2017-03-02 12:49:42 +00:00
Gunnsteinn Hall
4aa6845e27 Refactored contentstream into a separate package. 2017-03-01 16:02:53 +00:00