362 Commits

Author SHA1 Message Date
Gunnsteinn Hall
d5396dd893 Fixes in extractor testing 2018-03-22 13:53:12 +00:00
Gunnsteinn Hall
4af19b929a License handling in extractor 2018-03-22 13:17:09 +00:00
Gunnsteinn Hall
817ea404b9 Extractor package with powerful text extraction capabilities and CMap handling. Closes #17 2018-03-22 13:01:04 +00:00
Gunnsteinn Hall
820be65467
Merge pull request #146 from kevinburke/newline-parsing
ExtractText: handle text matrix (Tm) boxes
2018-03-21 16:03:51 +00:00
Gunnsteinn Hall
695bd434a3
Merge pull request #138 from emmanuel-florent/master
Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
2018-03-21 10:34:10 +00:00
Gunnsteinn Hall
0fbe11ab75 Merge branch 'llonchj-master' 2018-03-20 14:28:10 +00:00
Gunnsteinn Hall
e71bb34362 Fix pagebreak generate page blocks. Support pagebreak in subchapter. 2018-03-20 14:21:03 +00:00
Gunnsteinn Hall
d95127579f Merge branch 'master' of https://github.com/llonchj/unidoc into llonchj-master 2018-03-20 13:16:59 +00:00
Gunnsteinn Hall
e80a3dab58
Merge pull request #148 from quetz/master
Parser crude optimizations.
2018-03-20 11:47:34 +00:00
qz
b7efd634fe Parser crude optimizations.
- parsing functions now collect results in bytes.Buffer
- added benchmarks
- parseHexString avoids calling skipSpace
2018-03-20 04:37:12 +03:00
Gunnsteinn Hall
9e10ce018f
Merge pull request #145 from quetz/master
fixes #144, using bytes.Buffer for more efficient string building
2018-03-20 00:13:17 +00:00
Kevin Burke
1726a4708c
ExtractText: handle text matrix (Tm) boxes
Previously Tm boxes would have their text extracted, but with no
formatting information between boxes. Now, if a box is lower on the
page than has previously been recorded, we insert a newline, and if
a box is further to the right on the same line than a previous box, we
insert a tab character.

This works well enough for parsing Clipper Card transaction histories.
You can see an example at https://clipper-csv.appspot.com.

Fixes #141.
2018-03-17 12:45:12 -07:00
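
The newline and tab rule described in commit 1726a4708c can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the `textBox` type and the `joinBoxes` function are hypothetical names, and the sketch assumes each box carries the x/y translation set by its Tm operator, with y growing upward as in PDF user space. It also compares each box only with the one before it, which is a simplification of "lower than has previously been recorded".

```go
package main

import (
	"fmt"
	"strings"
)

// textBox is a hypothetical type for this sketch: the translation (X, Y)
// taken from a Tm operator plus the text shown at that position.
type textBox struct {
	X, Y float64
	Text string
}

// joinBoxes concatenates box texts in content-stream order.
// A box that sits lower on the page than the previous one (smaller Y)
// starts a new line; a box on the same line but further right gets a tab.
func joinBoxes(boxes []textBox) string {
	var sb strings.Builder
	var lastX, lastY float64
	for i, b := range boxes {
		if i > 0 {
			switch {
			case b.Y < lastY:
				sb.WriteByte('\n')
			case b.Y == lastY && b.X > lastX:
				sb.WriteByte('\t')
			}
		}
		sb.WriteString(b.Text)
		lastX, lastY = b.X, b.Y
	}
	return sb.String()
}

func main() {
	// Two rows of a made-up two-column table.
	rows := []textBox{
		{X: 10, Y: 700, Text: "Date"},
		{X: 200, Y: 700, Text: "Amount"},
		{X: 10, Y: 680, Text: "03/17"},
		{X: 200, Y: 680, Text: "2.50"},
	}
	fmt.Printf("%q\n", joinBoxes(rows))
}
```

Comparing Y values for exact equality is enough for this sketch; real documents would probably need a tolerance, since boxes on the same visual line can differ by small fractions of a unit.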
qz
3f7e07ac9e fixes #144, using bytes.Buffer for more efficient string building 2018-03-05 13:04:27 +03:00
Gunnsteinn Hall
b0297d84b2 Update readme 2018-02-23 14:11:58 +00:00
Gunnsteinn Hall
51f575080a Licensing updates 2018-02-23 14:07:26 +00:00
Jordi Llonch
7ae11755c8 Added Paragraph.Text function returning text content 2018-02-15 13:11:32 +11:00
Jordi Llonch
c9f172694c fix TableCell.SetBorderColor 2018-02-12 14:18:37 +11:00
Jordi Llonch
40f364059f PageBreak drawable implementation 2018-02-12 13:07:35 +11:00
Emmanuel Florent
f3adc44a4f Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
AnnotationWidget are useful for adding Signature widgets
2018-01-28 09:54:01 +01:00
Gunnsteinn Hall
91660d88c4
Merge pull request #132 from Northern-Lights/master
Fix #131: Return error on pageNumber = 0
2017-12-27 20:39:36 +00:00
Armen Boursalian
6613014adf Fix #131: Return error on pageNumber = 0 2017-12-11 16:13:21 -08:00
Gunnsteinn Hall
978d94da33
Merge pull request #126 from drhoden/Fill_in_Form_Fields_Issue_18
Added SetForms to Creator, giving the ability to fill in form fields
2017-11-08 21:19:06 +00:00
Daniel Rhoden
f2fa55a63b Added SetForms to Creator 2017-11-08 14:54:59 -06:00
Gunnsteinn Hall
73394ee038
Merge pull request #124 from njwilson23/master
Insert a space when TJ string is offset by more than a threshold
2017-11-02 06:00:47 +00:00
Nat Wilson
30ff60a44e Insert a space when TJ string is offset by more than a threshold
That threshold is hard-coded as -100 (is this font specific?)
2017-10-29 10:55:21 -07:00
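
The thresholding idea in commit 30ff60a44e can be illustrated with a short sketch. The names `textFromTJ` and `spaceThreshold` are hypothetical, and the representation of a TJ array as a slice holding `string` and `float64` values is an assumption made for this example, not the library's actual object model. In a TJ array, numbers are expressed in thousandths of a text space unit and a negative number widens the gap before the next string, so values below -100 are treated here as a word break.

```go
package main

import (
	"fmt"
	"strings"
)

// spaceThreshold mirrors the hard-coded -100 mentioned in the commit
// message. Whether the right value depends on the font is left open there.
const spaceThreshold = -100.0

// textFromTJ flattens a TJ array into plain text. Elements are assumed to
// be either string (text to show) or float64 (a position adjustment).
// An adjustment below the threshold is rendered as a single space.
func textFromTJ(elems []interface{}) string {
	var sb strings.Builder
	for _, e := range elems {
		switch v := e.(type) {
		case string:
			sb.WriteString(v)
		case float64:
			if v < spaceThreshold {
				sb.WriteByte(' ')
			}
		}
	}
	return sb.String()
}

func main() {
	// -250 is a large gap (word break); -20 is ordinary kerning.
	tj := []interface{}{"Hello", -250.0, "World", -20.0, "!"}
	fmt.Printf("%q\n", textFromTJ(tj))
}
```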
Gunnsteinn Hall
bdf676e045 Support for resampling images with BitsPerComponent != 8. 2017-10-21 15:30:40 +00:00
Gunnsteinn Hall
dce6d19cb1 Merge pull request #121 from mrsinham/master
Text wrapping : dont encode p.textLines
2017-09-22 10:52:17 +00:00
Julien Lefèvre
4f438e3fed dont encode glyphs when you only wrap 2017-09-22 10:57:16 +02:00
Gunnsteinn Hall
207d0ef510 Add support for specifying alternate image encoding/compression method (creator). 2017-09-05 21:37:23 +00:00
Gunnsteinn Hall
81967a216b Add commercial license to repository and full AGPL license text. Updated licensing text in README. 2017-08-24 13:57:17 +00:00
Gunnsteinn Hall
e498030678 Updated version 2017-08-14 11:59:12 +00:00
Gunnsteinn Hall
b3e300884f Few fixes: Shadingtype 4 handling, inline image handling (compatibility), encoding name (compatibility) 2017-08-14 11:56:33 +00:00
Gunnsteinn Hall
144fa3f75b Improved CS handling with Separation, Lab color ranges. 2017-08-11 12:03:24 +00:00
Gunnsteinn Hall
e970ac4c84 Fix in handling of indexed colorspace parsing inline images when using base=DeviceRGB rather than shortcut. Closes #114. 2017-08-11 09:20:16 +00:00
Gunnsteinn Hall
50041f0a78 Support indexed cs in inline image. Fixes #113. Fixes in lab colorspace color processing.
Exported NewPdfColorspaceFromPdfObject so could use it from package
content stream.  Moving from dot imports.
2017-08-10 07:23:42 +00:00
Gunnsteinn Hall
d234fec92e Allow array in DecodeParms in FlateDecoder of length 1 (containing dict). 2017-08-09 18:09:30 +00:00
Gunnsteinn Hall
cc6883d0ae Fix problem with MultiEncoder DecodeParms. Fixes #97. 2017-08-09 17:43:25 +00:00
Gunnsteinn Hall
9e04cbd144 Limit maximum specified number of indirect objects when parsing xref stream. Fixes #111. 2017-08-08 17:35:06 +00:00
Gunnsteinn Hall
4217f83fdd Merge pull request #109 from peterwilliams97/xmaster3
Make parseFloat behave like gs in contentstream handling
2017-08-08 07:34:44 +00:00
Peter Williams
2db7c72bb7 Merge remote-tracking branch 'upstream/master' into xmaster3 2017-08-08 15:44:25 +10:00
Peter Williams
47c329e93e Error -> Debug log 2017-08-08 15:43:11 +10:00
Gunnsteinn Hall
574e5b1933 Merging with recent pull request. 2017-08-07 20:30:29 +00:00
Gunnsteinn Hall
e2bfa9094a Fixing lab colorspace component input ranges. Fix Indexed cs Image to rgb conversion. 2017-08-07 20:21:35 +00:00
Gunnsteinn Hall
d015aa5cf6 Merge pull request #108 from peterwilliams97/xmaster2
Reimplementing dummy colorspaces. Lab colorspace range.
2017-08-07 20:21:22 +00:00
Peter Williams
9d09f7aa2a Make float parsing more like gs 2017-08-07 16:44:10 +10:00
Peter Williams
f840c1c77e Fixed Lab bounds 2017-08-07 16:42:07 +10:00
Peter Williams
a96a007238 Merge remote-tracking branch 'upstream/master' into xmaster2 2017-08-07 13:57:07 +10:00
Peter Williams
2bd4a0a995 Added dummy encodings 2017-08-07 13:36:39 +10:00
Peter Williams
38fa0f1fb4 Added dummy encodings 2017-08-07 13:34:19 +10:00
Gunnsteinn Hall
d3d0312170 Fix PS processing of dup operand. Fixes #98.
Added test cases to verify.
2017-08-06 18:53:24 +00:00