414 Commits

Author SHA1 Message Date
Peter Williams
079d85779d fixed typo 2018-05-05 17:45:18 +10:00
Peter Williams
695f022baa No longer break GetContentStream() API backwards compatibility 2018-05-04 15:41:23 +10:00
Peter Williams
4b2d244fe0 When converting XImage objects to grayscale, also set the Matts of their SMask images to grayscale if they have SMasks 2018-05-02 17:14:31 +10:00
Peter Williams
df58e3e5bd Merge branch 'master' of https://github.com/unidoc/unidoc 2018-04-27 10:06:57 +10:00
Peter Williams
6692c21b74 Don't reuse parts of old objects 2018-04-27 10:06:36 +10:00
Gunnsteinn Hall
e314834685
Merge pull request #152 from quetz/master
page Resources can be loaded from indirect object
2018-04-18 06:38:43 +00:00
qz
14587fd738 type conversion consistent with project style 2018-04-18 09:09:29 +03:00
qz
bf5496aa8d page Resources can be loaded from indirect object 2018-04-11 21:47:48 +03:00
Gunnsteinn Hall
3c1db91380 Add extractor to doc 2018-03-22 14:48:41 +00:00
Gunnsteinn Hall
5b17c89af4 Fix for D/Decode operand. Fixes #149 2018-03-22 14:27:17 +00:00
Gunnsteinn Hall
413dcd757e Fix 2018-03-22 14:16:59 +00:00
Gunnsteinn Hall
a4fe3bded2 Add LICENSE.md with reference to AGPL and Commercial license. Add license header info to code. 2018-03-22 14:03:47 +00:00
Gunnsteinn Hall
d5396dd893 Fixes in extractor testing 2018-03-22 13:53:12 +00:00
Gunnsteinn Hall
4af19b929a License handling in extractor 2018-03-22 13:17:09 +00:00
Gunnsteinn Hall
817ea404b9 Extractor package with powerful text extraction capabilities and CMap handling. Closes #17 2018-03-22 13:01:04 +00:00
Gunnsteinn Hall
820be65467
Merge pull request #146 from kevinburke/newline-parsing
ExtractText: handle text matrix (Tm) boxes
2018-03-21 16:03:51 +00:00
Gunnsteinn Hall
695bd434a3
Merge pull request #138 from emmanuel-florent/master
Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
2018-03-21 10:34:10 +00:00
Gunnsteinn Hall
0fbe11ab75 Merge branch 'llonchj-master' 2018-03-20 14:28:10 +00:00
Gunnsteinn Hall
e71bb34362 Fix pagebreak generate page blocks. Support pagebreak in subchapter. 2018-03-20 14:21:03 +00:00
Gunnsteinn Hall
d95127579f Merge branch 'master' of https://github.com/llonchj/unidoc into llonchj-master 2018-03-20 13:16:59 +00:00
Gunnsteinn Hall
e80a3dab58
Merge pull request #148 from quetz/master
Parser crude optimizations.
2018-03-20 11:47:34 +00:00
qz
b7efd634fe Parser crude optimizations.
- parsing functions how collect results in bytes.Buffer
- added benchmarks
- parseHexString avoids calling skipSpace
2018-03-20 04:37:12 +03:00
Gunnsteinn Hall
9e10ce018f
Merge pull request #145 from quetz/master
fixes #144, using bytes.Buffer for more efficient string building
2018-03-20 00:13:17 +00:00
Kevin Burke
1726a4708c
ExtractText: handle text matrix (Tm) boxes
Previously Tm boxes would have their text extracted, but with no
formatting information between boxes. Now, if a box is lower on the
page than has previously been recorded, we insert a newline, and if
a box is further to the right on the same line than a previous box, we
insert a tab character.

This works well enough for parsing Clipper Card transaction histories.
You can see an example at https://clipper-csv.appspot.com.

Fixes #141.
2018-03-17 12:45:12 -07:00
qz
3f7e07ac9e fixes #144, using bytes.Buffer for more efficient string building 2018-03-05 13:04:27 +03:00
Gunnsteinn Hall
b0297d84b2 Update readme 2018-02-23 14:11:58 +00:00
Gunnsteinn Hall
51f575080a Licensing updates 2018-02-23 14:07:26 +00:00
Jordi Llonch
7ae11755c8 Added Paragraph.Text function returning text content 2018-02-15 13:11:32 +11:00
Jordi Llonch
c9f172694c fix TableCell.SetBorderColor 2018-02-12 14:18:37 +11:00
Jordi Llonch
40f364059f PageBreak drawable implementation 2018-02-12 13:07:35 +11:00
Emmanuel Florent
f3adc44a4f Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
AnnotationWidget are usefull for adding Signature widgets
2018-01-28 09:54:01 +01:00
Peter Williams
309e23f65d Merge branch 'master' of https://github.com/peterwilliams97/unidoc 2018-01-11 18:33:08 +11:00
Peter Williams
4a20c56de0 Merge remote-tracking branch 'upstream/master'
* upstream/master:
  Fix #131: Return error on pageNumber = 0
  Added SetForms to Creator
  Insert a space when TJ string is offset by more than a threshold
  Support for resampling images with BitsPerComponent != 8.
  dont encode glyphs when you only wrap
  Add support for specifying alternate image encoding/compression method (creator).
  Add commercial license to repository and full AGPL license text. Updated licensing text in README.
  Updated version
  Few fixes: Shadingtype 4 handling, inline image handling (compatibility), encoding name (compatibility)
  Improved CS handling with Separation, Lab color ranges.
  Fix in handling of indexed colorspace parsing inline images when using base=DeviceRGB rather than shortcut.  Closes #114.
  Support indexed cs in inline image. Fixes #113. Fixes in lab colorspace color processing.
  Allow array in DecodeParms in FlateDecoder of length 1 (containing dict).
  Fix problem with MultiEncoder DecodeParms.  Fixes #97.
  Limit maximum specified number of indirect objects when parsing xref stream.  Fixes #111.
  Error -> Debug log
2018-01-11 17:52:09 +11:00
Peter Williams
e4adfd91a3 Merge branch 'master' of https://github.com/peterwilliams97/unidoc
* 'master' of https://github.com/peterwilliams97/unidoc: (50 commits)
  Fixing lab colorspace component input ranges.  Fix Indexed cs Image to rgb conversion.
  Make float parsing more like gs
  Fixed Lab bounds
  Added dummy encodings
  Added dummy encodings
  Fix PS processing of dup operand. Fixes #98.
  Check sizes for memory allocation based on pdf user inputs. Fixes #107.
  Check to avoid division by zero.  Fixes #106.
  Add GetObjectNums
  Address go vet issues
  Fix comment typo
  Fixed some bugs found while getting pdf_descibe.go to work
  Address golint recommendations in core
  Address core golint recommendations in crypt, io
  Add check for base colorspace type when loading Indexed colorspace.  Fixes #95.
  Address more golint recommendations  #89
  Checks on stated byte lengths in xref stream objects.  Closes #94.
  Address golint recommendations. Add TODO comments for recommended future refactoring work in next major release.
  Only attempt to load annotation from a valid indirect object for annotation Popup entries.  Fixes #91.
  Address godoc code block line wrapping
  ...
2018-01-11 17:43:17 +11:00
Peter Williams
651ff90214 Set encoding when transforming patterns. Preserve SMasks when transforming image Xobjects 2018-01-11 16:25:04 +11:00
Peter Williams
6c3e49cd2d Merge remote-tracking branch 'upstream/master' 2018-01-10 11:47:00 +11:00
Gunnsteinn Hall
91660d88c4
Merge pull request #132 from Northern-Lights/master
Fix #131: Return error on pageNumber = 0
2017-12-27 20:39:36 +00:00
Armen Boursalian
6613014adf Fix #131: Return error on pageNumber = 0 2017-12-11 16:13:21 -08:00
Gunnsteinn Hall
978d94da33
Merge pull request #126 from drhoden/Fill_in_Form_Fields_Issue_18
Added SetForms to Creator, giving the ability to fill in form fields
2017-11-08 21:19:06 +00:00
Daniel Rhoden
f2fa55a63b Added SetForms to Creator 2017-11-08 14:54:59 -06:00
Gunnsteinn Hall
73394ee038
Merge pull request #124 from njwilson23/master
Insert a space when TJ string is offset by more than a threshold
2017-11-02 06:00:47 +00:00
Nat Wilson
30ff60a44e Insert a space when TJ string is offset by more than a threshold
That threshold is hard-coded as -100 (is this font specific?)
2017-10-29 10:55:21 -07:00
Gunnsteinn Hall
bdf676e045 Support for resampling images with BitsPerComponent != 8. 2017-10-21 15:30:40 +00:00
Gunnsteinn Hall
dce6d19cb1 Merge pull request #121 from mrsinham/master
Text wrapping : dont encode p.textLines
2017-09-22 10:52:17 +00:00
Julien Lefèvre
4f438e3fed dont encode glyphs when you only wrap 2017-09-22 10:57:16 +02:00
Gunnsteinn Hall
207d0ef510 Add support for specifying alternate image encoding/compression method (creator). 2017-09-05 21:37:23 +00:00
Gunnsteinn Hall
81967a216b Add commercial license to repository and full AGPL license text. Updated licensing text in README. 2017-08-24 13:57:17 +00:00
Gunnsteinn Hall
e498030678 Updated version 2017-08-14 11:59:12 +00:00
Gunnsteinn Hall
b3e300884f Few fixes: Shadingtype 4 handling, inline image handling (compatibility), encoding name (compatibility) 2017-08-14 11:56:33 +00:00
Gunnsteinn Hall
144fa3f75b Improved CS handling with Separation, Lab color ranges. 2017-08-11 12:03:24 +00:00