353 Commits

Author SHA1 Message Date
Peter Williams
efaadcae4f Use standard conventions for cmap names 2018-06-27 14:22:47 +10:00
Peter Williams
7af97fba6a Fixed some comments 2018-06-27 12:54:47 +10:00
Peter Williams
fa19666a38 Added missing const 2018-06-27 12:52:14 +10:00
Peter Williams
199a74dbd8 Major changes to font code
- Added Type1 font parsing.
- Added Standard 14 font parsing.
- Fixed some bugs in cmap code.
- Started re-structuring of font code. Moved common font fields to `fontSkeleton`
2018-06-27 12:25:59 +10:00
Gunnsteinn Hall
ce9c6ba114 Merge branch 'master' into compositefonts 2018-06-07 14:55:37 +00:00
Gunnsteinn Hall
f2ef568840 Add comments. Output generated files to /tmp. 2018-06-03 02:26:50 +00:00
Gunnsteinn Hall
0785daf45f Merge branch 'curve-shape' of https://github.com/s4kibs4mi/unidoc into s4kibs4mi-curve-shape 2018-06-03 01:16:48 +00:00
Gunnsteinn Hall
a7abcd0160 Track number of bytes per character code for mappings. Fixes problem posed in PR #156 in a generic fashion. 2018-06-03 01:05:46 +00:00
Sakib Sami
f82403ef12 Implemented : Curve & Filled Curve Shape with test 2018-05-31 03:48:26 +06:00
Gunnsteinn Hall
0c9502ad8b Block supporting more resource types (pattern, shading) 2018-05-22 12:57:32 +00:00
Gunnsteinn Hall
a1a95012ea Shading support function arrays 2018-05-22 12:56:40 +00:00
Gunnsteinn Hall
3b89a0ba98 Fixes #157 2018-05-20 23:33:24 +00:00
Gunnsteinn Hall
db0d78aa6c
Merge pull request #155 from hy05190134/master
bugfix for parse
2018-05-12 08:15:37 +00:00
yang.yang
e453d64ce0 bugfix for parse 2018-05-10 17:24:23 +08:00
Gunnsteinn Hall
311b0a8e2d
Merge pull request #153 from peterwilliams97/master
Fixes for grayscale conversion
2018-05-10 08:28:05 +00:00
Gunnsteinn Hall
223f22bfcd
Merge pull request #154 from hy05190134/master
bugfix for cal code val
2018-05-09 11:43:29 +00:00
yang.yang
a26d320c7b bugfix for cal code val 2018-05-08 15:39:29 +08:00
Peter Williams
608b2bf51e check that SMask is PdfObjectStream 2018-05-05 18:07:09 +10:00
Peter Williams
079d85779d fixed typo 2018-05-05 17:45:18 +10:00
Peter Williams
695f022baa No longer break GetContentStream() API backwards compatibility 2018-05-04 15:41:23 +10:00
Peter Williams
4b2d244fe0 When converting XImage objects to grayscale, also set the Matts of their SMask images to grayscale if they have SMasks 2018-05-02 17:14:31 +10:00
Peter Williams
df58e3e5bd Merge branch 'master' of https://github.com/unidoc/unidoc 2018-04-27 10:06:57 +10:00
Peter Williams
6692c21b74 Don't reuse parts of old objects 2018-04-27 10:06:36 +10:00
qz
14587fd738 type conversion consistent with project style 2018-04-18 09:09:29 +03:00
qz
bf5496aa8d page Resources can be loaded from indirect object 2018-04-11 21:47:48 +03:00
Gunnsteinn Hall
5b17c89af4 Fix for D/Decode operand. Fixes #149 2018-03-22 14:27:17 +00:00
Gunnsteinn Hall
413dcd757e Fix 2018-03-22 14:16:59 +00:00
Gunnsteinn Hall
a4fe3bded2 Add LICENSE.md with reference to AGPL and Commercial license. Add license header info to code. 2018-03-22 14:03:47 +00:00
Gunnsteinn Hall
d5396dd893 Fixes in extractor testing 2018-03-22 13:53:12 +00:00
Gunnsteinn Hall
4af19b929a License handling in extractor 2018-03-22 13:17:09 +00:00
Gunnsteinn Hall
817ea404b9 Extractor package with powerful text extraction capabilities and CMap handling. Closes #17 2018-03-22 13:01:04 +00:00
Gunnsteinn Hall
820be65467
Merge pull request #146 from kevinburke/newline-parsing
ExtractText: handle text matrix (Tm) boxes
2018-03-21 16:03:51 +00:00
Gunnsteinn Hall
695bd434a3
Merge pull request #138 from emmanuel-florent/master
Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
2018-03-21 10:34:10 +00:00
Gunnsteinn Hall
e71bb34362 Fix pagebreak generate page blocks. Support pagebreak in subchapter. 2018-03-20 14:21:03 +00:00
Gunnsteinn Hall
d95127579f Merge branch 'master' of https://github.com/llonchj/unidoc into llonchj-master 2018-03-20 13:16:59 +00:00
qz
b7efd634fe Parser crude optimizations.
- parsing functions how collect results in bytes.Buffer
- added benchmarks
- parseHexString avoids calling skipSpace
2018-03-20 04:37:12 +03:00
Kevin Burke
1726a4708c
ExtractText: handle text matrix (Tm) boxes
Previously Tm boxes would have their text extracted, but with no
formatting information between boxes. Now, if a box is lower on the
page than has previously been recorded, we insert a newline, and if
a box is further to the right on the same line than a previous box, we
insert a tab character.

This works well enough for parsing Clipper Card transaction histories.
You can see an example at https://clipper-csv.appspot.com.

Fixes #141.
2018-03-17 12:45:12 -07:00
qz
3f7e07ac9e fixes #144, using bytes.Buffer for more efficient string building 2018-03-05 13:04:27 +03:00
Gunnsteinn Hall
51f575080a Licensing updates 2018-02-23 14:07:26 +00:00
Jordi Llonch
7ae11755c8 Added Paragraph.Text function returning text content 2018-02-15 13:11:32 +11:00
Jordi Llonch
c9f172694c fix TableCell.SetBorderColor 2018-02-12 14:18:37 +11:00
Jordi Llonch
40f364059f PageBreak drawable implementation 2018-02-12 13:07:35 +11:00
Emmanuel Florent
f3adc44a4f Add func NewPdfAnnotationWidget() *PdfAnnotationWidget
AnnotationWidget are usefull for adding Signature widgets
2018-01-28 09:54:01 +01:00
Peter Williams
309e23f65d Merge branch 'master' of https://github.com/peterwilliams97/unidoc 2018-01-11 18:33:08 +11:00
Peter Williams
4a20c56de0 Merge remote-tracking branch 'upstream/master'
* upstream/master:
  Fix #131: Return error on pageNumber = 0
  Added SetForms to Creator
  Insert a space when TJ string is offset by more than a threshold
  Support for resampling images with BitsPerComponent != 8.
  dont encode glyphs when you only wrap
  Add support for specifying alternate image encoding/compression method (creator).
  Add commercial license to repository and full AGPL license text. Updated licensing text in README.
  Updated version
  Few fixes: Shadingtype 4 handling, inline image handling (compatibility), encoding name (compatibility)
  Improved CS handling with Separation, Lab color ranges.
  Fix in handling of indexed colorspace parsing inline images when using base=DeviceRGB rather than shortcut.  Closes #114.
  Support indexed cs in inline image. Fixes #113. Fixes in lab colorspace color processing.
  Allow array in DecodeParms in FlateDecoder of length 1 (containing dict).
  Fix problem with MultiEncoder DecodeParms.  Fixes #97.
  Limit maximum specified number of indirect objects when parsing xref stream.  Fixes #111.
  Error -> Debug log
2018-01-11 17:52:09 +11:00
Peter Williams
e4adfd91a3 Merge branch 'master' of https://github.com/peterwilliams97/unidoc
* 'master' of https://github.com/peterwilliams97/unidoc: (50 commits)
  Fixing lab colorspace component input ranges.  Fix Indexed cs Image to rgb conversion.
  Make float parsing more like gs
  Fixed Lab bounds
  Added dummy encodings
  Added dummy encodings
  Fix PS processing of dup operand. Fixes #98.
  Check sizes for memory allocation based on pdf user inputs. Fixes #107.
  Check to avoid division by zero.  Fixes #106.
  Add GetObjectNums
  Address go vet issues
  Fix comment typo
  Fixed some bugs found while getting pdf_descibe.go to work
  Address golint recommendations in core
  Address core golint recommendations in crypt, io
  Add check for base colorspace type when loading Indexed colorspace.  Fixes #95.
  Address more golint recommendations  #89
  Checks on stated byte lengths in xref stream objects.  Closes #94.
  Address golint recommendations. Add TODO comments for recommended future refactoring work in next major release.
  Only attempt to load annotation from a valid indirect object for annotation Popup entries.  Fixes #91.
  Address godoc code block line wrapping
  ...
2018-01-11 17:43:17 +11:00
Peter Williams
651ff90214 Set encoding when transforming patterns. Preserve SMasks when transforming image Xobjects 2018-01-11 16:25:04 +11:00
Armen Boursalian
6613014adf Fix #131: Return error on pageNumber = 0 2017-12-11 16:13:21 -08:00
Daniel Rhoden
f2fa55a63b Added SetForms to Creator 2017-11-08 14:54:59 -06:00
Nat Wilson
30ff60a44e Insert a space when TJ string is offset by more than a threshold
That threshold is hard-coded as -100 (is this font specific?)
2017-10-29 10:55:21 -07:00