109 Commits

Author SHA1 Message Date
Denys Smirnov
53687f854e Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
#	pdf/contentstream/processor.go
#	pdf/extractor/text.go
#	pdf/extractor/utils.go
#	pdf/internal/textencoding/winansi.go
#	pdf/model/font.go
#	pdf/model/font_composite.go
#	pdf/model/font_simple.go
#	pdf/model/font_test.go
#	pdf/model/fontfile.go
#	pdf/model/fonts/ttfparser.go
#	pdf/model/structures.go
2018-12-27 12:17:28 +02:00
Denys Smirnov
3687c83b37 errors should start with a lower case 2018-12-15 18:49:15 +05:00
Denys Smirnov
3f7ad73812 refactor some receiver and method names; fix typos in comments 2018-12-11 04:37:00 +02:00
Denys Smirnov
0a8b46daff don't use generic receiver names; make sure receiver name is consistent 2018-12-09 21:47:15 +02:00
Denys Smirnov
9f0df8945d don't use XXX for TODOs 2018-12-09 21:39:11 +02:00
Denys Smirnov
6d2c39043c make sure comments begin with a type/function name 2018-12-09 20:22:33 +02:00
Denys Smirnov
99f3184879 define slices with a var instead of an empty literal 2018-12-09 19:28:50 +02:00
Gunnsteinn Hall
2b1c796a74 Addressing review comments 2018-11-30 23:01:04 +00:00
Gunnsteinn Hall
283c9bf778 Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into v3-peterwilliams97-extract.text.take2 2018-11-30 17:05:49 +00:00
Gunnsteinn Hall
33843599f2 Another round of addressing review comments 2018-11-30 16:53:48 +00:00
Peter Williams
f566fe5f68 Moved point.go and matrix.go back to their original locations. 2018-11-30 12:17:52 +11:00
Peter Williams
785a83e866 Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into extract.text
NOTE: Fixed a text_test.go regression by modifying getCharCodeMetrics().
2018-11-30 10:46:33 +11:00
Gunnsteinn Hall
520ab09a72 Addressing review comments 2018-11-28 23:25:17 +00:00
Peter Williams
da8544e68b Moved Matrix code to model/matrix.go 2018-11-28 22:29:35 +11:00
Peter Williams
536c688001 Fixed orientation handling in text extraction. 2018-11-26 17:17:17 +11:00
Peter Williams
a815ca7271 Premultiply coordinate transforms to text matrix in text extraction. 2018-11-26 08:09:52 +11:00
Peter Williams
ea8a26a7dc Fixed text matrix multiplication order. 2018-11-19 14:19:50 +11:00
Peter Williams
851aa267b1 Added test for position based text extraction 2018-11-12 11:04:09 +11:00
Peter Williams
85cb1db004 Fixed position sorting for text extraction for landscape text. 2018-11-10 21:19:02 +11:00
Peter Williams
a6ce81c001 Merge branch 'render.v3.hungarian' into extract 2018-11-02 15:13:48 +11:00
Peter Williams
3da4ffc5aa Merge 2018-11-01 21:33:51 +11:00
Peter Williams
ee3e2a45a0 Update CTM 2018-10-29 15:49:15 +11:00
Gunnsteinn Hall
0d331d036f Update receiver name in ContentStreamProcessor 2018-10-15 10:23:25 +00:00
Gunnsteinn Hall
aed2e3e30e Update receiver names 2018-10-15 10:13:50 +00:00
Peter Williams
2c8c8e5c98 Removed debugging code. 2018-10-09 19:05:38 +11:00
Peter Williams
f6dc3e2fc3 First attempt at splitting words in text extraction using a space detection heuristic 2018-10-09 11:49:59 +11:00
Gunnsteinn Hall
3536d323fb Support MK appearance characteristics for certain styles. Wrap content stream to fix bugs. 2018-10-05 01:59:19 +00:00
Gunnsteinn Hall
aea91f1ba9 Merge branch 'v3' into v3-enhance-forms 2018-09-29 16:59:16 +00:00
Gunnsteinn Hall
5ebb80233c Add marked up content operators and cleanup. 2018-09-28 09:49:32 +00:00
Peter Williams
44563f2cae Added fontMetrics to font loader and GetAverageCharWidth to PdfFont 2018-09-19 11:12:59 +10:00
Peter Williams
76902719df Merge branch 'v3' of https://github.com/unidoc/unidoc into extract 2018-09-18 11:06:43 +10:00
Gunnsteinn Hall
23af4db2b3 Remove dot imports in contentstream pkg 2018-09-18 00:00:48 +00:00
Gunnsteinn Hall
e63a74cc9c Merge branch 'master' into v3-merge-master-in 2018-09-06 09:45:04 +00:00
Peter Williams
5bacca3437 formatting changes 2018-09-06 15:17:41 +10:00
Peter Williams
f2c771b3bb Merge branch 'render.v3.hungarian' into extract 2018-08-23 07:26:56 +10:00
Peter Williams
8ff8665149 First attempt at extraction based on a full PDF text parser. 2018-08-22 12:29:34 +10:00
Sakib Sami
44e98e2ced
Merge branch 'v3' into master 2018-08-14 01:56:45 +06:00
Sakib Sami
e879b00d57 - updated : Comments to godoc style 2018-08-13 23:36:22 +06:00
Sakib Sami
4e8d9d8e47 - Updated : SetBorderStyleMethod
- updated : Comments to godoc style
2018-08-13 01:58:28 +06:00
Gunnsteinn Hall
2cc0c74b80 Avoid endless loop when parsing operand in contenstream. Fixes #176 2018-08-09 22:51:21 +00:00
Gunnsteinn Hall
38157c3a51 Fix border corners and BasicLine Draw 2018-08-08 14:48:01 +00:00
Sakib Sami
c024c237d2 - Added : BorderTest 2018-08-08 08:09:06 +06:00
Gunnsteinn Hall
2bd003d464 Wrap function in content stream creator 2018-08-07 20:51:31 +00:00
Sakib Sami
bf0a51ea71 - Added : Color for basic line 2018-08-04 02:07:06 +06:00
Sakib Sami
9184246eb1 - Added : Code cleanup 2018-08-04 02:03:15 +06:00
Sakib Sami
f823d3b37e - Added : Code cleanup 2018-08-04 00:51:37 +06:00
Sakib Sami
a3518f5d5e - Added : BasicLine with test 2018-08-04 00:37:28 +06:00
Gunnsteinn Hall
96f358543a Remove unintended BasicLine in draw 2018-08-03 10:26:45 +00:00
Gunnsteinn Hall
a487f6848a godoc and golint fixes 2018-08-03 10:18:08 +00:00
Gunnsteinn Hall
6056cf828d remove dot import and address golint in contentstream 2018-08-03 10:17:06 +00:00