Denys Smirnov
|
53687f854e
|
Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
# pdf/contentstream/processor.go
# pdf/extractor/text.go
# pdf/extractor/utils.go
# pdf/internal/textencoding/winansi.go
# pdf/model/font.go
# pdf/model/font_composite.go
# pdf/model/font_simple.go
# pdf/model/font_test.go
# pdf/model/fontfile.go
# pdf/model/fonts/ttfparser.go
# pdf/model/structures.go
|
2018-12-27 12:17:28 +02:00 |
|
Denys Smirnov
|
3687c83b37
|
errors should start with a lower case
|
2018-12-15 18:49:15 +05:00 |
|
Denys Smirnov
|
3f7ad73812
|
refactor some receiver and method names; fix typos in comments
|
2018-12-11 04:37:00 +02:00 |
|
Denys Smirnov
|
0a8b46daff
|
don't use generic receiver names; make sure receiver name is consistent
|
2018-12-09 21:47:15 +02:00 |
|
Denys Smirnov
|
9f0df8945d
|
don't use XXX for TODOs
|
2018-12-09 21:39:11 +02:00 |
|
Denys Smirnov
|
6d2c39043c
|
make sure comments begin with a type/function name
|
2018-12-09 20:22:33 +02:00 |
|
Denys Smirnov
|
99f3184879
|
define slices with a var instead of an empty literal
|
2018-12-09 19:28:50 +02:00 |
|
Gunnsteinn Hall
|
2b1c796a74
|
Addressing review comments
|
2018-11-30 23:01:04 +00:00 |
|
Gunnsteinn Hall
|
283c9bf778
|
Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into v3-peterwilliams97-extract.text.take2
|
2018-11-30 17:05:49 +00:00 |
|
Gunnsteinn Hall
|
33843599f2
|
Another round of addressing review comments
|
2018-11-30 16:53:48 +00:00 |
|
Peter Williams
|
f566fe5f68
|
Moved point.go and matrix.go back to their original locations.
|
2018-11-30 12:17:52 +11:00 |
|
Peter Williams
|
785a83e866
|
Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into extract.text
NOTE: Fixed a text_test.go regression by modifying getCharCodeMetrics().
|
2018-11-30 10:46:33 +11:00 |
|
Gunnsteinn Hall
|
520ab09a72
|
Addressing review comments
|
2018-11-28 23:25:17 +00:00 |
|
Peter Williams
|
da8544e68b
|
Moved Matrix code to model/matrix.go
|
2018-11-28 22:29:35 +11:00 |
|
Peter Williams
|
536c688001
|
Fixed orientation handling in text extraction.
|
2018-11-26 17:17:17 +11:00 |
|
Peter Williams
|
a815ca7271
|
Premultiply coordinate transforms to text matrix in text extraction.
|
2018-11-26 08:09:52 +11:00 |
|
Peter Williams
|
ea8a26a7dc
|
Fixed text matrix multiplication order.
|
2018-11-19 14:19:50 +11:00 |
|
Peter Williams
|
851aa267b1
|
Added test for position based text extraction
|
2018-11-12 11:04:09 +11:00 |
|
Peter Williams
|
85cb1db004
|
Fixed position sorting for text extraction for landscape text.
|
2018-11-10 21:19:02 +11:00 |
|
Peter Williams
|
a6ce81c001
|
Merge branch 'render.v3.hungarian' into extract
|
2018-11-02 15:13:48 +11:00 |
|
Peter Williams
|
3da4ffc5aa
|
Merge
|
2018-11-01 21:33:51 +11:00 |
|
Peter Williams
|
ee3e2a45a0
|
Update CTM
|
2018-10-29 15:49:15 +11:00 |
|
Gunnsteinn Hall
|
0d331d036f
|
Update receiver name in ContentStreamProcessor
|
2018-10-15 10:23:25 +00:00 |
|
Gunnsteinn Hall
|
aed2e3e30e
|
Update receiver names
|
2018-10-15 10:13:50 +00:00 |
|
Peter Williams
|
2c8c8e5c98
|
Removed debugging code.
|
2018-10-09 19:05:38 +11:00 |
|
Peter Williams
|
f6dc3e2fc3
|
First attempt at splitting words in text extraction using a space detection heuristic
|
2018-10-09 11:49:59 +11:00 |
|
Gunnsteinn Hall
|
3536d323fb
|
Support MK appearance characteristics for certain styles. Wrap content stream to fix bugs.
|
2018-10-05 01:59:19 +00:00 |
|
Gunnsteinn Hall
|
aea91f1ba9
|
Merge branch 'v3' into v3-enhance-forms
|
2018-09-29 16:59:16 +00:00 |
|
Gunnsteinn Hall
|
5ebb80233c
|
Add marked up content operators and cleanup.
|
2018-09-28 09:49:32 +00:00 |
|
Peter Williams
|
44563f2cae
|
Added fontMetrics to font loader and GetAverageCharWidth to PdfFont
|
2018-09-19 11:12:59 +10:00 |
|
Peter Williams
|
76902719df
|
Merge branch 'v3' of https://github.com/unidoc/unidoc into extract
|
2018-09-18 11:06:43 +10:00 |
|
Gunnsteinn Hall
|
23af4db2b3
|
Remove dot imports in contentstream pkg
|
2018-09-18 00:00:48 +00:00 |
|
Gunnsteinn Hall
|
e63a74cc9c
|
Merge branch 'master' into v3-merge-master-in
|
2018-09-06 09:45:04 +00:00 |
|
Peter Williams
|
5bacca3437
|
formatting changes
|
2018-09-06 15:17:41 +10:00 |
|
Peter Williams
|
f2c771b3bb
|
Merge branch 'render.v3.hungarian' into extract
|
2018-08-23 07:26:56 +10:00 |
|
Peter Williams
|
8ff8665149
|
First attempt at extraction based on a full PDF text parser.
|
2018-08-22 12:29:34 +10:00 |
|
Sakib Sami
|
44e98e2ced
|
Merge branch 'v3' into master
|
2018-08-14 01:56:45 +06:00 |
|
Sakib Sami
|
e879b00d57
|
- updated : Comments to godoc style
|
2018-08-13 23:36:22 +06:00 |
|
Sakib Sami
|
4e8d9d8e47
|
- Updated : SetBorderStyleMethod
- updated : Comments to godoc style
|
2018-08-13 01:58:28 +06:00 |
|
Gunnsteinn Hall
|
2cc0c74b80
|
Avoid endless loop when parsing operand in contenstream. Fixes #176
|
2018-08-09 22:51:21 +00:00 |
|
Gunnsteinn Hall
|
38157c3a51
|
Fix border corners and BasicLine Draw
|
2018-08-08 14:48:01 +00:00 |
|
Sakib Sami
|
c024c237d2
|
- Added : BorderTest
|
2018-08-08 08:09:06 +06:00 |
|
Gunnsteinn Hall
|
2bd003d464
|
Wrap function in content stream creator
|
2018-08-07 20:51:31 +00:00 |
|
Sakib Sami
|
bf0a51ea71
|
- Added : Color for basic line
|
2018-08-04 02:07:06 +06:00 |
|
Sakib Sami
|
9184246eb1
|
- Added : Code cleanup
|
2018-08-04 02:03:15 +06:00 |
|
Sakib Sami
|
f823d3b37e
|
- Added : Code cleanup
|
2018-08-04 00:51:37 +06:00 |
|
Sakib Sami
|
a3518f5d5e
|
- Added : BasicLine with test
|
2018-08-04 00:37:28 +06:00 |
|
Gunnsteinn Hall
|
96f358543a
|
Remove unintended BasicLine in draw
|
2018-08-03 10:26:45 +00:00 |
|
Gunnsteinn Hall
|
a487f6848a
|
godoc and golint fixes
|
2018-08-03 10:18:08 +00:00 |
|
Gunnsteinn Hall
|
6056cf828d
|
remove dot import and address golint in contentstream
|
2018-08-03 10:17:06 +00:00 |
|