1200 Commits

Author SHA1 Message Date
Denys Smirnov
41af4a14eb list dependencies for dep and go modules 2018-11-29 01:15:19 +02:00
Adrian-George Bostan
585470eebe Add styled paragraph support for internal link annotations 2018-11-28 22:19:30 +02:00
Adrian-George Bostan
e14d898abf Add styled paragraph support for external link annotations 2018-11-28 21:28:27 +02:00
Adrian-George Bostan
e89519d010 Add Clear method to PdfObjectArray 2018-11-28 21:24:20 +02:00
Peter Williams
da8544e68b Moved Matrix code to model/matrix.go 2018-11-28 22:29:35 +11:00
Peter Williams
ad83b1c948 In text extraction, split lines with tolerance on y coordinate. 2018-11-28 22:13:56 +11:00
Peter Williams
6529b42a70 Remove duplicate code. 2018-11-28 18:22:42 +11:00
Peter Williams
36a1148962 Combine diacritics in text extraction. 2018-11-28 18:06:03 +11:00
Peter Williams
f373881a48 Removed some unused struct fields. 2018-11-27 13:37:12 +11:00
Peter Williams
c898ce847a Removed non-text-extraction code. 2018-11-26 18:00:15 +11:00
Peter Williams
d241226931 Removed test PDFs. 2018-11-26 17:46:45 +11:00
Peter Williams
478c5dfe56 removed debug code 2018-11-26 17:45:41 +11:00
Peter Williams
209616ab52 File missed in previous commit. 2018-11-26 17:26:27 +11:00
Peter Williams
536c688001 Fixed orientation handling in text extraction. 2018-11-26 17:17:17 +11:00
Peter Williams
a815ca7271 Premultiply coordinate transforms to text matrix in text extraction. 2018-11-26 08:09:52 +11:00
Peter Williams
a2024b8e29 Use char width 250 for standard 14 font characters without given char metrics. 2018-11-23 11:21:51 +11:00
Peter Williams
92e3e455c2 Merge branch 'v3' of https://github.com/unidoc/unidoc into extract 2018-11-22 22:03:26 +11:00
Peter Williams
6e5e32dd92 Fixed encoding selection for standard 14 fonts. 2018-11-22 22:01:04 +11:00
Peter Williams
8b964f2008 Set font even when Tf operator is not between BT and ET. 2018-11-21 13:14:11 +11:00
Peter Williams
dcb2b14d55 Handle standard 14 TrueType fonts and stanard 14 font aliases in text extraction. 2018-11-20 17:49:37 +11:00
Peter Williams
cad144cec3 Handle missing widths in text extraction 2018-11-20 15:49:28 +11:00
Peter Williams
2f8b50af75 Fixed landscape rotation for text extraction.
Also compute metrics for standard 14 fonts when not created from dict.
2018-11-19 16:50:28 +11:00
Peter Williams
ea8a26a7dc Fixed text matrix multiplication order. 2018-11-19 14:19:50 +11:00
Gunnsteinn Hall
319d40718c
Merge pull request #254 from adrg/list-component
Add list component
2018-11-18 10:44:35 +00:00
Adrian-George Bostan
2c50caf6a4 Remove unnecessary util function 2018-11-18 11:13:53 +02:00
Peter Williams
a9019a50a3 Fixes for text extraction corpus testing.
- Correct matrix multiplication order in text.go
- Look up standard 14 font widths after applying custom encoding.
2018-11-18 17:21:30 +11:00
Adrian-George Bostan
9fd0b08297
Merge branch 'v3' into list-component 2018-11-17 18:59:04 +02:00
Gunnsteinn Hall
a1c0e0d7fe
Merge pull request #248 from dennwc/font_fixes
TTF parser fixes and tests
2018-11-17 14:16:47 +00:00
Gunnsteinn Hall
76463b44dc
Merge branch 'v3' into font_fixes 2018-11-17 14:11:58 +00:00
Gunnsteinn Hall
b5abc6925e
Merge pull request #247 from dennwc/fonts_minor_2
Minor improvements to the fonts encoders
2018-11-17 14:11:20 +00:00
Denys Smirnov
2d7d6334bc fonts: add tests for ttf parser 2018-11-17 15:03:38 +01:00
Denys Smirnov
86a30df78c fonts: floats should be signed 2018-11-17 15:03:34 +01:00
Gunnsteinn Hall
04e74d73f6
Merge branch 'v3' into fonts_minor_2 2018-11-17 14:02:12 +00:00
Gunnsteinn Hall
55e00727ea
Merge pull request #251 from dennwc/crypto_no_panics
Remove panics from crypto code, simplify the code
2018-11-17 13:41:32 +00:00
Gunnsteinn Hall
2752beb6f0 Release v2.2.0 2018-11-17 11:32:42 +00:00
Gunnsteinn Hall
138c412b7e
Merge pull request #246 from unidoc/fix-inlineimg-accept-fullname-params
Support more full name params in inline images.  Fixes #235
2018-11-17 11:29:45 +00:00
Gunnsteinn Hall
b89a074fcf
Merge branch 'master' into fix-inlineimg-accept-fullname-params 2018-11-17 11:25:25 +00:00
Adrian-George Bostan
3e9267c57c Improve list component test cases 2018-11-16 22:41:07 +02:00
Adrian-George Bostan
d6d91f07a5 Add List component test case 2018-11-16 22:32:58 +02:00
Adrian-George Bostan
2932b81dc2 Add AddTextItem method on the list component 2018-11-16 22:31:36 +02:00
Adrian-George Bostan
ff8557824c Improve list component documentation and fix golint notices 2018-11-16 21:40:14 +02:00
Adrian-George Bostan
7219022df1 Add List component 2018-11-16 21:23:46 +02:00
Peter Williams
851aa267b1 Added test for position based text extraction 2018-11-12 11:04:09 +11:00
Peter Williams
a1d5e8dc45 Cleaned up some comments. 2018-11-10 21:41:47 +11:00
Peter Williams
85cb1db004 Fixed position sorting for text extraction for landscape text. 2018-11-10 21:19:02 +11:00
Peter Williams
75aa370467 Updated font_test.go for treating æ and Æ as letters rather than ligatures. 2018-11-10 08:56:47 +11:00
Gunnsteinn Hall
ea294c8817
update readme 2018-11-09 10:52:57 +00:00
Peter Williams
70e65eb941 Merge branch 'render.v3.hungarian' into extract
Treat æ, Æ as letters rather than ligatures.
2018-11-09 09:25:36 +11:00
Peter Williams
4e8977bcba Merge branch 'render.v3.hungarian' into extract 2018-11-08 15:23:41 +11:00
Peter Williams
a86b679bb9 Merge branch 'v3' of https://github.com/unidoc/unidoc into render.v3.hungarian 2018-11-08 15:22:56 +11:00