Peter Williams
|
785a83e866
|
Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into extract.text
NOTE: Fixed a text_test.go regression by modifying getCharCodeMetrics().
|
2018-11-30 10:46:33 +11:00 |
|
Denys Smirnov
|
0436f2c974
|
validate shex length in cmaps; add comments
|
2018-11-29 23:43:00 +02:00 |
|
Denys Smirnov
|
fb4a087a93
|
textencoding: introduce GlyphName type
|
2018-11-29 23:24:40 +02:00 |
|
Gunnsteinn Hall
|
c9439c80ed
|
Merge branch 'v3' into extract.text
|
2018-11-29 08:51:58 +00:00 |
|
Peter Williams
|
1cea79b8ef
|
Merge pull request #5 from unidoc/v3-peterwilliams97-extract.text
Cleaning up v3 extract.text
|
2018-11-29 18:03:50 +11:00 |
|
Peter Williams
|
f131af7b5a
|
File missed in previous commit.
|
2018-11-29 17:50:43 +11:00 |
|
Peter Williams
|
88c9b05dff
|
Merge branch 'v3' of https://github.com/unidoc/unidoc into extract.text
|
2018-11-29 17:12:40 +11:00 |
|
Peter Williams
|
94dca18b60
|
removed a comment
|
2018-11-29 17:09:45 +11:00 |
|
Peter Williams
|
7bbcec65fa
|
Made Matrix and Point structs more general and moved them to their own files in pdf/model.
|
2018-11-29 17:04:20 +11:00 |
|
Denys Smirnov
|
27efe08a26
|
cmap: remove global for missing code; should replace the rune afterwards
|
2018-11-29 04:52:23 +02:00 |
|
Denys Smirnov
|
e79be78aae
|
textencoding: simplify the code of computeTables
|
2018-11-29 04:45:39 +02:00 |
|
Denys Smirnov
|
8a4c4069b7
|
textencoding: unexport CodeToGlyph field
|
2018-11-29 04:42:35 +02:00 |
|
Denys Smirnov
|
6fddd80eba
|
textencoding: assert the type of differences map
|
2018-11-29 04:40:25 +02:00 |
|
Denys Smirnov
|
7c8d88185c
|
fonts: assert type of another map; add some comments
|
2018-11-29 04:30:37 +02:00 |
|
Denys Smirnov
|
b91c1b8c61
|
model: remove unnecessary typ names in font initialization
|
2018-11-29 04:19:29 +02:00 |
|
Denys Smirnov
|
46d22eac31
|
fonts: introduce types for GIDs and char codes; fix shadowing bug
|
2018-11-29 04:19:29 +02:00 |
|
Denys Smirnov
|
ab62ff5060
|
fonts: specify rune type as a key for Chars and runeToWidth
|
2018-11-29 04:19:29 +02:00 |
|
Denys Smirnov
|
6c0fd1e780
|
cmap: mapped values are runes, not strings
|
2018-11-29 04:19:29 +02:00 |
|
Gunnsteinn Hall
|
dd145a2d15
|
Merge pull request #258 from dennwc/stable_creator
Make PDF output stable when using custom fonts
|
2018-11-29 01:16:24 +00:00 |
|
Gunnsteinn Hall
|
e6b768c06c
|
Remove GetAverageCharWidth
|
2018-11-29 01:09:34 +00:00 |
|
Denys Smirnov
|
5b0eaf3f3a
|
creator: make output stable when using custom fonts; fixes #232
|
2018-11-29 02:56:26 +02:00 |
|
Gunnsteinn Hall
|
c3e3b326e4
|
Merge pull request #257 from dennwc/deps
List dependencies for Dep and Go modules
|
2018-11-28 23:55:46 +00:00 |
|
Gunnsteinn Hall
|
f04f83b271
|
Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into v3-peterwilliams97-extract.text
|
2018-11-28 23:33:31 +00:00 |
|
Gunnsteinn Hall
|
d29f9a6a34
|
Adding Height and Width methods for PdfRectangle
|
2018-11-28 23:25:31 +00:00 |
|
Gunnsteinn Hall
|
520ab09a72
|
Addressing review comments
|
2018-11-28 23:25:17 +00:00 |
|
Denys Smirnov
|
41af4a14eb
|
list dependencies for dep and go modules
|
2018-11-29 01:15:19 +02:00 |
|
Adrian-George Bostan
|
585470eebe
|
Add styled paragraph support for internal link annotations
|
2018-11-28 22:19:30 +02:00 |
|
Adrian-George Bostan
|
e14d898abf
|
Add styled paragraph support for external link annotations
|
2018-11-28 21:28:27 +02:00 |
|
Adrian-George Bostan
|
e89519d010
|
Add Clear method to PdfObjectArray
|
2018-11-28 21:24:20 +02:00 |
|
Peter Williams
|
da8544e68b
|
Moved Matrix code to model/matrix.go
|
2018-11-28 22:29:35 +11:00 |
|
Peter Williams
|
ad83b1c948
|
In text extraction, split lines with tolerance on y coordinate.
|
2018-11-28 22:13:56 +11:00 |
|
Peter Williams
|
6529b42a70
|
Remove duplicate code.
|
2018-11-28 18:22:42 +11:00 |
|
Peter Williams
|
36a1148962
|
Combine diacritics in text extraction.
|
2018-11-28 18:06:03 +11:00 |
|
Peter Williams
|
f373881a48
|
Removed some unused struct fields.
|
2018-11-27 13:37:12 +11:00 |
|
Peter Williams
|
c898ce847a
|
Removed non-text-extraction code.
|
2018-11-26 18:00:15 +11:00 |
|
Peter Williams
|
d241226931
|
Removed test PDFs.
|
2018-11-26 17:46:45 +11:00 |
|
Peter Williams
|
478c5dfe56
|
removed debug code
|
2018-11-26 17:45:41 +11:00 |
|
Peter Williams
|
209616ab52
|
File missed in previous commit.
|
2018-11-26 17:26:27 +11:00 |
|
Peter Williams
|
536c688001
|
Fixed orientation handling in text extraction.
|
2018-11-26 17:17:17 +11:00 |
|
Peter Williams
|
a815ca7271
|
Premultiply coordinate transforms to text matrix in text extraction.
|
2018-11-26 08:09:52 +11:00 |
|
Peter Williams
|
a2024b8e29
|
Use char width 250 for standard 14 font characters without given char metrics.
|
2018-11-23 11:21:51 +11:00 |
|
Peter Williams
|
92e3e455c2
|
Merge branch 'v3' of https://github.com/unidoc/unidoc into extract
|
2018-11-22 22:03:26 +11:00 |
|
Peter Williams
|
6e5e32dd92
|
Fixed encoding selection for standard 14 fonts.
|
2018-11-22 22:01:04 +11:00 |
|
Peter Williams
|
8b964f2008
|
Set font even when Tf operator is not between BT and ET.
|
2018-11-21 13:14:11 +11:00 |
|
Peter Williams
|
dcb2b14d55
|
Handle standard 14 TrueType fonts and stanard 14 font aliases in text extraction.
|
2018-11-20 17:49:37 +11:00 |
|
Peter Williams
|
cad144cec3
|
Handle missing widths in text extraction
|
2018-11-20 15:49:28 +11:00 |
|
Peter Williams
|
2f8b50af75
|
Fixed landscape rotation for text extraction.
Also compute metrics for standard 14 fonts when not created from dict.
|
2018-11-19 16:50:28 +11:00 |
|
Peter Williams
|
ea8a26a7dc
|
Fixed text matrix multiplication order.
|
2018-11-19 14:19:50 +11:00 |
|
Gunnsteinn Hall
|
319d40718c
|
Merge pull request #254 from adrg/list-component
Add list component
|
2018-11-18 10:44:35 +00:00 |
|
Adrian-George Bostan
|
2c50caf6a4
|
Remove unnecessary util function
|
2018-11-18 11:13:53 +02:00 |
|