Denys Smirnov
|
53687f854e
|
Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
# pdf/contentstream/processor.go
# pdf/extractor/text.go
# pdf/extractor/utils.go
# pdf/internal/textencoding/winansi.go
# pdf/model/font.go
# pdf/model/font_composite.go
# pdf/model/font_simple.go
# pdf/model/font_test.go
# pdf/model/fontfile.go
# pdf/model/fonts/ttfparser.go
# pdf/model/structures.go
|
2018-12-27 12:17:28 +02:00 |
|
Denys Smirnov
|
217f984033
|
fonts: make standard font names type-safe
|
2018-12-19 16:55:27 +05:00 |
|
Denys Smirnov
|
7f667d8fbb
|
model: remove Standard14Font in favor of fonts.StdFont; resolves #269
|
2018-12-19 13:43:09 +05:00 |
|
Denys Smirnov
|
e3704defc7
|
rename Typ1 font to StdFont
|
2018-12-15 19:39:55 +05:00 |
|
Denys Smirnov
|
3687c83b37
|
errors should start with a lower case
|
2018-12-15 18:49:15 +05:00 |
|
Denys Smirnov
|
d3664d0f85
|
fonts: make metric tables for type1 fonts more compact by sharing glyphs
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
3c8e70256d
|
fonts: reuse metrics tables where possible
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
0ef989c713
|
fonts: group similar fonts to a single file
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
3b1a92701f
|
fonts: remove redundant Type1 font interface implementations
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
59f694d99f
|
fonts: remove broken SetEncoder method for most fonts
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
81bb03763b
|
font: discovered a bug in SetEncoder
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
11081b20c5
|
fonts: clarify cid to gid mapping
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
2274cbdf8c
|
fonts: add a function to make a text encoder from ttf font
|
2018-12-15 18:47:39 +05:00 |
|
Denys Smirnov
|
0a8b46daff
|
don't use generic receiver names; make sure receiver name is consistent
|
2018-12-09 21:47:15 +02:00 |
|
Denys Smirnov
|
9f0df8945d
|
don't use XXX for TODOs
|
2018-12-09 21:39:11 +02:00 |
|
Denys Smirnov
|
6d2c39043c
|
make sure comments begin with a type/function name
|
2018-12-09 20:22:33 +02:00 |
|
Denys Smirnov
|
99f3184879
|
define slices with a var instead of an empty literal
|
2018-12-09 19:28:50 +02:00 |
|
Denys Smirnov
|
7cdbb0c572
|
Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
# pdf/internal/textencoding/truetype.go
# pdf/model/font.go
# pdf/model/font_composite.go
# pdf/model/font_simple.go
# pdf/model/font_test.go
# pdf/model/fonts/ttfparser.go
|
2018-12-07 18:30:37 +02:00 |
|
Peter Williams
|
835f329c28
|
Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into extract.text
|
2018-12-02 10:02:16 +11:00 |
|
Peter Williams
|
9c258551ad
|
Documented font code. Fall back to StandardEncoding when no encoding is speficied for a font.
|
2018-12-02 09:14:58 +11:00 |
|
Gunnsteinn Hall
|
2b1c796a74
|
Addressing review comments
|
2018-11-30 23:01:04 +00:00 |
|
Gunnsteinn Hall
|
33843599f2
|
Another round of addressing review comments
|
2018-11-30 16:53:48 +00:00 |
|
Denys Smirnov
|
fb4a087a93
|
textencoding: introduce GlyphName type
|
2018-11-29 23:24:40 +02:00 |
|
Denys Smirnov
|
7c8d88185c
|
fonts: assert type of another map; add some comments
|
2018-11-29 04:30:37 +02:00 |
|
Denys Smirnov
|
46d22eac31
|
fonts: introduce types for GIDs and char codes; fix shadowing bug
|
2018-11-29 04:19:29 +02:00 |
|
Denys Smirnov
|
ab62ff5060
|
fonts: specify rune type as a key for Chars and runeToWidth
|
2018-11-29 04:19:29 +02:00 |
|
Denys Smirnov
|
6c0fd1e780
|
cmap: mapped values are runes, not strings
|
2018-11-29 04:19:29 +02:00 |
|
Gunnsteinn Hall
|
e6b768c06c
|
Remove GetAverageCharWidth
|
2018-11-29 01:09:34 +00:00 |
|
Gunnsteinn Hall
|
520ab09a72
|
Addressing review comments
|
2018-11-28 23:25:17 +00:00 |
|
Peter Williams
|
92e3e455c2
|
Merge branch 'v3' of https://github.com/unidoc/unidoc into extract
|
2018-11-22 22:03:26 +11:00 |
|
Peter Williams
|
8b964f2008
|
Set font even when Tf operator is not between BT and ET.
|
2018-11-21 13:14:11 +11:00 |
|
Peter Williams
|
cad144cec3
|
Handle missing widths in text extraction
|
2018-11-20 15:49:28 +11:00 |
|
Denys Smirnov
|
2d7d6334bc
|
fonts: add tests for ttf parser
|
2018-11-17 15:03:38 +01:00 |
|
Denys Smirnov
|
86a30df78c
|
fonts: floats should be signed
|
2018-11-17 15:03:34 +01:00 |
|
Denys Smirnov
|
c8c7a03896
|
fonts: fix glyph id bounds check
|
2018-11-07 22:09:57 +02:00 |
|
Denys Smirnov
|
08c1fe4ed4
|
fonts: remove unused field
|
2018-11-07 22:09:57 +02:00 |
|
Peter Williams
|
3da4ffc5aa
|
Merge
|
2018-11-01 21:33:51 +11:00 |
|
Peter Williams
|
5e8ca9c18c
|
Fixed code->glyph mapping for TrueType fonts for raw number gid
|
2018-10-29 09:08:32 +11:00 |
|
Peter Williams
|
b23600c9f4
|
Merge branch 'render.v3.hungarian' into extract
|
2018-10-23 10:59:59 +11:00 |
|
Peter Williams
|
5d15dc97dd
|
Removed code with problematic provenance.
|
2018-10-23 10:44:58 +11:00 |
|
Peter Williams
|
86108bd2b9
|
Build font descriptor literals from .afm files
|
2018-10-23 10:36:38 +11:00 |
|
Peter Williams
|
28c99083d4
|
Merge branch 'render.v3.hungarian' into extract
|
2018-10-19 13:32:34 +11:00 |
|
Peter Williams
|
ade6fa0405
|
Added tcpdf licence
|
2018-10-19 13:31:25 +11:00 |
|
Peter Williams
|
45f6c09e39
|
Merge branch 'render.v3.hungarian' into extract
|
2018-10-19 10:05:02 +11:00 |
|
Peter Williams
|
b48010c75b
|
Fixed typo
|
2018-10-18 21:39:16 +11:00 |
|
Peter Williams
|
45228219b5
|
Added PdfFont.FontDescriptor() which always returns a PdfFontDescriptor, possibly a builtin one for
the standard 14 fonts.
|
2018-10-18 21:12:15 +11:00 |
|
Peter Williams
|
f6dc3e2fc3
|
First attempt at splitting words in text extraction using a space detection heuristic
|
2018-10-09 11:49:59 +11:00 |
|
Gunnsteinn Hall
|
aea91f1ba9
|
Merge branch 'v3' into v3-enhance-forms
|
2018-09-29 16:59:16 +00:00 |
|
Peter Williams
|
f953c11452
|
Don't return errors for TrueType font file tables with no PostScript entry in their "name" table.
This is needed for PDFs created with Tesseract.
|
2018-09-24 18:02:02 +10:00 |
|
Peter Williams
|
b0f5329425
|
Allow TrueType font files to not have PostScript entries in their "name" table.
|
2018-09-24 17:53:12 +10:00 |
|