81 Commits

Author SHA1 Message Date
Gunnsteinn Hall
9665959bcf Move model/fonts to model/internal/fonts - reducing export surface
- Move the folder
- Update imports
- Add type aliases to access needed types from model (fonts.StdFont, fonts.CharMetrics and the font names)
2019-03-12 19:08:37 +00:00
Gunnsteinn Hall
bf47fc5b6e Merge branch 'v3' into encodings 2019-01-05 17:18:16 +00:00
Denys Smirnov
aeea76f4dd fonts: read ttf font data once 2019-01-02 17:18:43 +02:00
Denys Smirnov
7a2cd35f48 fonts: rebuild font metrics tables based on runes for standard fonts 2019-01-01 22:40:11 +02:00
Denys Smirnov
1742cb9c89 textencoding: drop old simpleEncoder, use the new implementation 2019-01-01 21:17:57 +02:00
Denys Smirnov
3c5fc18b01 textencoding: refactor encodings; better handling for differences 2019-01-01 17:20:01 +02:00
Denys Smirnov
ac7696693b fonts: describe few issues with the code; remove unused cmap type 2019-01-01 17:19:58 +02:00
Denys Smirnov
f6506204d7 fonts: simplify code by getting width of runes in font instead of glyphs 2018-12-28 01:38:48 +02:00
Denys Smirnov
eb04b2d594 fonts: remove unused name field in char metrics 2018-12-28 01:08:47 +02:00
Denys Smirnov
8835230856 model: fix tests after the merge 2018-12-27 12:37:32 +02:00
Denys Smirnov
53687f854e Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
#	pdf/contentstream/processor.go
#	pdf/extractor/text.go
#	pdf/extractor/utils.go
#	pdf/internal/textencoding/winansi.go
#	pdf/model/font.go
#	pdf/model/font_composite.go
#	pdf/model/font_simple.go
#	pdf/model/font_test.go
#	pdf/model/fontfile.go
#	pdf/model/fonts/ttfparser.go
#	pdf/model/structures.go
2018-12-27 12:17:28 +02:00
Denys Smirnov
85e1a02ac8 model: define an unexported pdfFont interface and remove error cases 2018-12-19 13:54:45 +05:00
Denys Smirnov
7f667d8fbb model: remove Standard14Font in favor of fonts.StdFont; resolves #269 2018-12-19 13:43:09 +05:00
Denys Smirnov
e3704defc7 rename Typ1 font to StdFont 2018-12-15 19:39:55 +05:00
Denys Smirnov
0ef989c713 fonts: group similar fonts to a single file 2018-12-15 18:47:39 +05:00
Denys Smirnov
2658fe9c06 assert types for the new code as well 2018-12-07 18:43:24 +02:00
Denys Smirnov
7cdbb0c572 Merge remote-tracking branch 'origin/v3' into extract.text
# Conflicts:
#	pdf/internal/textencoding/truetype.go
#	pdf/model/font.go
#	pdf/model/font_composite.go
#	pdf/model/font_simple.go
#	pdf/model/font_test.go
#	pdf/model/fonts/ttfparser.go
2018-12-07 18:30:37 +02:00
Gunnsteinn Hall
1f56c18454 Address review comments 2018-12-07 10:32:49 +00:00
Peter Williams
c4a39a1353 Look for CharMetrics for char code 32 when finding space width. 2018-12-02 13:12:10 +11:00
Peter Williams
835f329c28 Merge branch 'extract.text' of https://github.com/peterwilliams97/unidoc into extract.text 2018-12-02 10:02:16 +11:00
Peter Williams
9c258551ad Documented font code. Fall back to StandardEncoding when no encoding is specified for a font. 2018-12-02 09:14:58 +11:00
Gunnsteinn Hall
2b1c796a74 Addressing review comments 2018-11-30 23:01:04 +00:00
Gunnsteinn Hall
33843599f2 Another round of addressing review comments 2018-11-30 16:53:48 +00:00
Denys Smirnov
fb4a087a93 textencoding: introduce GlyphName type 2018-11-29 23:24:40 +02:00
Denys Smirnov
6fddd80eba textencoding: assert the type of differences map 2018-11-29 04:40:25 +02:00
Denys Smirnov
b91c1b8c61 model: remove unnecessary typ names in font initialization 2018-11-29 04:19:29 +02:00
Denys Smirnov
46d22eac31 fonts: introduce types for GIDs and char codes; fix shadowing bug 2018-11-29 04:19:29 +02:00
Denys Smirnov
ab62ff5060 fonts: specify rune type as a key for Chars and runeToWidth 2018-11-29 04:19:29 +02:00
Gunnsteinn Hall
e6b768c06c Remove GetAverageCharWidth 2018-11-29 01:09:34 +00:00
Peter Williams
36a1148962 Combine diacritics in text extraction. 2018-11-28 18:06:03 +11:00
Peter Williams
a2024b8e29 Use char width 250 for standard 14 font characters without given char metrics. 2018-11-23 11:21:51 +11:00
Peter Williams
6e5e32dd92 Fixed encoding selection for standard 14 fonts. 2018-11-22 22:01:04 +11:00
Peter Williams
8b964f2008 Set font even when Tf operator is not between BT and ET. 2018-11-21 13:14:11 +11:00
Peter Williams
dcb2b14d55 Handle standard 14 TrueType fonts and standard 14 font aliases in text extraction. 2018-11-20 17:49:37 +11:00
Peter Williams
cad144cec3 Handle missing widths in text extraction 2018-11-20 15:49:28 +11:00
Peter Williams
a9019a50a3 Fixes for text extraction corpus testing.
- Correct matrix multiplication order in text.go
- Look up standard 14 font widths after applying custom encoding.
2018-11-18 17:21:30 +11:00
Peter Williams
a1d5e8dc45 Cleaned up some comments. 2018-11-10 21:41:47 +11:00
Peter Williams
a2342ec6c6 First attempt at getting font metrics by character code. 2018-11-08 15:20:12 +11:00
Peter Williams
3da4ffc5aa Merge 2018-11-01 21:33:51 +11:00
Peter Williams
b0c440dd00 Fixed text position tracking. 2018-10-30 21:55:30 +11:00
Peter Williams
b23600c9f4 Merge branch 'render.v3.hungarian' into extract 2018-10-23 10:59:59 +11:00
Peter Williams
5d15dc97dd Removed code with problematic provenance. 2018-10-23 10:44:58 +11:00
Peter Williams
86108bd2b9 Build font descriptor literals from .afm files 2018-10-23 10:36:38 +11:00
Peter Williams
45f6c09e39 Merge branch 'render.v3.hungarian' into extract 2018-10-19 10:05:02 +11:00
Peter Williams
45228219b5 Added PdfFont.FontDescriptor() which always returns a PdfFontDescriptor, possibly a builtin one for
the standard 14 fonts.
2018-10-18 21:12:15 +11:00
Peter Williams
2452973cfe Don't add /Encoding entry to standard 14 font dicts.
Moved the standard 14 font encoders to a separate field pdfFontSimple.std14Encoder.
2018-10-16 14:50:43 +11:00
Gunnsteinn Hall
bc6391200a Avoid outputting invalid Encoding name for generated standard fonts (use font encoding instead) 2018-10-10 22:44:55 +00:00
Gunnsteinn Hall
f4deb858ba Fix for loading standard fonts with Encoding difference maps 2018-10-09 18:14:34 +11:00
Gunnsteinn Hall
aea91f1ba9 Merge branch 'v3' into v3-enhance-forms 2018-09-29 16:59:16 +00:00
Peter Williams
24d522bdb2 Merge branch 'v3' of https://github.com/unidoc/unidoc into extract 2018-09-24 15:25:44 +10:00