10 Commits

Author SHA1 Message Date
Peter Williams
9de46c5b9f Noted that text extractor is an intermediate version 2018-06-28 11:11:43 +10:00
Peter Williams
4cc6c14a8e Fall back to font encoding when ToUnicode doesn't match 2018-06-27 22:01:17 +10:00
Peter Williams
2dcf8e0cdd Added more missing changes 2018-06-27 16:59:35 +10:00
Peter Williams
759a1dd882 changes left out of last commit 2018-06-27 16:46:33 +10:00
Peter Williams
d184031903 Updated the text extractor to use the new font code 2018-06-27 16:31:28 +10:00
Peter Williams
199a74dbd8 Major changes to font code
- Added Type1 font parsing.
- Added Standard 14 font parsing.
- Fixed some bugs in cmap code.
- Started re-structuring of font code. Moved common font fields to `fontSkeleton`
2018-06-27 12:25:59 +10:00
Gunnsteinn Hall
a4fe3bded2 Add LICENSE.md with reference to AGPL and Commercial license. Add license header info to code. 2018-03-22 14:03:47 +00:00
Gunnsteinn Hall
d5396dd893 Fixes in extractor testing 2018-03-22 13:53:12 +00:00
Gunnsteinn Hall
4af19b929a License handling in extractor 2018-03-22 13:17:09 +00:00
Gunnsteinn Hall
817ea404b9 Extractor package with powerful text extraction capabilities and CMap handling. Closes #17 2018-03-22 13:01:04 +00:00