28 Commits

Author SHA1 Message Date
UniDoc Build
7bc673892f prepare release 2021-07-14 21:15:02 +00:00
UniDoc Build
9d8efb87a8 prepare release 2021-06-21 14:01:56 +00:00
UniDoc Build
edb7c66944 prepare release 2021-05-31 17:17:31 +00:00
UniDoc Build
aa9968c6af prepare release 2021-05-11 00:01:27 +00:00
UniDoc Build
b221a76c5e prepare release 2021-04-23 20:28:14 +00:00
UniDoc Build
596e8b8b8a prepare release 2021-04-17 13:46:54 +00:00
UniDoc Build
dada0fe1d4 prepare release 2021-04-06 22:35:37 +00:00
UniDoc Build
e309710fcd prepare release 2021-03-23 23:12:52 +00:00
UniDoc Build
9a2a3ba8f6 prepare release 2021-03-13 21:28:23 +00:00
UniDoc Build
ec7f5e55c3 prepare release 2021-02-22 02:29:48 +00:00
UniDoc Build
8b10191fd5 prepare release 2021-02-11 10:35:13 +00:00
UniDoc Build
4b16f3c2ce prepare release 2021-01-26 01:31:56 +00:00
UniDoc Build
6ec1f6abf1 prepare release 2021-01-07 14:20:10 +00:00
UniDoc Build
ec282cd9c5 prepare release 2020-12-06 13:03:03 +00:00
UniDoc Build
bafd659395 prepare release 2020-11-23 22:15:56 +00:00
UniDoc Build
79e32364de prepare release 2020-11-11 18:48:37 +00:00
UniDoc Build
22540b937c prepare release 2020-10-19 10:58:10 +00:00
UniDoc Build
56a210342e prepare release 2020-10-12 14:17:59 +00:00
UniDoc Build
87cbc66cbd prepare release 2020-10-05 19:28:24 +00:00
UniDoc Build
22ca2c0eed prepare release 2020-09-28 23:18:17 +00:00
UniDoc Build
9107a86674 prepare release 2020-09-21 01:20:10 +00:00
UniDoc Build
b991a36456 prepare release 2020-09-14 09:32:45 +00:00
UniDoc Build
fd3b669a36 prepare release 2020-09-07 00:23:12 +00:00
UniDoc Build
61b6580cb9 prepare release 2020-08-31 21:12:07 +00:00
UniDoc Build
1501d07a74 prepare release 2020-08-27 21:45:09 +00:00
Adrian-George Bostan
d961079c5d Add basic image rendering support (#266)
* Add render package
* Add text state
* Add more text operators
* Remove unnecessary files
* Add text font
* Add custom text render method
* Improve text rendering method
* Rename text state methods
* Refactor and document context interface
* Refactor text begin/end operators
* Fix graphics state transformations
* Keep original font when doing font substitution
* Take page cropbox into account
* Revert to substitution font if original font measurement is 0
* Add font substitution package
* Implement additional transform.Point methods
* Use transform.Point in the image context package
* Remove unneeded functionality from the render image package
* Fix golint notices in the image rendering package
* Fix go vet notices in the render package
* Fix golint notices in the top-level render package
* Improve render context package documentation
* Document context text state struct.
* Document context text font struct.
* Minor logging improvements
* Add license disclaimer to the render package files
* Avoid using package aliases where possible
* Change style of section comments
* Adapt render package import style to follow the developer guide
* Improve documentation for the internal matrix implementation
* Update render package dependency versions
* Apply crop box post render
* Account for offset media boxes
* Improve metrics of rendered characters
* Fix text matrix translation
* Change priority of fonts used for measuring rendered characters
* Skip invalid m and l operators on image rendering
* Small fix for v operator
* Fix rendered characters spacing issues
* Refactor naming of internal render packages
2020-03-02 21:22:54 +00:00
Peter Williams
9ebcfcf168 Finding bounding boxes of substrings of extracted text. (#109)
* Added text bounding box extraction.
* Add `font` field to textMark struct; create a new method `TextComponents` to retrieve all the text components of the extracted text in the page, with position and character information
* Reorganizing extractor/text.go
* Added a text extraction position test.
* Added another text extraction location test.
* Text extraction location testing.
* Added tests for text extraction with location information.
* Cleaned up text extraction tests. No changes to functionality.
* Simplifying text extraction code.
* Simplified line construction in text.go
* Returning TextMarks in TextMarkArray, which is based on PdfObjectArray but read-only, so not pointers.
* Added text extraction to show PDFs marked up with bounding boxes of substrings in extracted text.
* Add comments explaining how to calculate text bounding boxes.
* Made text_test.go naming consistent with function comments in text.go
* Use tm, pt, tl for textMark/TextMark, PageText and TextLine receivers and local variables.
* Uncommented text stress test. Use go test --short to skip
* TextMark.Offset is now an index into the extracted text. It was an index into []rune(text)
2019-07-18 06:41:47 +00:00
Adrian-George Bostan
c64812093d Remove pdf folder and move packages up one level (#2) 2019-05-16 20:44:51 +00:00