Christoph Auer
|
b066b26215
|
feat!: Public threaded PDF parser and rendering API (#265)
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
|
2026-05-11 15:37:22 +02:00 |
|
Peter W. J. Staar
|
db84017ca7
|
fix: memory management for docling upstream (#263)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-04-24 15:18:25 +02:00 |
|
Peter W. J. Staar
|
b5804c1654
|
fix: refactored the black to ruff (#258)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-04-18 05:56:59 +02:00 |
|
Peter W. J. Staar
|
7be5d62336
|
feat: add jbig2 decoder (#252)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-04-17 15:46:44 +02:00 |
|
Peter W. J. Staar
|
e7ef57fbf6
|
feat: extend the renderer (#245)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-04-01 06:48:09 +02:00 |
|
Peter W. J. Staar
|
ae66f6ddf0
|
feat: add parallelization for parsing (#216)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-03-04 10:42:04 +01:00 |
|
Peter W. J. Staar
|
36eb3928fd
|
fix: updated the debug log (#229)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-23 17:54:59 +01:00 |
|
Peter W. J. Staar
|
e7812a122a
|
feat: Refactor pdf resources to pdf page item (#215)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-13 17:25:41 +01:00 |
|
Peter W. J. Staar
|
2fd79a05c5
|
perf: improve recursive form xobject (#212)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-11 14:49:17 +01:00 |
|
Peter W. J. Staar
|
3272dd8d0b
|
feat: removing the json from the pdf-parser (#210)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-11 07:30:12 +01:00 |
|
Peter W. J. Staar
|
ea5f1d8d7b
|
feat: renaming lines to shapes and enriching with graphics (color, filling and stroking) (#209)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-10 05:35:19 +01:00 |
|
Peter W. J. Staar
|
f01ce848aa
|
feat: add decoding config to decode_page (#208)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-06 15:32:39 +01:00 |
|
Peter W. J. Staar
|
25672da1e8
|
feat: add-image-extraction (#207)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-02-04 17:35:00 +01:00 |
|
Peter W. J. Staar
|
82a0aaa791
|
perf: default cmap speedup (#203)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-01-30 09:48:31 +01:00 |
|
Peter W. J. Staar
|
f86ff926c8
|
perf: move map to unordered_map (#202)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-01-29 08:41:15 +01:00 |
|
Peter W. J. Staar
|
23c7fb8e8f
|
feat: add typed serialization (#201)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-01-28 17:51:51 +01:00 |
|
Peter W. J. Staar
|
a98871e9e3
|
chore: removed the v2 naming in the code (#198)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-01-26 13:36:37 +01:00 |
|
Peter W. J. Staar
|
adcb9b00e5
|
feat!: Remove deprecated v1 api (#189)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2026-01-19 17:16:32 +01:00 |
|
Michele Dolfi
|
96e5f21639
|
fix: avoid setting global root logger (#182)
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-12-02 17:49:34 +01:00 |
|
Nimalan
|
0c64402ddb
|
feat: Support reading password protected PDF (#169)
Signed-off-by: Nimalan <nimalan.m@protonmail.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-10-20 13:56:18 +02:00 |
|
Peter W. J. Staar
|
f8d53ee481
|
feat: add perf tools (#165)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-09-16 16:53:46 +02:00 |
|
Peter W. J. Staar
|
0402b3f0a3
|
feat: reset to the old parameters in sanitation (#163)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-09-04 12:39:59 +02:00 |
|
Peter W. J. Staar
|
1466548476
|
feat: accelerate docling parse (#161)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-09-03 07:46:24 +02:00 |
|
Peter W. J. Staar
|
fe3482f7d7
|
feat: add page unloading (#150)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-08-19 08:49:18 +02:00 |
|
Peter W. J. Staar
|
8872e736bf
|
feat: Fixed char ordering in text lines (#138)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-06-24 12:43:01 +02:00 |
|
Christoph Auer
|
01971026d9
|
fix: Set flags for SegmentedPage correctly (#127)
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-06-11 15:51:32 +02:00 |
|
Michele Dolfi
|
ec6556b988
|
fix: filenames with unicode chars on Windows (#124)
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-06-05 10:47:35 +02:00 |
|
Michele Dolfi
|
c9c452b6ee
|
fix: setup hashlib for fips compliance (#123)
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-06-04 11:10:00 +02:00 |
|
Christoph Auer
|
ca7d584fa3
|
feat!: Update API, naming, and tests. Move data model to docling-core (#107)
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-03-14 13:00:24 +01:00 |
|
Peter W. J. Staar
|
c2f9741a5b
|
feat: Establish char_cells, word_cells and line_cells, other fixes (#101)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-02-18 09:54:17 +01:00 |
|
Peter W. J. Staar
|
25b1e64846
|
feat: add support for RtL (#94)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-02-06 07:11:19 +01:00 |
|
Peter W. J. Staar
|
b634c11571
|
fix: update vizualisation script (#95)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-02-03 17:51:57 +01:00 |
|
Peter W. J. Staar
|
9718762209
|
feat: Added the pure chars and fixed the duplicate text (#91)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
|
2025-02-02 13:57:45 +01:00 |
|
Peter W. J. Staar
|
525ed8e380
|
feat: Update for complex fonts, rendering, and experimental high-level API (#82)
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-01-17 18:46:16 +01:00 |
|