Commit Graph

  • 1da0ad52ed updated the table visualization dev/visualize-single-and-merged-cells Peter Staar 2026-05-17 07:53:02 +02:00
  • 6e8e088788 add aligned virtual text support to lists & tables align-lists-tables-headings Panos Vagenas 2026-05-15 16:31:24 +02:00
  • 0268348b6a ignore serialization of empty Meta in HTML dev/update-base-meta Peter Staar 2026-05-13 17:20:48 +02:00
  • 03dd0675d8 feat: extending BaseMeta with language, entities, etc for docling-agent Peter Staar 2026-05-13 07:40:15 +02:00
  • 7cc62cbde7 chore: bump version to 2.75.0 [skip ci] main v2.75.0 github-actions[bot] 2026-05-12 14:54:25 +00:00
  • 014948b0e8 feat: updated the HTML serialization (#609) Peter W. J. Staar 2026-05-12 16:49:08 +02:00
  • 89bfa71e79 chore(doclang): align lists, tables, groups, headings Panos Vagenas 2026-05-12 15:59:29 +02:00
  • 48c5b97593 fix(DocLang): fix chemistry serialization. (#607) vwe-ibm 2026-05-08 13:43:26 +02:00
  • 6a6512ebe4 docs(security): Document security processes (#606) Michele Dolfi 2026-05-08 10:00:26 +02:00
  • 4b38ab6037 fix: refine local path image loading fix-img-local-path-handling Panos Vagenas 2026-05-05 12:48:54 +02:00
  • 275bd1d217 chore: bump version to 2.74.1 [skip ci] v2.74.1 github-actions[bot] 2026-04-22 14:33:02 +00:00
  • 2087d0f362 fix: refine ImageRef URI handling (#595) Panos Vagenas 2026-04-22 16:13:10 +02:00
  • 048f1720f6 fix(doclang): default DoclangDeserializer to page 1 (#590) Ahmed Nassar 2026-04-21 17:54:48 +02:00
  • 473fbacfb9 fix: refine remote filename handling (#591) Panos Vagenas 2026-04-21 16:37:04 +02:00
  • 0425dc0c03 chore: bump version to 2.74.0 [skip ci] v2.74.0 github-actions[bot] 2026-04-17 06:48:41 +00:00
  • b72af126b7 fix(DocLang): fix chemistry serialization (#584) Matteo 2026-04-17 07:33:45 +02:00
  • 9dc882dc48 feat(serializer): add MsExcelMarkdownDocSerializer for sheet-name headings (#587) Smeet Agrawal 2026-04-17 10:31:13 +05:30
  • 6cbdee9626 fix: prevent numeric precision loss in Markdown table serialization (#588) Cesar Berrospi Ramis 2026-04-15 17:16:57 +02:00
  • f2a61868d4 feat: DocChunk expansion (#549) odelliab 2026-04-15 17:16:28 +02:00
  • 4795593abe test(tree-ops): remove redundant RefItem appends in rich table cell tests fix-tree-ops-with-cell-refs Panos Vagenas 2026-04-13 10:07:21 +02:00
  • f284cdda59 fix: consider rich table cell refs during tree operations Panos Vagenas 2026-04-10 17:06:08 +02:00
  • d2e79f4a80 chore: bump version to 2.73.0 [skip ci] v2.73.0 github-actions[bot] 2026-04-09 08:08:14 +00:00
  • 18f573899b feat(ouline): extend OutlineDocSerializer with filtering capabilities (#580) Cesar Berrospi Ramis 2026-04-09 09:56:02 +02:00
  • 7d8c9db2a7 docs: Fixes a typo in CONTRIBUTING.md (#582) Jordan White 2026-04-08 22:57:11 -07:00
  • 46a9b5a329 feat: add latex and Tikz as codelabels (#579) Peter W. J. Staar 2026-04-09 06:46:18 +02:00
  • b7d35cef4b chore: bump version to 2.72.0 [skip ci] v2.72.0 github-actions[bot] 2026-04-07 12:35:15 +00:00
  • 7e58e8b607 apply CDATA escaping to URIs as needed add-doclang-hyperlinks Panos Vagenas 2026-04-07 12:46:06 +02:00
  • 1b367fdc7f feat(DocLang): add hyperlink support Panos Vagenas 2026-04-07 11:47:06 +02:00
  • aaa1fbb031 ci(mergify): upgrade configuration to current format (#576) mergify[bot] 2026-04-06 09:19:25 +02:00
  • 00c3bb223d feat(Doclang): add newline handling (#575) Panos Vagenas 2026-04-01 17:13:37 +02:00
  • f20068db91 feat: add transforms in the hierarchy (#572) Peter W. J. Staar 2026-04-01 17:12:06 +02:00
  • 0388a99570 chore: bump version to 2.71.0 [skip ci] v2.71.0 github-actions[bot] 2026-03-30 15:47:39 +00:00
  • 0bd5d8e649 feat: add code representation meta field (#573) Panos Vagenas 2026-03-30 17:43:39 +02:00
  • c9b51520d2 fix(Doclang): improve checkbox serialization & deserialization (#570) Panos Vagenas 2026-03-30 12:31:07 +02:00
  • a1535bc1d8 fix(Doclang): fix serialization order in text items (#571) Panos Vagenas 2026-03-30 10:44:45 +02:00
  • 57187b2364 chore: address subtree moving within same parent (#564) Panos Vagenas 2026-03-30 10:41:54 +02:00
  • fe9bbfbb0f feat(Doclang): add content layer support (#568) Panos Vagenas 2026-03-30 10:40:49 +02:00
  • fb3b603bc4 feat: add handwriting support (#561) Vittorio Pippi 2026-03-26 08:43:29 +01:00
  • 0cfb663275 fix: extend validation to address duplicate refs (#565) Panos Vagenas 2026-03-25 15:43:28 +01:00
  • 159eb8f021 fix(Doclang): fix group serialization (#566) Panos Vagenas 2026-03-25 14:31:32 +01:00
  • 7c29721e27 chore: parametrize validation through context parametrize-validation-via-ctx Panos Vagenas 2026-03-25 13:36:14 +01:00
  • b65dd24212 fix: repair table children when rich table cells break hierarchy (#563) Christoph Auer 2026-03-23 16:01:55 +01:00
  • 2808317b3f chore: bump version to 2.70.2 [skip ci] v2.70.2 github-actions[bot] 2026-03-20 15:37:35 +00:00
  • 91ee7e2302 fix(Doclang): suppress empty elements in Doclang serialization (#554) Ahmed Nassar 2026-03-20 14:49:59 +01:00
  • 4807381050 chore: improve key-value migration (#559) Panos Vagenas 2026-03-20 14:32:30 +01:00
  • 7c1d74894c refactor(chunker): add type hints and @override decorators to chunk methods dev/chunker-type-hints Cesar Berrospi Ramis 2026-03-20 14:01:27 +01:00
  • de575b7c3a chore: add item label in outline serializer (#558) Cesar Berrospi Ramis 2026-03-20 13:16:57 +01:00
  • 3e030edc6f fix: expose traverse_pictures in export_to_markdown and export_to_text (#557) samiuc 2026-03-20 00:12:54 -07:00
  • f97ec83f67 fix: sync picture classification enums with DocumentFigureClassifier-v2.0 model (#529) jhchoi1182 2026-03-20 00:00:00 +09:00
  • 2a4377b713 chore: bump version to 2.70.1 [skip ci] v2.70.1 github-actions[bot] 2026-03-17 14:06:00 +00:00
  • afa5bd9e3a chore: document-level metadata serialization via body field (#551) Cesar Berrospi Ramis 2026-03-17 14:55:52 +01:00
  • 0a3b2787e0 fix(markdown): remove assert statements to support Python optimization mode (#548) Cesar Berrospi Ramis 2026-03-17 11:25:39 +01:00
  • c57e50ac43 fix: improve rich table cell validation (#550) Panos Vagenas 2026-03-16 17:13:53 +01:00
  • 1513f7d171 chore: bump version to 2.70.0 [skip ci] v2.70.0 github-actions[bot] 2026-03-13 15:06:05 +00:00
  • b56f75190f chore(Doclang): remove inline element (#517) Panos Vagenas 2026-03-13 15:58:42 +01:00
  • b93d5a3920 feat: introduce field data model incl. Doclang serialization (#519) Panos Vagenas 2026-03-13 15:37:42 +01:00
  • 9f3c0c6757 chore: upgrade dependencies to address dependabot alerts (#543) Cesar Berrospi Ramis 2026-03-13 15:18:38 +01:00
  • 8d7859eeec feat: make an experimental outline serializer (#415) Peter W. J. Staar 2026-03-13 15:14:40 +01:00
  • af50f1cb07 feat: profile a document or collection (#511) Cesar Berrospi Ramis 2026-03-13 13:36:38 +01:00
  • b435090fdf feat: split html table to headers and body (#532) odelliab 2026-03-13 11:40:05 +02:00
  • e00125c477 feat: handle wide table outliers with LineBasedTokenChunker (#536) Anish Raghavendra 2026-03-13 11:59:46 +05:30
  • d8a8b361ae fixes doclang_variable_export ahn 2026-02-24 14:26:06 +01:00
  • 8881d0430b feat: add WebVTT export and save functionality (#523) Cesar Berrospi Ramis 2026-02-24 10:17:17 +01:00
  • 9728ef71fb chore: bump version to 2.65.2 [skip ci] github-actions[bot] 2026-02-23 15:17:00 +00:00
  • b1d282b5eb fix: accept relative URIs in PdfHyperlink without validation failure (#520) Ultizan 2026-02-23 07:11:38 -08:00
  • d9e97cd990 fix: shift KV/Form graph cell page numbers during DoclingDocument.concatenate (#521) Christoph Auer 2026-02-23 10:35:16 +01:00
  • 900dd96897 fix(chunker): propagate 'traverse_pictures' parameter to chunker (#518) Cesar Berrospi Ramis 2026-02-20 20:37:43 +01:00
  • dab400f9a3 extend kv migration tests Panos Vagenas 2026-03-10 15:57:29 +01:00
  • 24574110d3 chore: bump version to 2.69.0 [skip ci] v2.69.0 github-actions[bot] 2026-03-09 04:31:38 +00:00
  • 4eb0d20d04 feat: Loosen dependency version constraints (#534) Michele Dolfi 2026-03-09 05:27:48 +01:00
  • eb900064c8 chore: bump version to 2.68.0 [skip ci] v2.68.0 github-actions[bot] 2026-03-07 12:19:50 +00:00
  • 119ea59bf0 extend KV migration scope, extend tree manipulation operations Panos Vagenas 2026-03-06 17:09:22 +01:00
  • a661bb10cb fix: prevent infinite loop in LineBasedTokenChunker with unbreakable tokens (#533) Cesar Berrospi Ramis 2026-03-06 12:52:30 +01:00
  • 43030488c9 chore: add support for pandas==3.0.0 (#513) Faiq Adzlan 2026-03-06 18:25:56 +08:00
  • e363c951d8 feat: add plain-text serializer (#522) samiuc 2026-03-06 01:20:21 -08:00
  • 1c6ae32b46 chore: bump version to 2.67.1 [skip ci] v2.67.1 github-actions[bot] 2026-03-05 09:11:24 +00:00
  • 2debe0836f fix: prevent hang in export_to_markdown() on nested RichTableCells (#525) Ivan Traus 2026-03-05 00:54:27 -08:00
  • 7c21840c4f chore: bump version to 2.67.0 [skip ci] v2.67.0 github-actions[bot] 2026-03-04 15:29:11 +00:00
  • ea359bcc63 feat: table aware chunking (#527) odelliab 2026-03-04 17:25:57 +02:00
  • cc2955c22a make FieldItem a DocItem Panos Vagenas 2026-03-03 12:42:00 +01:00
  • 8d1f287057 align with TextItem conventions Panos Vagenas 2026-02-27 14:28:40 +01:00
  • 8495b246bc updated API Panos Vagenas 2026-02-27 13:59:47 +01:00
  • c5f3266235 generalize marker naming Panos Vagenas 2026-02-27 13:21:05 +01:00
  • a993b01059 add field marker and test Panos Vagenas 2026-02-27 13:15:11 +01:00
  • 3bdb8e9341 enable proper nested serialization without inline Panos Vagenas 2026-02-26 14:24:36 +01:00
  • 9eca661df7 chore: bump version to 2.66.0 [skip ci] v2.66.0 github-actions[bot] 2026-02-26 10:46:22 +00:00
  • c566268e0a fix: rich table triplet serialization (#425) Matvei Smirnov 2026-02-26 13:39:34 +03:00
  • 469f6ecca8 improve prov migration & location serialization Panos Vagenas 2026-02-25 17:29:53 +01:00
  • 73b07572cf fix: support single-column table default serialization (#526) Cesar Berrospi Ramis 2026-02-24 12:23:01 +01:00
  • b8ef7bad1b feat: add WebVTT export and save functionality (#523) Cesar Berrospi Ramis 2026-02-24 10:17:17 +01:00
  • 44e4364d53 switch to field naming Panos Vagenas 2026-02-23 17:51:28 +01:00
  • 869a13d06b add nesting test Panos Vagenas 2026-02-23 17:05:27 +01:00
  • cc4df04ac0 chore: bump version to 2.65.2 [skip ci] v2.65.2 github-actions[bot] 2026-02-23 15:17:00 +00:00
  • 6032c7c175 fix: accept relative URIs in PdfHyperlink without validation failure (#520) Ultizan 2026-02-23 07:11:38 -08:00
  • 6a04db77aa fix: shift KV/Form graph cell page numbers during DoclingDocument.concatenate (#521) Christoph Auer 2026-02-23 10:35:16 +01:00
  • a3b6e3fb89 fix(chunker): propagate 'traverse_pictures' parameter to chunker (#518) Cesar Berrospi Ramis 2026-02-20 20:37:43 +01:00
  • df35f9245e include pre-migration YAML Panos Vagenas 2026-02-20 12:39:23 +01:00
  • 77944d5d23 add migration Panos Vagenas 2026-02-20 11:41:02 +01:00
  • 12404d07ad update invoice test Panos Vagenas 2026-02-19 15:56:55 +01:00
  • 90ec69d75f add form-table test Panos Vagenas 2026-02-19 13:57:42 +01:00