9 Commits

Author SHA1 Message Date
Maxim Lysak 1c74a9b9c7 feat: Implementation of HTML backend with headless browser (#2969)
- Implementation of an HTML backend that (optionally) uses a headless browser (via Playwright) to render HTML pages into images and adds provenance with bounding boxes to all elements in the converted Docling document.
- Conversion preserves the reading order given by the HTML DOM tree.
- Added support for HTML "input" fields: checkboxes, radio buttons, text inputs, etc.
- Added support for a key-value convention in HTML (e.g. elements with id "key1" and "key1_value1" are paired as key-values; see the test cases for examples).
- Heuristic that glues independent inline HTML elements containing single-character text into larger text blocks.
- Support for inline styling (bold, italic, etc.).
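
The key-value id convention mentioned above can be illustrated with a minimal sketch. This is not the backend's actual code: the names `IdTextCollector` and `pair_key_values` are hypothetical, the parsing uses only Python's standard `html.parser`, and the assumption that a key may have several numbered values (`key1_value1`, `key1_value2`, ...) is inferred from the naming pattern rather than confirmed by the commit.

```python
import re
from html.parser import HTMLParser

# Tags that never have a closing tag; they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}

# Assumed convention: a value id is "<key id>_value<number>".
VALUE_ID = re.compile(r"^(?P<key>.+)_value\d+$")


class IdTextCollector(HTMLParser):
    """Collects the text content of every element that carries an id."""

    def __init__(self):
        super().__init__()
        self.texts = {}    # id -> accumulated text
        self._stack = []   # id (or None) of each currently open element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        elem_id = dict(attrs).get("id")
        self._stack.append(elem_id)
        if elem_id is not None:
            self.texts.setdefault(elem_id, "")

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self._stack:
            self._stack.pop()

    def handle_data(self, data):
        # Text belongs to every open ancestor that has an id.
        for elem_id in self._stack:
            if elem_id is not None:
                self.texts[elem_id] += data


def pair_key_values(html):
    """Return {key text: [value texts]} for ids following the convention."""
    collector = IdTextCollector()
    collector.feed(html)
    collector.close()
    pairs = {}
    for elem_id, text in collector.texts.items():
        match = VALUE_ID.match(elem_id)
        if match and match.group("key") in collector.texts:
            key_text = collector.texts[match.group("key")].strip()
            pairs.setdefault(key_text, []).append(text.strip())
    return pairs
```

For example, calling `pair_key_values` on `<span id="key1">Name</span><span id="key1_value1">Ada</span>` pairs the text "Name" with the value "Ada". The test cases referenced in the commit remain the authoritative description of the convention.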

Signed-off-by: Maksym Lysak <mly@zurich.ibm.com>
Co-authored-by: Maksym Lysak <mly@zurich.ibm.com>
2026-03-24 14:28:57 +01:00
Robert Sokolewicz 95b759e519 docs: update code snippet to use modern pipeline options syntax (#3087)
Signed-off-by: Robert Sokolewicz <rsokolewicz@gmail.com>
2026-03-09 08:49:53 +01:00
ParvaP 16e88d50fa docs: correct broken link to supported formats (#2878)
Fix link to supported formats in quickstart guide

Updated link to supported formats documentation.

Signed-off-by: ParvaP <55171512+ParvaP@users.noreply.github.com>
2026-01-14 19:50:16 +01:00
Michele Dolfi be085c0e39 docs(RTX): Guidelines for best performance on RTX GPUs (#2765)
* add RTX docs

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* add artwork and fix title

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* fix series definition

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* add nvidia logo and update todo

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

---------

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2025-12-19 13:16:59 +01:00
Ryan Soliveres d549445e78 docs: Move Installation and Quickstart (Usage) under Getting started (#2644)
* docs: Move Installation and Quickstart (Usage) under Getting started

Moved Installation and Usage (Quickstart) under Getting started section
Rename installation folder to documentation folder
Rename installation/index.md to documentation/installation.md
Duplicate usage/index.md to documentation directory and rename it to documentation/quickstart.md
Add redirection from installation and usage

Signed-off-by: Ryan S <ryansoliveres@users.noreply.github.com>

* docs: Move Installation and Quickstart under Getting started

Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>

* docs: Move Installation and Quickstart under Getting started

Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>

* git commit -m "DCO Remediation Commit for rysoliveres <ryan.soliveres@yahoo.com>

I, rysoliveres <ryan.soliveres@yahoo.com>, hereby add my Signed-off-by to this commit: b7ae13e3d8

Signed-off-by: rysoliveres <ryan.soliveres@yahoo.com>"

Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>

* git commit --allow-empty -m "DCO Remediation Commit for rysoliveres <ryan.soliveres@yahoo.com>

I, rysoliveres <ryan.soliveres@yahoo.com>, hereby add my Signed-off-by to this commit: b7ae13e3d8

Signed-off-by: rysoliveres <ryan.soliveres@yahoo.com>"

Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>

* DCO Remediation Commit for rysoliveres <ryan.soliveres@yahoo.com>

I, rysoliveres <ryan.soliveres@yahoo.com>, hereby add my Signed-off-by to this commit: b7ae13e3d8

Signed-off-by: rysoliveres <ryan.soliveres@yahoo.com>

Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>

---------

Signed-off-by: Ryan S <ryansoliveres@users.noreply.github.com>
Signed-off-by: ryansoliveres <ryan.soliveres@yahoo.com>
2025-11-18 17:09:41 +01:00
Panos Vagenas ac9fc585bb docs: add redirection from getting started page (#2640)
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
2025-11-17 14:13:51 +01:00
Robyn Johnson ae30373ee7 docs: combine Home and Getting Started pages (#2600)
* Update mkdocs.yml

Remove the navigation.sections feature so that navigation menus collapse and expand. They are collapsed by default.

* docs: add sign-off

DCO Remediation Commit for Robyn J <bobbinrobyn@users.noreply.github.com>

I, Robyn J <bobbinrobyn@users.noreply.github.com>, hereby add my Signed-off-by to this commit: b7d7441827

Signed-off-by: Robyn J <bobbinrobyn@users.noreply.github.com>

* docs: Combine Home and Getting Started page

Combine the Home and Getting Started pages, and rename the page "Documentation"

Signed-off-by: Robyn J <bobbinrobyn@users.noreply.github.com>

---------

Signed-off-by: Robyn J <bobbinrobyn@users.noreply.github.com>
2025-11-14 13:29:25 +01:00
Michele Dolfi dd03b53117 docs: discord badge with join link (#2473)
* add discord link

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* Add Discord link to social section in mkdocs.yml

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>

* Add Discord link to getting started documentation

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>

---------

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
Co-authored-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2025-10-16 10:13:50 +02:00
Panos Vagenas 8996d612aa docs: add Getting Started page (#2113)
* docs: add Getting Started page

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>

* refactor usage

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>

* minor renaming

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>

---------

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
2025-08-21 08:44:53 +02:00