Releases · VikParuchuri/marker · GitHub

23 May 23:24

VikParuchuri

Speed improvements Latest

Latest

Enable parallel text extraction, with worker count settings
Bump surya version to pull in layout/line segmentation speed improvements, and OCR bug fix

Assets 2

18 May 04:28

VikParuchuri

Faster OCR

OCR is now ~2.5x faster, due to improvements in surya

Assets 2

17 May 22:57

VikParuchuri

Speed up inference

(from surya) faster ocr, line detection, layout inference
Unpin transformers version after testing

Should be significantly faster now, but haven't fully benchmarked, since I'm running low on time this week!

Assets 2

16 May 22:46

VikParuchuri

Fix memory leak

Fix a memory leak (fixed in surya, bumped the version). This caused high CPU memory usage on long docs.
Improve load_all_models to take device and dtype

Assets 2

10 May 16:02

VikParuchuri

Marker v2

Basically a full rewrite!

Main features:

Extracts and saves images
Improved table formatting
Better markdown wrapping
Better reading order on complex docs
Improved OCR engine with more language options
Simple pip package install (no more required system dependencies), so can be used easily on Windows
Can be used commercially (pymupdf and layoutlmv3 dependencies removed)

It takes ~2x as long to run now, but seems like a decent tradeoff.

See the README for details.

Assets 2