Releases: VikParuchuri/surya
Processor improvements
- Remove unneeded format conversions
- Fix an OCR bug where only one color channel was used; results should be better now
- Speed up layout/text detection a bit
OCR speedup
Cut OCR time in half. Combined with the previous release, OCR should now take about 40% as much time as it did before.
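As a quick check of the arithmetic above, the two stated figures (time halved in this release, about 40% of the original time overall) imply how much the previous release contributed by itself. This is only a sketch derived from the numbers in the note, not a benchmark; the variable names are illustrative.

```python
# Figures taken from the release note (approximate, not measured here).
combined = 0.40      # OCR time after both releases, relative to the original
this_release = 0.50  # this release cuts OCR time in half

# Implied relative OCR time after the previous release alone.
previous_release = combined / this_release

print(f"previous release alone: {previous_release:.0%} of the original time")
print(f"overall speedup: {1 / combined:.1f}x")
```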
Significant speedup for layout, line detection
- Improve CPU postprocessing for line detection and layout, cutting postprocessing time to 1/3 of the original
- Unpin transformers version after investigating model performance
This should result in a ~2x speedup for layout and text detection. The effect will be most noticeable on GPU. I haven't fully benchmarked this, though.
Bug fixes
- Fix memory leak with layout and text detection models and large batch sizes
- Improve ordering model generation slightly
Save memory when pruning MoE
- Prune MoE experts before loading model
- Unpin torch version from 2.2.2
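Pruning experts before loading the model can be pictured as filtering the checkpoint's weights first, so the discarded experts never occupy memory in the instantiated model. The sketch below is a generic illustration, not surya's actual implementation: the function name `prune_expert_weights`, the `experts.<index>.` key pattern, and the toy checkpoint are all assumptions, and plain lists stand in for tensors.

```python
import re

def prune_expert_weights(state_dict, keep_experts):
    """Return a copy of state_dict without weights of experts not in keep_experts.

    Assumes (hypothetically) that expert weights have keys containing
    'experts.<index>.'; every other key is kept unchanged.
    """
    pattern = re.compile(r"experts\.(\d+)\.")
    pruned = {}
    for key, value in state_dict.items():
        match = pattern.search(key)
        # Keep non-expert weights, and expert weights whose index is retained.
        if match is None or int(match.group(1)) in keep_experts:
            pruned[key] = value
    return pruned

# Toy checkpoint with three experts; lists stand in for real tensors.
checkpoint = {
    "encoder.layer0.attn.weight": [0.1, 0.2],
    "encoder.layer0.moe.experts.0.weight": [1.0],
    "encoder.layer0.moe.experts.1.weight": [2.0],
    "encoder.layer0.moe.experts.2.weight": [3.0],
}

pruned = prune_expert_weights(checkpoint, keep_experts={0, 2})
print(sorted(pruned))
```

With a real framework, the filtered dictionary would then be loaded into a model built with only the retained experts.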
Fix issue with torch and dependencies
Merge pull request #96 from VikParuchuri/dev: Fix publishing issue
Add OCR batch size option
Merge pull request #95 from VikParuchuri/dev: Add batch size to OCR
Set batch sizes programmatically
- Programmatic batch sizes for all models
- Update package versions
- Bugfixes
Release ordering model
- Release reading order model
- Add Google Cloud OCR benchmarks to README
Release layout model
- Add layout model
- Multiprocessing for detection postprocessing (can be much faster)
- Fix minor bugs
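The multiprocessing item above refers to running CPU-bound postprocessing for several pages in parallel. A minimal sketch of that general pattern with the standard library follows. It is not surya's code: `threshold_heatmap` is a hypothetical stand-in for the real postprocessing step, and `postprocess_pages` and the worker count are illustrative choices.

```python
from concurrent.futures import ProcessPoolExecutor

def threshold_heatmap(heatmap, threshold=0.5):
    """Hypothetical CPU postprocessing step: binarize one page's heatmap."""
    return [[1 if value >= threshold else 0 for value in row] for row in heatmap]

def postprocess_pages(heatmaps, max_workers=2):
    """Postprocess each page in a separate worker process, preserving input order."""
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(threshold_heatmap, heatmaps))

if __name__ == "__main__":
    # Two tiny fake pages; real detection heatmaps would be much larger arrays.
    pages = [
        [[0.1, 0.9], [0.7, 0.2]],
        [[0.6, 0.4], [0.3, 0.8]],
    ]
    print(postprocess_pages(pages))
```

Worker processes only pay off when the per-page work outweighs the cost of sending data between processes, which is consistent with the note's "can be much faster" rather than a guaranteed gain.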