Skip to content

Releases: VikParuchuri/surya

Processor improvements

23 May 23:12
80889bd
Compare
Choose a tag to compare
  • Remove unneeded format conversions
  • Fix bug in OCR, where only one color channel was used for OCR - results should be better now
  • Speed up layout/text detection a bit

OCR speedup

18 May 04:03
74e8c0c
Compare
Choose a tag to compare

Cut OCR time in half. Combined with the previous release, OCR should now take about 40% as much time as it did before.

Significant speedup for layout, line detection

17 May 22:04
7a65c45
Compare
Choose a tag to compare
  • Improve CPU postprocessing for line detection and layout - cut postprocessing time to 1/3 of original
  • Unpin transformers version after investigating model performance

This should result in an ~2x speedup for layout and text detection. The effect will be most noticeable on GPU. I haven't fully benchmarked, though.

Bug fixes

16 May 22:00
c89e015
Compare
Choose a tag to compare
  • Fix memory leak with layout and text detection models and large batch sizes
  • Improve ordering model generation slightly

Save memory when pruning MoE

09 May 17:24
11faf42
Compare
Choose a tag to compare
  • Prune MoE experts before loading model
  • Unpin torch version from 2.2.2

Fix issue with torch and dependencies

08 May 20:02
2ba7e19
Compare
Choose a tag to compare
Merge pull request #96 from VikParuchuri/dev

Fix publishing issue

Add OCR batch size option

08 May 19:30
2133e3f
Compare
Choose a tag to compare
Merge pull request #95 from VikParuchuri/dev

Add batch size to OCR

Set batch sizes programatically

08 May 19:02
40a1043
Compare
Choose a tag to compare
  • Programmatic batch sizes for all models
  • Update package versions
  • Bugfixes

Release ordering model

22 Apr 17:16
e8c98ac
Compare
Choose a tag to compare
  • Release reading order model
  • Add Google Cloud OCR benchmarks to README

Release layout model

26 Mar 18:48
3cdc3b6
Compare
Choose a tag to compare
  • Add layout model
  • Multiprocessing for detection postprocessing (can be much faster)
  • Fix minor bugs