Skip to content

PaddleOCRv2.7.1

Compare
Choose a tag to compare
@shiyutang shiyutang released this 18 Oct 12:32
· 2 commits to release/2.7.1 since this release

New Projects

  • Add Parseq recognition model.(#10836)
  • Add text recognition function to return single character coordinates.(#10515)

New Features

  • Add savefile option to save OCR output results.(#10628)
  • Add more data preprocessing options to ppocr.py.(#10217)
  • A single damaged image does not affect data set inference. (#10216)
  • Compatible with fitz version. (#10181)
  • Compatible with Pillow10.0 upgrade. (#10405)
  • Add Finnish dictionary file. (#10850)
  • Onnxruntime supports GPU. (#10668)
  • TIPC supports XPU and NPU. (#10658, #10460)
  • Add inference on mlu devices. (#10249)

BugFix

  • Fixed cannot find the library error when packaged into exe on windows. (#10502)
  • Fixed the bug of recognize page is affected by the maximum number of PDF files when recognize multiple PDF files. (#10290)
  • Fixed the problem of PPOCRLabel startup failure caused by mismatch in channel number. (#10847)
  • Fix memory leak problem of cpp inference. (#10441)
  • Modify DSR error caused by data enhancement. (#10662)
  • Fix training seed problem. (#10614)
  • Fix table_master tipc error. (#10514)
  • Fixed the problem of error reporting when ppocr.py uses wandb. (#10251)
  • Fix memory leak in predict_rec.py. (#10688)
  • Fixed the issue where dis and iou cannot be calculated correctly due to the index error of structure_boxes in the PaddleStructure::rebuild_table function. (#10810)
  • Compatible with paddle 2.5 fluid exit. (#10391)
  • Fix the performance problem of Tensor.numpy under stride. (#10773)
  • Adapt the size of ABINet during export to the size of ABINetRecResizeImg. (#10892)
  • ABINet training error. (#10744)
  • Fix KeyError in ch_PP-OCRv4_rec_distill.yml. (#10761)

Documentations Fix

  • Fix algorithm_kie_vi_layoutxlm_en.md, kie_datasets_en.md, README.md, algorithm_overview.md, how_to_do_kie_en.md document issues. (#10717)
  • Update documentation issues with setup.py. (#10749)
  • Add pyyaml ​​library in requirements.txt. (#10653)

New Projects

  • 增加Parseq 识别模型。(#10836
  • 增加文字识别返回单字识别坐标功能。(#10515

New Features

  • 增加savefile选项,保存ocr输出结果。(#10628
  • 增加ppocr.py 更多数据预处理选项。(#10217
  • 单张破损图片不影响整体数据集推理。(#10216
  • 兼容fitz版本。(#10181
  • 兼容Pillow10.0升级。(#10405
  • 增加芬兰语字典文件。(#10850
  • Onnxruntime 支持GPU。(#10668
  • TIPC支持XPU、NPU。(#10658#10460
  • 增加在mlu设备上的推理。(#10249

BugFix

  • 修复windows打包成exe找不到库的问题。(#10502
  • 修复多次识别pdf,受第一次页面最大数量影响的bug。(#10290
  • 修复通道数不匹配造成的PPOCRLabel启动失败问题。(#10847
  • 修复cpp推理的内存泄漏问题。(#10441
  • 修改数据增强导致的DSR报错。(#10662
  • 修复训练seed传递问题。( #10614
  • 修复 table_master tipc 报错。(#10514
  • 修复ppocr.py使用wandb报错问题。(#10251
  • 修复predict_rec.py中的内存泄漏问题。(#10688
  • 修复PaddleStructure::rebuild_table函数中structure_boxes的索引错误导致dis和iou无法正确计算的问题。(#10810
  • 兼容paddle 2.5 fluid退场。 (#10391
  • 修复stride下Tensor.numpy的性能问题。(#10773
  • ABINet导出时尺寸适应 ABINetRecResizeImg 的尺寸。(#10892
  • ABINet训练报错问题。(#10744
  • 解决ch_PP-OCRv4_rec_distill.yml中的keyError报错问题。(#10761

Documentations Fix

  • 修复 algorithm_kie_vi_layoutxlm_en.md、kie_datasets_en.md 、README.md、algorithm_overview.md、how_to_do_kie_en.md文档问题(#10717
  • 更新setup.py的文档问题(#10749
  • 修复requirements.txt中没有pyyaml库的问题(#10653

New Contributors

Full Changelog: v2.7.0...v2.7.1