Release MMOCR Release v1.0.0rc6 · open-mmlab/mmocr

Highlights

Two new models, ABCNet v2 (inference only) and SPTS are added to projects/ folder.
Announcing Inferencer, a unified inference interface in OpenMMLab for everyone's easy access and quick inference with all the pre-trained weights. Docs
Users can use test-time augmentation for text recognition tasks. Docs
Support batch augmentation through BatchAugSampler, which is a technique used in SPTS.
Dataset Preparer has been refactored to allow more flexible configurations. Besides, users are now able to prepare text recognition datasets in LMDB formats. Docs
Some textspotting datasets have been revised to enhance the correctness and consistency with the common practice.
Potential spurious warnings from shapely have been eliminated.

This version requires MMEngine >= 0.6.0, MMCV >= 2.0.0rc4 and MMDet >= 3.0.0rc5.

Discard deprecated lmdb dataset format and only support img+label now by @gaotongxiao in #1681
abcnetv2 inference by @Harold-lkk in #1657
Add RepeatAugSampler by @gaotongxiao in #1678
SPTS by @gaotongxiao in #1696
Refactor Inferencers by @gaotongxiao in #1608
Dynamic return type for rescale_polygons by @gaotongxiao in #1702
Revise upstream version limit by @gaotongxiao in #1703
TextRecogCropConverter add crop with opencv warpPersepective function by @KevinNuNu in #1667
change cudnn benchmark to false by @Harold-lkk in #1705
Add ST-pretrained DB-series models and logs by @gaotongxiao in #1635
Only keep meta and state_dict when publish model by @Harold-lkk in #1729
Rec TTA by @Harold-lkk in #1401
Speedup formatting by replacing np.transpose with torch… by @gaotongxiao in #1719
Support auto import modules from registry. by @Harold-lkk in #1731
Support batch visualization & dumping in Inferencer by @gaotongxiao in #1722
add a new argument font_properties to set a specific font file in order to draw Chinese characters properly by @KevinNuNu in #1709
Refactor data converter and gather by @Harold-lkk in #1707
Support batch augmentation through BatchAugSampler by @gaotongxiao in #1757
Put all registry into registry.py by @Harold-lkk in #1760
train by @gaotongxiao in #1756
configs for regression benchmark by @gaotongxiao in #1755
Support lmdb format in Dataset Preparer by @gaotongxiao in #1762

Update dockerfile by @gaotongxiao in #1671
Explicitly create np object array for compatibility by @gaotongxiao in #1691
Fix a minor error in docstring by @Mountchicken in #1685
Fix lint by @triple-Mu in #1694
Fix LoadOCRAnnotation ut by @Harold-lkk in #1695
Fix isort pre-commit error by @KevinNuNu in #1697
Update owners by @xinke-wang in #1699
Detect intersection before using shapley.intersection to eliminate spurious warnings by @gaotongxiao in #1710
Fix some inferencer bugs by @gaotongxiao in #1706
Fix textocr ignore flag by @xinke-wang in #1712
Add missing softmax in ASTER forward_test by @Mountchicken in #1718
Fix head in readme by @vansin in #1727
Fix some browse dataset script bugs and draw textdet gt instance with ignore flags by @KevinNuNu in #1701
icdar textrecog ann parser skip data with ignore flag by @KevinNuNu in #1708
bezier_to_polygon -> bezier2polygon by @double22a in #1739
Fix docs recog CharMetric P/R error definition by @KevinNuNu in #1740
Remove outdated resources in demo/ by @gaotongxiao in #1747
Fix wrong ic13 textspotting split data; add lexicons to ic13, ic15 and totaltext by @gaotongxiao in #1758
SPTS readme by @gaotongxiao in #1761

Full Changelog: v1.0.0rc5...v1.0.0rc6