Shortcuts

TextRecogDataSample

class mmocr.structures.TextRecogDataSample(*, metainfo=None, **kwargs)[源代码]

A data structure interface of MMOCR for text recognition. They are used as interfaces between different components.

The attributes in TextRecogDataSample are divided into two parts:

  • ``gt_text``(LabelData): Ground truth text.

  • ``pred_text``(LabelData): predictions text.

实际案例

>>> import torch
>>> import numpy as np
>>> from mmengine.structures import LabelData
>>> from mmocr.data import TextRecogDataSample
>>> # gt_text
>>> data_sample = TextRecogDataSample()
>>> img_meta = dict(img_shape=(800, 1196, 3),
...                 pad_shape=(800, 1216, 3))
>>> gt_text = LabelData(metainfo=img_meta)
>>> gt_text.item = 'mmocr'
>>> data_sample.gt_text = gt_text
>>> assert 'img_shape' in data_sample.gt_text.metainfo_keys()
>>> print(data_sample)
<TextRecogDataSample(
    META INFORMATION
    DATA FIELDS
    gt_text: <LabelData(
            META INFORMATION
            pad_shape: (800, 1216, 3)
            img_shape: (800, 1196, 3)
            DATA FIELDS
            item: 'mmocr'
        ) at 0x7f21fb1b9190>
) at 0x7f21fb1b9880>
>>> # pred_text
>>> pred_text = LabelData(metainfo=img_meta)
>>> pred_text.item = 'mmocr'
>>> data_sample = TextRecogDataSample(pred_text=pred_text)
>>> assert 'pred_text' in data_sample
>>> data_sample = TextRecogDataSample()
>>> gt_text_data = dict(item='mmocr')
>>> gt_text = LabelData(**gt_text_data)
>>> data_sample.gt_text = gt_text
>>> assert 'gt_text' in data_sample
>>> assert 'item' in data_sample.gt_text
参数

metainfo (Optional[dict]) –

返回类型

None

property gt_text: mmengine.structures.label_data.LabelData

ground truth text.

Type

LabelData

property pred_text: mmengine.structures.label_data.LabelData

prediction text.

Type

LabelData

Read the Docs v: dev-1.x
Versions
latest
stable
0.x
dev-1.x
Downloads
pdf
html
epub
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.