TextRecogDataSample
- class mmocr.structures.TextRecogDataSample(*, metainfo=None, **kwargs)
A data structure interface of MMOCR for text recognition. It is used as the interface between different components.
The attributes in TextRecogDataSample are divided into two parts:
- gt_text (LabelData): Ground truth text.
- pred_text (LabelData): Predicted text.
Examples
>>> import torch
>>> import numpy as np
>>> from mmengine.structures import LabelData
>>> from mmocr.structures import TextRecogDataSample
>>> # gt_text
>>> data_sample = TextRecogDataSample()
>>> img_meta = dict(img_shape=(800, 1196, 3),
...                 pad_shape=(800, 1216, 3))
>>> gt_text = LabelData(metainfo=img_meta)
>>> gt_text.item = 'mmocr'
>>> data_sample.gt_text = gt_text
>>> assert 'img_shape' in data_sample.gt_text.metainfo_keys()
>>> print(data_sample)
<TextRecogDataSample(
    META INFORMATION
    DATA FIELDS
    gt_text: <LabelData(
            META INFORMATION
            pad_shape: (800, 1216, 3)
            img_shape: (800, 1196, 3)
            DATA FIELDS
            item: 'mmocr'
        ) at 0x7f21fb1b9190>
) at 0x7f21fb1b9880>
>>> # pred_text
>>> pred_text = LabelData(metainfo=img_meta)
>>> pred_text.item = 'mmocr'
>>> data_sample = TextRecogDataSample(pred_text=pred_text)
>>> assert 'pred_text' in data_sample
>>> # gt_text built from a dict of data fields
>>> data_sample = TextRecogDataSample()
>>> gt_text_data = dict(item='mmocr')
>>> gt_text = LabelData(**gt_text_data)
>>> data_sample.gt_text = gt_text
>>> assert 'gt_text' in data_sample
>>> assert 'item' in data_sample.gt_text
- property gt_text: mmengine.structures.label_data.LabelData
Ground truth text.
- Type
LabelData
- property pred_text: mmengine.structures.label_data.LabelData
Predicted text.
- Type
LabelData
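As an illustration of how a downstream component might consume these two fields, the sketch below compares pred_text.item against gt_text.item across a batch of samples. It is not MMOCR's metric implementation: word_accuracy and make_sample are hypothetical helpers, and the samples are plain stand-in objects (so the snippet runs without MMOCR installed) that only mimic the gt_text.item and pred_text.item attributes used in the examples above.

```python
from types import SimpleNamespace


def word_accuracy(data_samples):
    """Return the fraction of samples whose predicted text matches the ground truth.

    Hypothetical helper: it only assumes each sample exposes
    ``gt_text.item`` and ``pred_text.item``, as in the examples above.
    """
    if not data_samples:
        return 0.0
    correct = sum(
        1 for sample in data_samples
        if sample.pred_text.item == sample.gt_text.item
    )
    return correct / len(data_samples)


def make_sample(gt, pred):
    """Build a stand-in object shaped like a TextRecogDataSample (assumption)."""
    return SimpleNamespace(
        gt_text=SimpleNamespace(item=gt),
        pred_text=SimpleNamespace(item=pred),
    )


samples = [
    make_sample('mmocr', 'mmocr'),  # exact match
    make_sample('hello', 'hallo'),  # one wrong character, counted as a miss
]
accuracy = word_accuracy(samples)
```

With real data, the stand-ins would be replaced by TextRecogDataSample instances whose gt_text and pred_text are LabelData objects; the comparison itself only relies on the item field.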