Shortcuts

Note

You are reading the documentation for MMOCR 0.x, which will soon be deprecated by the end of 2022. We recommend you upgrade to MMOCR 1.0 to enjoy fruitful new features and better performance brought by OpenMMLab 2.0. Check out the maintenance plan, changelog, code and documentation of MMOCR 1.0 for more details.

Source code for mmocr.datasets.ocr_dataset

# Copyright (c) OpenMMLab. All rights reserved.
from mmdet.datasets.builder import DATASETS

from mmocr.core.evaluation.ocr_metric import eval_ocr_metric
from mmocr.datasets.base_dataset import BaseDataset
from mmocr.utils import is_type_list


[docs]@DATASETS.register_module() class OCRDataset(BaseDataset):
[docs] def pre_pipeline(self, results): results['img_prefix'] = self.img_prefix results['text'] = results['img_info']['text']
[docs] def evaluate(self, results, metric='acc', logger=None, **kwargs): """Evaluate the dataset. Args: results (list): Testing results of the dataset. metric (str | list[str]): Metrics to be evaluated. logger (logging.Logger | str | None): Logger used for printing related information during evaluation. Default: None. Returns: dict[str: float] """ assert isinstance(metric, str) or is_type_list(metric, str) gt_texts = [] pred_texts = [] for i in range(len(self)): item_info = self.data_infos[i] text = item_info['text'] gt_texts.append(text) pred_texts.append(results[i]['text']) eval_results = eval_ocr_metric(pred_texts, gt_texts, metric=metric) return eval_results
Read the Docs v: v0.6.3
Versions
latest
stable
v0.6.3
v0.6.2
v0.6.1
v0.6.0
v0.5.0
v0.4.1
v0.4.0
v0.3.0
v0.2.1
v0.2.0
v0.1.0
dev-1.x
Downloads
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.