BaseTextDetPostProcessor
- class mmocr.models.textdet.BaseTextDetPostProcessor(text_repr_type='poly', rescale_fields=None, train_cfg=None, test_cfg=None)
Base postprocessor for text detection models.
- Parameters
text_repr_type (str) – The boundary encoding type, ‘poly’ or ‘quad’. Defaults to ‘poly’.
rescale_fields (list[str], optional) – The bbox/polygon field names to be rescaled. If None, no rescaling will be performed.
train_cfg (dict, optional) – The parameters to be passed to self.get_text_instances in training. Defaults to None.
test_cfg (dict, optional) – The parameters to be passed to self.get_text_instances in testing. Defaults to None.
- Return type
- get_text_instances(pred_results, data_sample, **kwargs)
Get text instance predictions of one image.
- Parameters
pred_results (Union[torch.Tensor, List[torch.Tensor]]) – Prediction results of an image.
data_sample (TextDetDataSample) – Datasample of an image.
**kwargs – Other parameters. Configurable via __init__.train_cfg and __init__.test_cfg.
- Returns
A new DataSample with predictions filled in. The polygon/bbox results are usually saved in TextDetDataSample.pred_instances.polygons or TextDetDataSample.pred_instances.bboxes. The confidence scores are saved in TextDetDataSample.pred_instances.scores.
- Return type
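The way train_cfg and test_cfg reach get_text_instances as keyword arguments can be illustrated with a small stand-in. This is only a sketch and not MMOCR code: SketchPostProcessor, the training flag, the min_score option, and the use of a plain dict in place of TextDetDataSample are all assumptions made for illustration.

```python
# Stand-in classes only: none of this is MMOCR code.
class SketchPostProcessor:
    """Hypothetical postprocessor mirroring the documented constructor arguments."""

    def __init__(self, train_cfg=None, test_cfg=None):
        # Treat a missing config as "no extra keyword arguments".
        self.train_cfg = dict(train_cfg or {})
        self.test_cfg = dict(test_cfg or {})

    def __call__(self, pred_results, data_sample, training=False):
        # Assumption: the current mode selects which config becomes **kwargs.
        cfg = self.train_cfg if training else self.test_cfg
        return self.get_text_instances(pred_results, data_sample, **cfg)

    def get_text_instances(self, pred_results, data_sample, min_score=0.0):
        # `min_score` is a hypothetical option showing a configurable kwarg.
        polygons, scores = pred_results
        kept = [(p, s) for p, s in zip(polygons, scores) if s >= min_score]
        # A plain dict stands in for TextDetDataSample.pred_instances.
        data_sample["pred_instances"] = {
            "polygons": [p for p, _ in kept],
            "scores": [s for _, s in kept],
        }
        return data_sample


processor = SketchPostProcessor(test_cfg={"min_score": 0.5})
preds = ([[0, 0, 10, 0, 10, 10, 0, 10], [5, 5, 8, 5, 8, 8, 5, 8]], [0.9, 0.3])
test_sample = processor(preds, {})                  # test_cfg applies
train_sample = processor(preds, {}, training=True)  # train_cfg (empty) applies
```

In this sketch the test-mode call filters out the low-scoring polygon because test_cfg supplies min_score, while the training-mode call keeps both because train_cfg is empty.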
- poly_nms(polygons, scores, threshold)
Non-maximum suppression for text detection.
- Parameters
- Returns
keep_polys (list[ndarray]): List of preserved polygons after NMS.
keep_scores (list[float]): List of preserved scores after NMS.
- Return type
tuple(keep_polys, keep_scores)
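For readers unfamiliar with non-maximum suppression, the greedy procedure can be sketched in plain Python. This is not the library's implementation: to stay dependency-free it measures overlap with the IoU of each polygon's axis-aligned bounding box rather than true polygon IoU, it assumes polygons are flat [x1, y1, x2, y2, ...] lists, and it assumes a candidate is dropped when its overlap with a kept polygon exceeds threshold. All function names are hypothetical.

```python
def bounding_box(polygon):
    """Axis-aligned bounding box of a flat [x1, y1, x2, y2, ...] polygon."""
    xs, ys = polygon[0::2], polygon[1::2]
    return min(xs), min(ys), max(xs), max(ys)


def box_iou(a, b):
    """IoU of two (x_min, y_min, x_max, y_max) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_nms_sketch(polygons, scores, threshold):
    """Keep the highest-scoring polygons, dropping those that overlap a kept one."""
    # Visit candidates from highest to lowest score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep_polys, keep_scores, kept_boxes = [], [], []
    for i in order:
        box = bounding_box(polygons[i])
        # Simplification: bounding-box IoU stands in for polygon IoU.
        if all(box_iou(box, kept) <= threshold for kept in kept_boxes):
            keep_polys.append(polygons[i])
            keep_scores.append(scores[i])
            kept_boxes.append(box)
    return keep_polys, keep_scores


polys = [
    [0, 0, 10, 0, 10, 10, 0, 10],      # highest score, kept
    [1, 1, 11, 1, 11, 11, 1, 11],      # overlaps the first heavily, dropped
    [20, 20, 30, 20, 30, 30, 20, 30],  # disjoint, kept
]
keep_polys, keep_scores = greedy_nms_sketch(polys, [0.9, 0.8, 0.7], threshold=0.5)
```

The return value follows the documented shape, a tuple of the preserved polygons and their scores in descending score order.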
- rescale(results, scale_factor)
Rescale the fields of results.pred_instances that are named in self.rescale_fields according to scale_factor. Usually used to rescale bboxes and/or polygons.
- Parameters
results (TextDetDataSample) – The post-processed prediction results.
- Returns
The prediction results with the selected fields rescaled.
- Return type
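The rescaling step can be pictured with a short sketch. Several details here are assumptions rather than documented behavior: scale_factor is taken to be a (w_scale, h_scale) pair, coordinates are divided by it to map predictions back to the original image size, polygons are flat [x1, y1, x2, y2, ...] lists, and a plain dict stands in for results.pred_instances. The function names are hypothetical.

```python
def rescale_polygon(polygon, scale_factor):
    """Divide x coordinates by w_scale and y coordinates by h_scale."""
    w_scale, h_scale = scale_factor
    # Even indices hold x values, odd indices hold y values.
    return [v / (w_scale if i % 2 == 0 else h_scale) for i, v in enumerate(polygon)]


def rescale_fields_sketch(pred_instances, rescale_fields, scale_factor):
    """Rescale only the fields listed in rescale_fields; None means no rescaling."""
    for key in rescale_fields or []:
        pred_instances[key] = [
            rescale_polygon(poly, scale_factor) for poly in pred_instances[key]
        ]
    return pred_instances


rescaled = rescale_fields_sketch(
    {"polygons": [[10, 20, 30, 20, 30, 40, 10, 40]], "scores": [0.9]},
    ["polygons"],
    (0.5, 2.0),
)
untouched = rescale_fields_sketch(
    {"polygons": [[10, 20, 30, 20, 30, 40, 10, 40]], "scores": [0.9]},
    None,
    (0.5, 2.0),
)
```

Fields that are not listed, such as the scores here, pass through unchanged, and passing None for the field list leaves every field as it was, matching the documented behavior of rescale_fields.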