BaseTextDetPostProcessor¶

class mmocr.models.textdet.BaseTextDetPostProcessor(text_repr_type='poly', rescale_fields=None, train_cfg=None, test_cfg=None)[源代码]¶

Base postprocessor for text detection models.

参数

text_repr_type (str) – The boundary encoding type, ‘poly’ or ‘quad’. Defaults to ‘poly’.
rescale_fields (list[str], optional) – The bbox/polygon field names to be rescaled. If None, no rescaling will be performed.
train_cfg (dict, optional) – The parameters to be passed to self.get_text_instances in training. Defaults to None.
test_cfg (dict, optional) – The parameters to be passed to self.get_text_instances in testing. Defaults to None.

返回类型

None

get_text_instances(pred_results, data_sample, **kwargs)[源代码]¶

Get text instance predictions of one image.

参数

pred_result (tuple(Tensor)) – Prediction results of an image.
data_sample (TextDetDataSample) – Datasample of an image.
**kwargs – Other parameters. Configurable via __init__.train_cfg and __init__.test_cfg.
pred_results (Union[torch.Tensor, List[torch.Tensor]]) –

返回

A new DataSample with predictions filled in. The polygon/bbox results are usually saved in TextDetDataSample.pred_instances.polygons or TextDetDataSample.pred_instances.bboxes. The confidence scores are saved in TextDetDataSample.pred_instances.scores.

返回类型

TextDetDataSample

poly_nms(polygons, scores, threshold)[源代码]¶

Non-maximum suppression for text detection.

参数

polygons (list[ndarray]) – List of polygons.
scores (list[float]) – List of scores.
threshold (float) – Threshold for NMS.

返回

keep_polys (list[ndarray]): List of preserved polygons after NMS.
keep_scores (list[float]): List of preserved scores after NMS.

返回类型

tuple(keep_polys, keep_scores)

rescale(results, scale_factor)[源代码]¶

Rescale results in results.pred_instances according to scale_factor, whose keys are defined in self.rescale_fields. Usually used to rescale bboxes and/or polygons.

参数

results (TextDetDataSample) – The post-processed prediction results.
scale_factor (tuple(int)) – (w_scale, h_scale)

返回

Prediction results with rescaled results.

返回类型

TextDetDataSample

split_results(pred_results)[源代码]¶

Split batched tensor(s) along the first dimension pack split tensors into a list.

参数

pred_results (tensor or list[tensor]) – Raw result tensor(s) from detection head. Each tensor usually has the shape of (N, …)

返回

N tensors if pred_results: is a tensor, or a list of N lists of tensors if pred_results is a list of tensors.

返回类型

list[tensor] or list[list[tensor]]