RepsNet: A Nucleus Instance Segmentation Model Based on Boundary Regression and Structural Re-parameterization

Report on the Paper “RepsNet: A Nucleus Instance Segmentation Model Based on Boundary Regression and Structural Re-parameterization”

Academic Background

Pathological diagnosis is the gold standard for tumor diagnosis, and nucleus instance segmentation is a key step in digital pathology analysis and pathological diagnosis. However, the computational efficiency of the model and the treatment of overlapping targets are major challenges in current research. To address these issues, this paper proposes a neural network model, RepsNet, based on nucleus boundary regression and structural re-parameterization, for segmenting and classifying nuclei in H&E-stained histopathological images.

The distribution and morphological characteristics of nuclei (e.g., density, nuclear-to-cytoplasmic ratio, average size, and pleomorphism) are not only useful for assessing cancer grade but also for predicting therapeutic efficacy. However, pathological images are typically characterized by widespread adhesion of nuclei, multiple species, shape variability, and low contrast between the cytoplasmic background and nucleus foreground, making nucleus instance segmentation extremely challenging.

Source of the Paper

This paper is co-authored by Shengchun Xiong, Xiangru Li, Yunpeng Zhong, and Wanfen Peng, affiliated with the School of Computer Science, South China Normal University, and Signet Therapeutics. The paper was accepted on December 17, 2024, and published in the International Journal of Computer Vision.

Research Process and Results

Research Process

  1. Boundary Position Information (BPI) Estimation: RepsNet first estimates the boundary position information (BPI) of the parent nucleus for each pixel. The BPI estimation incorporates the local information of the pixel and the contextual information of the parent nucleus.

  2. Boundary Voting Mechanism (BVM): Through the proposed Boundary Voting Mechanism (BVM), RepsNet aggregates the BPIs from a series of pixels to estimate the nucleus boundary. The BVM inherently achieves a kind of synergistic belief enhancement among the BPIs from various pixels.

  3. Connected Component Analysis: Using the estimated nucleus boundary, the instance segmentation results are computed through a connected component analysis procedure.

  4. Structural Re-parameterization: RepsNet employs a re-parameterizable encoder-decoder structure. During the training phase, the model extracts features from receptive fields of various scales through multiple branches. During the inference phase, the multiple branches are merged into a single branch using structural re-parameterization, thereby reducing the model’s parameter count and computational burden.

Main Results

Experiments on the Lizard dataset demonstrate that RepsNet outperforms several typical benchmark models in both segmentation accuracy and inference speed. Specifically, RepsNet achieves an MPQ (Multi-class Panoptic Quality) of 0.5633 on the test set, an improvement of 0.0161 over the CONIC SOTA model, Stardist. Additionally, RepsNet can process 10 pathological images of size 256×256 pixels per second.

Conclusions and Significance

The main contributions of this paper include: - A novel nucleus instance segmentation scheme based on nucleus boundary regression and information aggregation (NBRI). This scheme distinguishes nucleus boundaries from other pixels by aggregating a series of boundary position estimations from nucleus pixels, improving the discrimination performance on the boundaries of cohesive nuclei. - A fully re-parameterizable encoder-decoder network, RepsNet, based on the NBRI scheme. This network enhances the model’s efficiency and feature extraction performance from receptive fields of various scales without increasing computational complexity during inference. - A loss function based on boundary isoheights, which adaptively penalizes boundary estimations with various deviations, enhancing the model’s adaptability to potential misannotations in the training dataset.

Research Highlights

  • Innovation: RepsNet significantly improves the segmentation accuracy of overlapping nuclei through boundary regression and information aggregation.
  • Efficiency: Through structural re-parameterization, RepsNet maintains high segmentation accuracy while significantly reducing computational burden.
  • Robustness: The proposed boundary voting mechanism and boundary loss function enhance the model’s robustness to blurred boundaries and misannotations.

Experiments and Evaluation

Dataset and Experimental Setup

Experiments were conducted on the Lizard dataset, which contains approximately 500,000 labeled nuclei across six categories: neutrophil, epithelial, lymphocyte, plasma, eosinophil, and connective tissue. The dataset was randomly divided into training, validation, and test sets in a ratio of 7:1:2.

Experimental Results

RepsNet outperforms benchmark models on multiple evaluation metrics, including AJI (Aggregated Jaccard Index), Dice coefficient, PQ (Panoptic Quality), and MPQ (Multi-class Panoptic Quality). Specifically, RepsNet achieves an MPQ of 0.5633 on the test set, an improvement of 1.61% over Stardist.

Ablation Studies

Ablation studies were conducted to validate the effectiveness of key components in RepsNet, including the RepVGG unit, RepUpsample module, and boundary loss function. The results show that these components significantly contribute to the model’s performance improvement.

Summary and Outlook

The proposed RepsNet model demonstrates excellent performance in nucleus instance segmentation tasks, outperforming existing models in segmentation accuracy while significantly improving computational efficiency through structural re-parameterization. Future work may further optimize the model’s architecture and explore additional data augmentation and regularization techniques to enhance the model’s generalization ability and segmentation accuracy.

Through this research, RepsNet shows great potential in digital pathology, offering a more efficient and accurate tool for automated pathological diagnosis.