|
Figure 3
Overview of the modified Faster R-CNN structure. The feature extractor in (a) produces feature maps of different sizes and hands them on to the RPN in (b). This layer extracts regions of interest (Rois) from the feature maps at different scales. The pooling layer in (c) aligns the proposed boxes from (b) and the largest feature map in (a). Non-maximum suppression eliminates overlapping boxes, resulting in the final predicted boxes together with their confidence scores. |