September 2021
tl;dr: BiFPN and multidimensional scaling of object detection.
This paper follows up on the work of EfficientNet. The FPN neck essentially is a multi-scale feature fusion that aims to find a transformation that can effectively aggregate different features and output a list of new features.
- BiFPN (bidirectional FPN) (<-- PANet <-- FPN)
- PANet to introduce bottom up pathway again.
- Remove nodes from PANet that has only has one input edge.
- Add skip connection from original input to the output node if they are at the same level
- Repeat blocks of the above BiFPN block.
- Weighted feature fusion
- Baseline is to resize and sum up. Each feature may have different weight contribution (feature level attention).
- Softmax works, but a linear weighting normalization may work as well.
- Multidimensional/compound scaling up is more effective than single dimension scaling. Resolution, depth and width.