Wang, W., Dai, J., Chen, Z., Huang, Z., Li, Z., Zhu, X., Hu, X., Lu, L., Li, H., Wang,
X., and Qiao, Y. (2023), Internimage: Exploring large-scale vision foundation models
with deformable convolutions, Proceedings of the IEEE/CVF conference on computer vision
and pattern recognition, 14408-14419.
