An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-training
Academic Background

In recent years, self-supervised learning (SSL) has made significant progress in the field of computer vision. In particular, the successful application of masked image modeling (MIM) pre-training methods on large-sca...