Abstract: Knowledge distillation (KD) has recently demonstrated remarkable potential in developing lightweight convolutional neural networks for remote sensing image (RSI) scene classification tasks.