• Author : Keyan Chen, Bowen Chen, Chenyang Liu, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

Remote sensing image classification is essential for interpreting Earth’s surface features and is foundational for various analytical tasks. Despite the remarkable progress made with convolutional neural networks (CNNs) and transformers, which have significantly improved the accuracy of classifications, the task remains challenging. The complexity and diversity of remote sensing scenarios, along with the variability in spatiotemporal resolutions, demand advanced techniques for effective scene discrimination.

Addressing these challenges, a new architecture known as RSMamba has been developed. RSMamba utilizes the State Space Model (SSM) framework and features an efficient, hardware-aware design called Mamba. This design allows RSMamba to benefit from both a global receptive field, which is crucial for understanding the entire image, and linear modeling complexity, which simplifies the computational process.

One of the key innovations of RSMamba is its ability to handle non-causal data, which is a significant improvement over the original Mamba model that was limited to modeling causal sequences and not well-suited for two-dimensional image data. The dynamic multi-path activation mechanism introduced in RSMamba enhances its capacity to process various types of image data, making it a versatile tool for remote sensing image classification.

RSMamba has been tested across multiple datasets and has demonstrated superior performance in classifying remote sensing images. This success suggests that RSMamba could serve as a robust backbone for future visual foundation models, providing a more nuanced and accurate understanding of remote sensing data. The developers of RSMamba plan to make the code available, allowing others in the field to utilize and build upon this innovative architecture.