In recent years, optimization-inspired networks that integrate optimization theory into deep neural networks (DNNs) have achieved remarkable success in image compressed sensing (ICS). However, current DNN-based ICS methods face two key challenges: (1) the difficulty of acquiring ground-truth (GT) data from labeled measurements and (2) the underutilization of measurement data. To address these issues, a novel self-supervised deep learning framework is proposed to solve the inverse problem in ICS. Building upon traditional self-supervised learning, our proposed method leverages measurement values through a multi-branch, multi-stage progressive cross-contrast structure, enabling effective learning of the common prior of the original image in the absence of GT data. Based on this framework, a novel Multi-branch Multi-stage Cross-contrast CS (MMC-CS) end-to-end DNN is designed, which unfolds the Proximal Gradient Descent (PGD) algorithm. Our approach further implements multi-scale co-optimization in the image reconstruction process, integrating both the image paths (IPs) and convolutional feature paths (CFPs). Additionally, the reconstruction performance across multiple spatial levels is enhanced by incorporating wavelet convolution (WTConv). Extensive experiments demonstrate that our proposed method outperforms existing self-supervised approaches, achieving an average PSNR improvement of 0.3-1.6 dB. Moreover, it shows strong potential to compete with state-of-the-art supervised methods.