Automatic segmentation of the uterus and fetus from 3D fetal ultrasound images remains a challenging problem because of inherent limitations of fetal ultrasound, such as relatively low image quality and intensity variations. In this work, we present a novel framework for the joint segmentation of the uterus and fetus. It consists of two main components: a task-specific fully convolutional neural network (FCN) and a bidirectional convolutional LSTM (BiCLSTM). Our framework is inspired by a simple observation: the segmentation task can be decomposed into multiple easier-to-solve subproblems. More specifically, the encoder of the FCN extracts object-relevant features from the ultrasound slices, the BiCLSTM layer models the inter-slice correlations, and the final two branches of the FCN decoder produce the uterus and fetus predictions. In this way, the burden of the whole problem is distributed among the different parts of our network, making fuller use of its capacity. Furthermore, we propose a spatially constrained loss that restricts the spatial positions of the segmented uterus and fetus to boost performance. Quantitative results demonstrate the effectiveness of the proposed method.
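As an illustration of what a spatial constraint between the two predictions could look like, the following minimal sketch penalizes voxels where the predicted fetus probability exceeds the predicted uterus probability. This rests on an assumption that the text above does not state: that the constraint encodes "the fetus lies inside the uterus". The function name `spatial_constraint_penalty` and the hinge-style formula are hypothetical and are not taken from the proposed method.

```python
def spatial_constraint_penalty(p_uterus, p_fetus):
    """Mean amount by which fetus probability exceeds uterus probability.

    Both arguments are flat sequences of per-voxel probabilities in [0, 1].
    ASSUMPTION (not from the source): the fetus should be contained in the
    uterus, so any voxel with p_fetus > p_uterus is treated as a violation.
    """
    if len(p_uterus) != len(p_fetus):
        raise ValueError("probability maps must have the same number of voxels")
    if len(p_uterus) == 0:
        raise ValueError("probability maps must not be empty")

    # Hinge on the difference: zero when the fetus prediction stays
    # within the uterus prediction, positive otherwise.
    excess = [max(0.0, f - u) for u, f in zip(p_uterus, p_fetus)]
    return sum(excess) / len(excess)


# Fetus predicted inside the uterus: no penalty.
inside = spatial_constraint_penalty([1.0, 1.0, 0.0], [1.0, 0.0, 0.0])

# Fetus predicted at a voxel with zero uterus probability: penalized.
outside = spatial_constraint_penalty([0.0, 1.0], [1.0, 1.0])
```

In practice a term of this kind would presumably be added, with a weighting factor, to a standard per-branch segmentation loss; the weighting and the base loss are likewise left unspecified here.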