Prediction of pregnancy-related disorders is usually done based on established and easily measured risk factors. Recent advances in metabolomics may provide earlier and more accurate prediction of women at risk of pregnancy-related disorders. We used data collected from women in the Born in Bradford (BiB; n = 8212) and UK Pregnancies Better Eating and Activity Trial (UPBEAT; n = 859) studies to create and validate prediction models for pregnancy-related disorders. These were gestational diabetes mellitus (GDM), hypertensive disorders of pregnancy (HDP), small for gestational age (SGA), large for gestational age (LGA) and preterm birth (PTB). We used ten-fold cross-validation and penalised regression to create prediction models. We compared the predictive performance of (1) risk factors (maternal age, pregnancy smoking, body mass index (BMI), ethnicity and parity) to (2) nuclear magnetic resonance-derived metabolites (N = 156 quantified metabolites, collected at 24–28 weeks gestation) and (3) combined risk factors and metabolites. The multi-ethnic BiB cohort was used for training and testing the models, with independent validation conducted in UPBEAT, a multi-ethnic study of obese pregnant women. Maternal age, pregnancy smoking, BMI, ethnicity and parity were retained in the combined risk factor and metabolite models for all outcomes apart from PTB, which did not include maternal age. In addition, 147, 33, 96, 51 and 14 of the 156 metabolite traits were retained in the combined risk factor and metabolite model for GDM, HDP, SGA, LGA and PTB, respectively. These include cholesterol and triglycerides in very low-density lipoproteins (VLDL) in the models predicting GDM, HDP, SGA and LGA, and monounsaturated fatty acids (MUFA), ratios of MUFA to omega 3 fatty acids and total fatty acids, and a ratio of apolipoprotein B to apolipoprotein A-1 (APOA:APOB1) were retained predictors for GDM and LGA. In BiB, discrimination for GDM, HDP, LGA and SGA was improved in the combined risk factors and metabolites models. Risk factor area under the curve (AUC 95% confidence interval (CI)): GDM (0.69 (0.64, 0.73)), HDP (0.74 (0.70, 0.78)) and LGA (0.71 (0.66, 0.75)), and SGA (0.59 (0.56, 0.63)). Combined risk factor and metabolite models AUC 95% (CI): GDM (0.78 (0.74, 0.81)), HDP (0.76 (0.73, 0.79)) and LGA (0.75 (0.70, 0.79)), and SGA (0.66 (0.63, 0.70)). For GDM, HDP and LGA, but not SGA, calibration was good for a combined risk factor and metabolite model. Prediction of PTB was poor for all models. Independent validation in UPBEAT at 24–28 weeks and 15–18 weeks gestation confirmed similar patterns of results, but AUCs were attenuated. Our results suggest a combined risk factor and metabolite model improves prediction of GDM, HDP and LGA, and SGA, when compared to risk factors alone. They also highlight the difficulty of predicting PTB, with all models performing poorly.