This study evaluated inter- and intra-observer reliability and discriminative validity of three movement control tests: 1) standing knee-lift test; 2) static lunge test; and 3) dynamic lunge test. Thirty-eight subjects, 21 with low-back pain and 17 healthy, were video-recorded while performing the tests. Four physical therapists scored the tests according to a standardized protocol and calculated a composite score for each test based on the number of incorrect test components. Inter-observer reliability for the composite scores ranged between 0.68 and 0.80 (ICC 2,k) and intra-observer reliability between 0.54 and0.82 (ICC 2,1). The separate test components ranged between 0.32 and 0.91 (κfree) for inter-observer reliability and 0.42 and1.00 for intra-observer reliability. Test components showing the highest values were: back extension; arm lowering; and shoulders moving backwards. Components hip hitch, trunk lateral flexion, knee not lifted straight up and hips moving backwards did not reach accepted thresholds. Discriminative validity ranged between 0.47 and 0.56 (AUC). As our results showed an overall good agreement for the composite, scores and for the majority of the included test components the tests can be considered reliable enough. As the tests' discriminative ability was close to "none", they should, however, not be used for diagnostic purposes but should be further evaluated toward predicted validity.