That means prioritizing experiments that discriminate between plausible explanations instead of simply refining a favored one. Each run should be tied to a clear branch: if this result appears, what do we believe next and what do we stop doing?
If the team cannot answer that question before the run, the experiment is probably serving curiosity more than decision quality.