SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval - Databubble