R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling - Databubble