ATIR: Towards Audio-Text Interleaved Contextual Retrieval - Databubble