arxiv2d ago
From Tokens to Concepts: Leveraging SAE for SPLADE
arXiv:2604.21511v1 Announce Type: cross Abstract: Learned Sparse IR models, such as SPLADE, offer an excellent efficiency-effectiveness tradeoff. However, they rely on the underlying backbone vocabulary, which might hinder performance (polysemicity and synonymy) and pose a challenge for multi-lingua