Paper page - DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models
…Encoding cost stays roughly constant in K instead of scaling with it. Findings. Multi-token helps every diffusion backbone we test, on every benchmark (MS MARCO, TREC DL'19/'20, BEIR-7…
