Paper page - TIDE: Every Layer Knows the Token Beneath the Context
…View arXiv page View PDF Add to collection Community We investigate rare token and contextual collapse problem in LLM design and propose to inject token identity information to each transformer layer. the…