The First Token Knows: Single-Decode Confidence for Hallucination Detection
…Semantic self-consistency improves this by clustering sampled answers by meaning using natural language inference, but it adds both sampling cost and external inference overhead. We show that first-token confidence, phi…
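
To make the semantic self-consistency baseline mentioned in the excerpt concrete, here is a minimal sketch. It is an illustration under assumptions, not the paper's implementation. The function names (`cluster_by_meaning`, `self_consistency`, `toy_entails`), the greedy clustering on mutual entailment, and the "largest cluster share" score are all hypothetical choices. `toy_entails` is only a stand-in for a real natural language inference model.

```python
from typing import Callable, List

# Hypothetical signature: entails(premise, hypothesis) -> bool.
Entails = Callable[[str, str], bool]


def cluster_by_meaning(answers: List[str], entails: Entails) -> List[List[str]]:
    """Greedily group answers that entail each other in both directions."""
    clusters: List[List[str]] = []
    for answer in answers:
        for cluster in clusters:
            representative = cluster[0]
            # Two NLI calls per comparison: this is the external inference cost.
            if entails(answer, representative) and entails(representative, answer):
                cluster.append(answer)
                break
        else:
            # No existing cluster matched, so this answer starts a new one.
            clusters.append([answer])
    return clusters


def self_consistency(answers: List[str], entails: Entails) -> float:
    """Share of sampled answers that fall in the largest meaning cluster."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    clusters = cluster_by_meaning(answers, entails)
    return max(len(cluster) for cluster in clusters) / len(answers)


def toy_entails(premise: str, hypothesis: str) -> bool:
    """Assumed stand-in for an NLI model: compares text ignoring case and punctuation."""
    def normalize(text: str) -> str:
        return "".join(ch for ch in text.lower() if ch.isalnum())
    return normalize(premise) == normalize(hypothesis)


# Four sampled answers to the same question (made-up data for illustration).
samples = ["Paris", "paris.", "Lyon", "PARIS"]
score = self_consistency(samples, toy_entails)
```

In this sketch every entry in `samples` stands for a separate decode, and every comparison inside `cluster_by_meaning` calls the entailment function twice, which is where the sampling cost and external inference overhead described above come from.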