Neural re-ranking rescoring top candidates with a deep model greatly improves ordering. It reads the query and document together for fine-grained relevance.
Neural re-ranking uses a cross-encoder to jointly encode query and candidate text, producing high-quality relevance scores for the short list returned by recall.
Cross-encoders fix ordering on hard cases. Keep k small and models optimized.