In this tutorial, we use zeroentropy/zerank-2-reranker, a 4B Qwen3-based cross-encoder reranker, to improve retrieval quality. We start by setting up the runtime ...
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency. - Tencent/AngelSlim ...