ViqusViqus
Navigate
Company
Blog
About Us
Contact
System Status
Enter Viqus Hub

Ettin Reranker Family Released: State-of-the-Art Components for RAG Systems

Sentence Transformers CrossEncoder Reranker ModernBERT Information Retrieval MTEB
May 19, 2026
Viqus Verdict Logo Viqus Verdict Logo 6
Component Upgrade: Boosting RAG Accuracy
Media Hype 4/10
Real Impact 6/10

Article Summary

The latest release introduces six new Ettin Reranker models—spanning sizes from 17M to 1B—built on the robust Ettin ModernBERT encoders. These cross-encoders are designed to address the inherent limitations of vector embedding models, providing superior relevance scoring by jointly encoding the query and document pair. The models are paired with the embedding model `google/embeddinggemma-300m` and perform exceptionally well on the MTEB (English v2) Retrieval benchmark. The guide provides comprehensive details on implementation, showcasing a full retrieve-then-rerank pipeline, and optimizing usage with techniques like using bfloat16 and Flash Attention 2 for significant throughput gains.

Key Points

  • The new rerankers offer multiple model sizes (17M to 1B) allowing developers to trade off between computational cost and ranking accuracy.
  • The architecture supports a 'retrieve-then-rerank' pipeline, where fast embedding models retrieve candidates, and the reranker accurately reorders them for superior final results.
  • Performance can be significantly optimized using advanced techniques like bfloat16 and Flash Attention 2, leading to substantial speedups (up to 8.3x).

Why It Matters

This announcement is highly valuable to practitioners building production-grade RAG systems. While the technology (rerankers) is not new, the consistent release of state-of-the-art, highly optimized, and systematically sized components (the 'Ettin Reranker Family') lowers the barrier to entry for achieving high retrieval accuracy at scale. For professional developers, this means more reliable components to integrate, directly improving the quality and trustworthiness of AI systems that rely on external knowledge retrieval.

You might also be interested in