🤪 Well, that’s not ranking

Compare your content against BM25-ranked competitors for your search query... because why not?

Language Support

Currently supports content in English and German. Language detection affects:

BM25 Family (Classic, Plus, L, Adaptive)

Impact: Scores for unsupported languages can be inflated by roughly 20-40% because stop words are counted as significant terms. BM25L and BM25-adpt are most affected because of their length normalization components.

Example: When processing Russian content (unsupported), common words like "в", "и", "на" remain in the text, leading to different term frequencies and document length calculations than intended.

TF-IDF and WDF*IDF

Impact: Scores typically come out 30-50% higher for unsupported languages, because unfiltered function words distort both the TF and IDF components. WDF*IDF is slightly more robust, showing 20-35% inflation.

Example: For Chinese content (unsupported), function words that should be filtered remain in the calculations, potentially inflating frequency scores of non-meaningful terms.

DFR-PL2 (Divergence From Randomness)

Impact: Scores can be 40-60% higher for unsupported languages because unfiltered stop words create artificial term frequency patterns that the Poisson model interprets as significant.

Example: For Arabic content (unsupported), common particles and articles remain unfiltered, which affects the term frequency normalization and probability calculations.

Note: If language detection fails or detects an unsupported language, English stop words are used as fallback. This affects all scoring functions and may impact ranking accuracy.
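
The fallback described in the note can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: the stop-word lists are deliberately tiny, and `STOP_WORDS`, `FALLBACK_LANGUAGE`, and `filter_tokens` are hypothetical names.

```python
# Hypothetical, deliberately tiny stop-word lists; real lists are much longer.
STOP_WORDS = {
    "en": {"the", "a", "an", "and", "of", "in", "on", "to", "is"},
    "de": {"der", "die", "das", "und", "in", "auf", "zu", "ist", "ein"},
}
FALLBACK_LANGUAGE = "en"  # assumption: English list is used for anything unsupported


def filter_tokens(tokens, detected_language):
    """Drop stop words, using the English list when the language is unsupported."""
    if detected_language in STOP_WORDS:
        language = detected_language
    else:
        language = FALLBACK_LANGUAGE
    stop_words = STOP_WORDS[language]
    return [t for t in tokens if t.lower() not in stop_words]


english = "the cat is on the mat".split()
russian = "кошка на столе и в доме".split()

# Supported language: stop words are removed before scoring.
print(filter_tokens(english, "en"))
# Unsupported language: the English list matches nothing, so "на", "и", "в"
# stay in and count toward term frequencies and document length.
print(filter_tokens(russian, "ru"))
```

In this sketch the Russian sentence keeps all six tokens, which is the effect the examples above describe: function words survive filtering and feed into every score downstream.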

Ranking Metrics We Calculate

BM25 Family

Classic BM25: The industry standard for document ranking, balancing term frequency, inverse document frequency, and document length.
Okapi at TREC-3 (Robertson et al., 1995), BM25 Revisited (Robertson & Zaragoza, 2009)
BM25+: Enhanced version that adds a lower bound to the term frequency component, so long documents that contain a query term are not over-penalized.
A New Term Frequency Normalization (Lv & Zhai, 2012)
BM25L: Variant optimized for long documents with improved length normalization.
Lower-Bounding Term Frequency Normalization (Lv & Zhai, 2011)
BM25-adpt: Adaptive version that automatically adjusts parameters based on document length.
A Study of the k1 Parameter in BM25 (Lv & Zhai, 2011), Parameter Sensitivity in BM25 (He & Ounis, 2013)
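
To make the difference between Classic BM25 and BM25+ concrete, here is a minimal sketch under stated assumptions: documents are already tokenized lists, the IDF is the common non-negative variant, and the defaults `k1=1.2`, `b=0.75` are conventional choices rather than this tool's confirmed settings. The function names are illustrative, and using the same IDF for both variants is a simplification.

```python
import math
from collections import Counter


def idf(term, docs):
    """Non-negative IDF variant: ln(1 + (N - n + 0.5) / (n + 0.5))."""
    n = sum(1 for d in docs if term in d)
    return math.log(1 + (len(docs) - n + 0.5) / (n + 0.5))


def bm25(query, doc, docs, k1=1.2, b=0.75, delta=0.0):
    """Score one tokenized document.

    delta=0.0 gives classic BM25; delta>0 adds the BM25+ lower bound
    for every query term that occurs in the document.
    """
    avgdl = sum(len(d) for d in docs) / len(docs)
    tf = Counter(doc)
    # Length normalization: longer-than-average documents get a larger penalty.
    norm = k1 * (1 - b + b * len(doc) / avgdl)
    score = 0.0
    for term in query:
        if tf[term] == 0:
            continue  # terms absent from the document contribute nothing
        saturation = tf[term] * (k1 + 1) / (tf[term] + norm)
        score += idf(term, docs) * (saturation + delta)
    return score


docs = [
    "bm25 ranking explained".split(),
    "ranking signals for seo content and more ranking".split(),
    "cooking pasta".split(),
]
query = ["bm25", "ranking"]

for doc in docs:
    classic = bm25(query, doc, docs)
    plus = bm25(query, doc, docs, delta=1.0)
    print(f"{' '.join(doc)[:30]:<30} classic={classic:.3f} plus={plus:.3f}")
```

The short document that matches both query terms ranks first, the longer one that repeats "ranking" comes second, and the unrelated one scores zero. With `delta=1.0` every matching document gains a fixed bonus per matched term, which is what keeps long documents from sinking too far.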

Additional Metrics

TF (Term Frequency): Basic measure of how often search terms appear, normalized by document length.
Term Frequency Normalization (Singhal et al., 1996)
TF-IDF: Classic algorithm combining term frequency with inverse document frequency.
A Statistical Interpretation of Term Specificity (Spärck Jones, 1972), TF-IDF 40 Years Later (Roelleke, 2013)
WDF*IDF: Enhanced TF-IDF that replaces raw term frequency with within-document frequency (WDF), a log-scaled count normalized by document length.
Probabilistic Models in IR (Amati & Van Rijsbergen, 2002)
DFR-PL2: Advanced probabilistic model using Divergence From Randomness with Poisson-Laplace.
Probabilistic Models of Information Retrieval Based on DFR (Amati & Van Rijsbergen, 2002), DFR Framework for IR (Clinchant & Gaussier, 2010)
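
As a rough sketch of how TF, TF-IDF, and WDF*IDF relate, the snippet below uses common textbook formulations, which may differ from what this tool computes. WDF is taken as log2(count + 1) / log2(document length), and the smoothed IDF is an assumption chosen to avoid division by zero; all function names are illustrative.

```python
import math


def tf(term, doc):
    """Term frequency: raw count divided by document length."""
    return doc.count(term) / len(doc)


def idf(term, docs):
    """Smoothed IDF: ln((1 + N) / (1 + n)) + 1."""
    n = sum(1 for d in docs if term in d)
    return math.log((1 + len(docs)) / (1 + n)) + 1


def wdf(term, doc):
    """Within-document frequency: log2(count + 1) / log2(length)."""
    if len(doc) < 2:
        return 0.0  # log2(1) == 0 would divide by zero
    return math.log2(doc.count(term) + 1) / math.log2(len(doc))


def tf_idf(term, doc, docs):
    return tf(term, doc) * idf(term, docs)


def wdf_idf(term, doc, docs):
    return wdf(term, doc) * idf(term, docs)


stuffed = "seo seo seo seo guide for new sites".split()
natural = "seo guide for new sites and small shops".split()
docs = [stuffed, natural]

# Both documents have 8 tokens; "seo" appears 4 times vs. once.
print("TF ratio: ", tf("seo", stuffed) / tf("seo", natural))
print("WDF ratio:", wdf("seo", stuffed) / wdf("seo", natural))
```

Repeating the keyword four times quadruples plain TF, while the logarithm in WDF dampens the gain, so keyword stuffing pays off less under WDF*IDF than under TF-IDF.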
