Test 8 - BoolBoosted vs PhaserBeam

Overview

Currently we are matching named features with a simple AND query.

This means that if something has the title of Treatise on Radioactivity, all of the below will match with equal scoring:

  • Treatise on Radioactivity

  • Radioactivity on Treatise

  • on Radioactivity Treatise

  • etc

This tests whether phrase matching is a better fit for this. You would think so given that is what it is for.

We wrap the query in a multi_match query, matching on the text type and keyword type fields, boosting the keyword field higher as that would infer an exact match.

Using the phrase type of multi_match then chooses the highest score of the two, and surfaces that.

Because we the lose the niceness of the AND search, we've added that as a tier, applied it similarly to the BaseQuery but boosted it by 2.

We think this should give us much better matching of items people know the name of, but retain the fetching of things that are loosely relevant.

Known unknowns

After running through the explain API with this, boosting seems to be similar, but not exactly the same as what you put in the query, we'll be exploring further as to why that is the case.

Glossary

Named feature A named feature is a piece of data in which the whole phrase contains semantic meaning. e.g. subjects, genres, titles, people and organisation names etc.

Results

TBD

Click through rate

ConstScore
BoolBoosted

first page only

TBD

TBD

beyond first page

TBD

TBD

Click distribution

TBD

Conclusions

TBD

Last updated