Although proximity queries are useful, the fact that they require all terms to be
present can make them overly strict.(((“proximity matching”, “using for relevance”)))(((“relevance”, “proximity queries for”))) It’s the same issue that we discussed in
<
a document is probably relevant enough to be worth showing to the user, but
the query would exclude it.
Instead of using proximity matching as an absolute requirement, we can
use it as a signal—as one of potentially many queries, each of which
contributes to the overall score for each document (see <
We can use a simple match
query as a clause. This is the query that
will determine which documents are included in our result set. We can trim
the long tail with the minimum_should_match
parameter. Then we can add other,
more specific queries as should
clauses. Every one that matches will
increase the relevance of the matching docs.
GET /my_index/my_type/_search
{
“query”: {
“bool”: {
“must”: {
“match”: { <1>
“title”: {
“query”: “quick brown fox”,
“minimum_should_match”: “30%”
}
}
},
“should”: {
“match_phrase”: { <2>
“title”: {
“query”: “quick brown fox”,
“slop”: 50
}
}
}
}
}
}
<1> The clause includes or excludes documents from the result set.
<2> The should
clause increases the relevance score of those documents that
match.