Docs

Search Configuration

Configure search settings for sanctions list

What is Search Configuration?

Search Configuration is a feature in Ambriel's AML solution that allows you to customize how sanctions lists are searched. You can configure the search settings to better suit your business needs.

What are the Configurations?

The following are the configurations you can make:

Frequency

The frequency determines how often the sanctions list is updated. The options are:

  • Daily
  • Weekly
  • Bi-Weekly
  • Quarterly
  • Monthly
  • Yearly

Enable Sanctions

Enable sanctions determines whether the sanctions list is enabled or not. If enabled, the sanctions list will be searched when you search for an entity.

Enable PEP

Enable PEP determines whether the PEP list is enabled or not. If enabled, the PEP list will be searched when you search for an entity.

Enable Crime

Enable crime determines whether the crime list is enabled or not. If enabled, the crime list will be searched when you search for an entity.

Enable Regulatory

Enable regulatory determines whether the regulatory list is enabled or not. If enabled, the regulatory list will be searched when you search for an entity.

Search Type

The search type determines how the search is performed. The options are:

  • Exact
  • Fuzzy

Score Threshold

The score_threshold determines the minimum similarity score required for a document to be considered a match. The score ranges from 0.0 (no similarity) to 1.0 (perfect match). Only results with a score equal to or above the specified threshold are returned.

Score ThresholdSearch QueryMatchResult
0.4sarah is a developersarah developer
0.6sarah is a developersarah developer
0.8sarah is a developersarah developer
0.7steve jobs biographysteve bio
0.9steve jobs biographysteve bio
0.5full stack engineerfull engineer
0.8full stack engineerfull engineer

Note

A lower score threshold allows more flexible (but potentially less accurate) matches. A higher score threshold enforces stricter matching.

Fuzzy Search Settings

The following are the fuzzy search settings you can make:

Edit Distance Thresholds

These thresholds control when typo-tolerant matching kicks in, based on the length of each token in the query.

min_len_1typo

A token must be at least this long to be allowed 1 typo. Default: 4

min_len_1typoTokenMatch TokenToken LengthAllowed TyposMatch?
3johnjon41 Yes – meets length
3jonjohn31 Yes – meets length
5davedavid40 No – token too short for typos
5daviddave51 Yes – meets threshold
4mikemiek41 Yes – meets threshold
4amyami30 No – too short for typo matching

Edit distance threshold for two typos

min_len_2typos

A token must be at least this long to be allowed 2 typos. Default: 8

min_len_2typosToken/QueryMatchLongest Token LengthAllowed TyposMatch?
5stevesteev51 Yes – within 1 typo
5steevsteve51 Yes – within 1 typo
7sarah is a developersarah is a devdeveloper → 9≤2 Yes – allowed (token length > 7)
7sarah is a devsarah is a developerdeveloper → 9≤2 Yes – allowed
6danieldanyel62 Yes – allowed
6danyeldaniel62 Yes – allowed
6benbann30 No – too short for typos

Num Typos

The num typos determines the maximum number of typos allowed in a search query.

Num TyposSearch QueryMatchResult
1stevesteev
1steevsteve
2sarah is a developersarah is a dev
2sarah is a devsarah is a developer

Important

Typos are allowed per token, not across the entire phrase.

Num TyposSearch QueryMatchResult
2michaelmykael
1michaelmykael

Drop Tokens Threshold

drop_tokens_threshold determines the minimum number of results needed before Search engine will begin dropping tokens (words) from the query. This parameter accepts only integer values.

0: Disables token dropping (behaves like a strict AND search).

≥1: If the search returns fewer than this number of results, Search engine will iteratively remove tokens (starting with those least likely to match) until enough results are found or only one token remains.

Drop Tokens ThresholdSearch QueryBehavior & Match
0sarah is a developerNo token dropping – exact presence of all tokens required for a match.
1 (default)sarah is a devIf fewer than 1 result, starts dropping “a”; if still too few, drops “is”, etc.
2steve is a dev todayIf fewer than 2 results, can drop “today” first, then “dev”, until ≥2 results.
3java spring boot microservicesIf < 3 results, drops least matching token (e.g., “microservices”), retry search.

Use Case

Useful when search queries contain noise words (like "a", "is", "the", etc.), allowing better match scoring without requiring exact matches.

Tip

Adjusting drop_tokens_threshold lets you control how relaxed or strict the search should be regarding missing words.

Use CaseValue
Strict exact-ish match0.8 or 0.9
Allow for common noise words (is, the, a)0.5 to 0.6
Require all tokens to match0

Typo Tokens Threshold

The typo_tokens_threshold parameter controls how many tokens in the search query are allowed to have typos (i.e., fuzzy matching). The remaining tokens must match exactly.

It does not limit the number of typos per token — that’s controlled by num_typos.

It does limit how many tokens in the query are eligible for typo tolerance.

Search Query: "Bernadette Hamill Johnson"
→ Tokens: ["Bernadette", "Hamill", "Johnson"]
Value (default=1)Max Tokens Allowed with TyposMatch Behavior
11Only 1 token can have typos, 2 tokens must match exactly
22Up to 2 tokens can have typos, 1 must match exactly
33All tokens can have typos (loosest match)

Use Case Recommendations

Use Casetypo_tokens_threshold Value
Strict exact-ish match1
Allow for common noise words (is, the, a)2
Loose fuzzy matching (name search)3

Remarks

Attention!

Loose fuzzy matching (name search) and less score threshold is not recommended. It can lead to unexpected/false-positive results. Be careful with this setting and use it with caution. Remember to always double-check the search results.

How to Configure Search Settings?

Login to your Ambriel dashboard.

Go to the Settings menu and select AML Configuration

Configure the search settings as desired.