Skip to main content
LanceDB Cloud and Enterprise provide performant full-text search based on BM25 so you can incorporate keyword-based search into retrieval solutions.
The create_fts_index API returns immediately, but index building happens asynchronously.

Creating FTS Indexes

Check FTS index status using the API:

Configuration Options

FTS Parameters

ParameterTypeDefaultDescription
with_positionboolFalseStore token positions (required for phrase queries)
base_tokenizerstr"simple"Text splitting method (simple, whitespace, or raw)
languagestr"English"Language for stemming/stop words
max_token_lengthint40Maximum token size; longer tokens are omitted
lower_caseboolTrueLowercase tokens
stemboolTrueApply stemming (runningrun)
remove_stop_wordsboolTrueDrop common stop words
ascii_foldingboolTrueNormalize accented characters
  • max_token_length can filter out base64 blobs or long URLs.
  • Disabling with_position reduces index size but disables phrase queries.
  • ascii_folding helps with international text (e.g., “café” → “cafe”).

Phrase Query Configuration

Enable phrase queries by setting:
ParameterRequired ValuePurpose
with_positionTrueTrack token positions for phrase matching
remove_stop_wordsFalsePreserve stop words for exact phrase matching