Search

A catalog of any useful size is hard to browse — one plugin in the demo catalog contains 28 entries on its own. You should not have to remember what each entry is called. Tome indexes every enabled skill and command so you can describe what you need and get the right entry back.

`tome query`

tome query "verify a Compact contract" --rerank

The query text is variadic and space-joined, so quoting is optional: tome query reset a counter works unquoted. Pass -q/--query "…" instead when the query itself contains flag-like or shell-significant tokens; the quoted form is mutually exclusive with the positional words.

Reranking is off by default (see Reranking below), so the example above passes --rerank to enable it. Its output — with reranked scores and rerank=true in the header — looks like this:

top_k=10  rerank=true  min_score=none  (10 results)
|   Score | Catalog         | Plugin          | Name                                      | Type    | Version | Path                                                      |
|---------|-----------------|-----------------|-------------------------------------------|---------|---------|-----------------------------------------------------------|
|  4.7874 | midnight-expert | midnight-verify | midnight-verify:verify-by-execution       | skill   | 0.13.0  | skills/midnight-verify:verify-by-execution/SKILL.md       |
|  3.4658 | midnight-expert | midnight-verify | midnight-verify:verify-by-zkir-checker    | skill   | 0.13.0  | skills/midnight-verify:verify-by-zkir-checker/SKILL.md    |
|  3.1529 | midnight-expert | midnight-verify | midnight-verify:verify-compact            | skill   | 0.13.0  | skills/midnight-verify:verify-compact/SKILL.md            |
|  2.7010 | midnight-expert | midnight-verify | midnight-verify:verify-by-witness         | skill   | 0.13.0  | skills/midnight-verify:verify-by-witness/SKILL.md         |
|  1.4746 | midnight-expert | midnight-verify | midnight-verify:verify-by-cli-execution   | skill   | 0.13.0  | skills/midnight-verify:verify-by-cli-execution/SKILL.md   |
|  0.0356 | midnight-expert | midnight-verify | midnight-verify:verify                    | command | 0.13.0  | commands/midnight-verify:verify.md                        |
| -0.4743 | midnight-expert | midnight-verify | midnight-verify:verify-by-source          | skill   | 0.13.0  | skills/midnight-verify:verify-by-source/SKILL.md          |
| -1.0289 | midnight-expert | midnight-verify | midnight-verify:verify-by-zkir-inspection | skill   | 0.13.0  | skills/midnight-verify:verify-by-zkir-inspection/SKILL.md |
| -1.2946 | midnight-expert | midnight-verify | midnight-verify:verify-tooling            | skill   | 0.13.0  | skills/midnight-verify:verify-tooling/SKILL.md            |
| -3.3564 | midnight-expert | midnight-verify | midnight-verify:verify-ledger             | skill   | 0.13.0  | skills/midnight-verify:verify-ledger/SKILL.md             |

The right skill is at the top with a clear margin, and the scores drop steeply — below zero for entries that only share vocabulary with the query.

The dim header line above the table shows the effective knobs that produced the results — the resolved top_k, whether reranking ran, the applied min_score floor (or none when no floor is enforced), and the result count. It is shown only in an interactive terminal; piped or redirected output omits it so the table stays clean to grep. The Type column reports whether each result is a skill, command, or agent.

Search runs in one or two stages:

KNN retrieval — your query is embedded with a local model and matched against the vector index to retrieve the nearest candidates. This always runs; the score is 1.0 − cosine distance.
Reranking (optional) — a cross-encoder reranker re-scores those candidates so the most relevant results are ranked first. This stage is off by default; see Reranking.

The embedder runs on your machine; nothing is sent anywhere unless you point a capability at an external provider.

Reranking

Reranking is off by default. KNN retrieval over the embeddings alone gives a usable ranking, and the bundled cross-encoder reranker (bge-reranker) is the largest model Tome ships, so it is an opt-in quality boost rather than a default cost. When reranking is off, results are ordered by embedding similarity and the scoring field reads embedding-similarity; when it is on, results are re-scored by the cross-encoder and scoring reads reranked.

There are three ways to turn reranking on:

Per query — pass --rerank to enable it for a single run (or --no-rerank to force it off for a run when it is otherwise on). The two flags are mutually exclusive.
In config — set [query] rerank = true in ~/.tome/config.toml to enable it for every query. Setting rerank = false disables it; an explicit config value always wins over the implicit enable below.
Implicitly, via a reranker provider — if you point the reranker capability at an external provider ([reranker] provider and model), reranking is enabled without also setting [query] rerank. Configuring a reranker backend is taken as a clear intent to use one.

A per-query flag overrides both the config value and the implicit enable.

Scoping and flags

Flag	Effect
`--top-k <n>`	Return at most n results.
`--min-score <s>`	Set the score floor used by `--strict`. On its own it changes nothing about which rows return — it's reflected in the header line and the `threshold_passed` JSON field, but results are only dropped when `--strict` is also passed. Defaults to `0.5` (embedding similarity — the default path) or `0.0` when reranking is on.
`--rerank`	Run the reranking stage for this query (off by default). Mutually exclusive with `--no-rerank`. See Reranking.
`--no-rerank`	Skip the reranking stage; results come back in raw KNN order. Only meaningful when reranking is otherwise on (via `[query] rerank` or a configured reranker provider). Mutually exclusive with `--rerank`.
`--catalog <name>`	Restrict the search to a catalog. Repeatable: pass `--catalog` several times to include entries from any of the named catalogs.
`--plugin <name>`	Restrict the search to a plugin (across all catalogs unless `--catalog` is also set). Repeatable: include entries from any of the named plugins.
`--kind <kind>`	Restrict the search to an entry kind (`skill`, `command`, or `agent`). Repeatable. `query` only searches indexed, searchable entries, so `--kind agent` typically returns nothing.
`-q`, `--query <text>`	The query as a single quoted string, instead of the positional words. Mutually exclusive with the positional form.
`--strict`	Fail (non-zero exit) instead of returning weak results when no result scores high enough.
`--json`	Emit machine-readable output.

The repeatable scoping flags compose, so you can narrow by kind and several plugins at once:

tome query reset a counter --kind skill --plugin a --plugin b

Limit results with `--top-k`

tome query "verify a Compact contract" --top-k 3 --rerank

top_k=3  rerank=true  min_score=none  (3 results)
|  Score | Catalog         | Plugin          | Name                                   | Type  | Version | Path                                                   |
|--------|-----------------|-----------------|----------------------------------------|-------|---------|--------------------------------------------------------|
| 4.3648 | midnight-expert | midnight-verify | midnight-verify:verify-by-execution    | skill | 0.13.0  | skills/midnight-verify:verify-by-execution/SKILL.md    |
| 3.8602 | midnight-expert | midnight-verify | midnight-verify:verify-by-zkir-checker | skill | 0.13.0  | skills/midnight-verify:verify-by-zkir-checker/SKILL.md |
| 3.6187 | midnight-expert | midnight-verify | midnight-verify:verify-compact         | skill | 0.13.0  | skills/midnight-verify:verify-compact/SKILL.md         |

Same query, same top three entries — different scores than the ten-result run above. Reranker scores are relative to the candidate set, not absolute, so compare scores within a single run, never across runs.

Why search matters: load on demand

The point of search is load on demand. Instead of loading every skill into your agent's context in advance, the agent searches at runtime and loads only what the current task needs. That:

protects the context window — skills that aren't relevant never take up space;
cuts token spend — you pay for the skills you use, not for every enabled entry;
scales — a large catalog stays usable because retrieval, not context size, does the filtering.

For example: the top result above, verify-by-execution, is a single SKILL.md of 11,652 characters (1,539 words). Loading it costs one skill's worth of context — the plugin's other 27 entries are not loaded.

Inside a configured harness, this same search runs over the MCP server, so your agent gets search and skill loading without you running tome query by hand.

Pitfalls

Exit code	What happened	What to do
`40`	`--strict` was set and no result scored high enough.	Expected in scripts — treat it as "no match", or broaden the query. See exit codes.

Where next

MCP server — the same search, driven by your agent mid-task.
Plugins & catalogs — what gets indexed, and when.

tome query​

Reranking​

Scoping and flags​

Limit results with --top-k​

Why search matters: load on demand​

Pitfalls​

Where next​