Options

Every parameter has an evidence-backed default: Document.from_text(text) with nothing else is a complete call. This page is the full reference for when you want to tune something.

`Document.from_text(text, ...)`

Chunking (index-time, fixed at construction):

param	default	what it does
`source`	`"document"`	a label for the document
`chunk_size`	`128`	target tokens per chunk
`chunk_overlap`	`1`	sentences of overlap between adjacent chunks

Assembly (how the context is built from candidates):

param	default	what it does
`strategy`	`"auto"`	`auto` · `reasoning_preserving` · `distractor_filtered` · `redundancy_pruned` · `max_density` · `raw_topk`
`token_budget`	`8192`	default assembled-context budget (override per call)
`candidate_k`	`20`	how many candidates to retrieve before assembly

Retrieval (lexical by default. The rest only apply when retrieval="semantic", see Retrieval options):

param	default	what it does
`retrieval`	`"lexical"`	`"lexical"` (BM25), `"hybrid"` (BM25 prune → dense rerank), or `"semantic"` (dense over every chunk)
`model`	`None`	name of a built-in embedding model to download: `"bge-small"` / `"bge-base"`
`candidate_pool`	`50`	hybrid only: BM25 prune depth before the dense rerank
`rerank`	`None`	optional cross-encoder second stage: `"cross-encoder"` (auto-downloaded) re-scores the candidate pool by jointly reading each `(query, passage)` pair. Works on any tier, off by default (details)

The rest are for bringing your own embedding model instead of model= (advanced, see Bring your own model):

param	default	what it does
`embedder_model`	`None`	path to a local embedding model
`embedder_tokenizer`	`None`	path to its tokenizer
`embedder_dim`	`384`	the model’s output dimension
`embedder_pooling`	`"cls"`	`"cls"` or `"mean"` (matches the model family)
`embedder_query_prefix`	`None`	query prefix for asymmetric models (E5: `"query: "`)
`embedder_passage_prefix`	`None`	passage prefix for asymmetric models (E5: `"passage: "`)

Lexical analyzer (added in 0.2.0, default flipped in 0.3.2):

param	default	what it does
`language`	raw pipeline (no stemming)	The 0.3.2 default is a minimal pipeline (tokenize → lowercase → ASCII fold, with no stemmer, no stopword filter, no CamelCase split), measured to beat the previous English-Snowball default on three English workloads (RAW_ANALYZER). Pass `language="english"` to opt back in to the Snowball pipeline (useful for code search and inflection-heavy English). For non-English content pass one of the 18 Snowball Porter2 languages: `arabic`, `danish`, `dutch`, `english`, `finnish`, `french`, `german`, `greek`, `hungarian`, `italian`, `norwegian`, `portuguese`, `romanian`, `russian`, `spanish`, `swedish`, `tamil`, `turkish`. One analyzer drives BOTH BM25 retrieval AND the grounding scorer, so the two layers stay in lockstep (German `Bücher` ↔ `Buch`, French `manger` ↔ `mange`, etc.). Unknown names error rather than silently falling back.
`language="char_ngram"`	opt-in	The subword typo / short-token tier (CharNgramAnalyzer). Tokenizes into overlapping character n-grams so a transcription / OCR typo still matches with no model: `lays` and `1ays` share `ays`, `ys` . Default grams 3–5; tune with `language="char_ngram:2-4"`. A recall booster for short, noisy queries (dealer orders, voice/OCR, catalogs), not a prose default — its clean set-coverage erodes at scale, so pair it with word-BM25 rather than replacing it. See the catalog & noisy-query guide.

For a custom analyzer (e.g. CJK word-segmentation), implement the crate::analyzer::Analyzer trait in Rust and pass it via Document::with_analyzer(Arc::new(MyAnalyzer)). Python and Node only expose the string-routed view.

Per-field BM25 weighting (added in 0.4.x):

param	default	what it does
`bm25_field_weights`	equal weight (`[1, 1, 1]`)	Per-field BM25 query boosts as `[text, source, heading]`. The default is bit-for-bit the equal-weight behavior (a `1.0` weight is skipped before it reaches the index). A domain lever for near-duplicate / title-heavy corpora: boost the field carrying the discriminating token. Not a universal win — on our synthetic catalog a boost on a field the near-duplicates share was inert, and over-boosting starves recall (CATALOG_REGIME Panel D). Sweep on a held-out set with your own eval; see Choosing a configuration → catalogs.

# Boost the heading field (e.g. brand/title) 2x; Node: { bm25FieldWeights: [1, 1, 2] }
doc = redhop.Document.from_text(catalog, options=redhop.DocumentOptions(bm25_field_weights=[1.0, 1.0, 2.0]))

`Document.from_chunks(chunks, ...)`

Takes the same assembly and retrieval options as from_text: you’ve already chunked, so the chunking params don’t apply. Each entry of chunks must be a typed redhop.Chunk(...) (new Chunk(...) in Node, redhop::core::Chunk::new(...) in Rust). Strings and plain dicts are no longer accepted as of 0.3.0. The typed constructor is what lets page / heading / line from your metadata reach ctx.citations. See Loaders → redhop.Chunk for the full field list.

`from_file` / `from_bytes` / `from_folder`

The file loaders (Loaders) take all the same options. from_folder adds recursive, gitignore, ignore, persist, and index_dir. See from_folder.

`doc.context(query, budget=None, neighbors=0, include_heading=False)`

param	default	what it does
`query`	—	the question
`budget`	the doc’s `token_budget`	per-call budget override: query-time, no re-indexing
`neighbors`	`0`	also include the N adjacent chunks on each side of every selected chunk (same file), see Structural expansion
`include_heading`	`False`	also include each selected chunk’s section heading

Also: doc.analyze(query) returns the same Decision Report without building a context (pure diagnostics), and doc.n_chunks / len(doc) (Node: doc.chunkCount, Rust: doc.len()) give the chunk count.

Structural expansion

A retrieved chunk often answers the question but drops you mid-section: the sentence before it set up the definition, the heading tells you which contract clause it is. neighbors= and include_heading= pull that structure back in, deterministically and with no model: they’re read straight from document order and headings.

ctx = doc.context("what notice is required to terminate?", neighbors=1, include_heading=True)

const ctx = doc.context("what notice is required to terminate?", undefined, 1, true);

let ctx = doc.context_expanded("what notice is required to terminate?", None, None, 1, true)?;

What makes this more than a ±1 hack is how it cooperates with the finite-attention core. A neighbor pulled in for continuity has little query overlap, so the normal distractor filter would throw it away. Instead, expansion chunks are justified by structure, not relevance: exempt from that filter, but still:

bounded by the token budget (companions fill only leftover space after the real evidence is placed),
deduplicated (overlapping windows merge),
emitted in document order, so each hit reads as a contiguous window rather than scattered fragments,
accounted for: they appear in ctx.citations (with their own heading/line) and as ctx.report.n_expanded, and the Decision Report prints a Structural expand: +N line.

Both default off at the per-call level, so plain context(query) is unchanged. Start with neighbors=1, include_heading=True for contracts/docs where surrounding context matters.

Auto-expansion defaults (`code_neighbors_default`, `prose_heading_default`)

The per-call neighbors= / include_heading= above are explicit user opt-ins. RedHop also auto-applies structural expansion in two cases via document-level defaults you set once at construction:

constructor kwarg	default	what it does
`code_neighbors_default`	`1`	When `context()` retrieves a chunk classified as code (`metadata["kind"]=="code"`), auto-pull this many adjacent chunks (i±1, …) so the function body arrives with its signature. Pass `0` to disable, or raise to `2`/`3` at loose budgets for further-reaching bodies. See CODE_NEIGHBORS_DEFAULT for the measured budget tradeoff.
`prose_heading_default`	`true`	When `context()` retrieves a prose chunk with a `metadata["heading"]`, auto-attach the section heading chunk. Measured +7pt ≥0.8 at typical budgets on heading-bearing content. Pass `false` to disable for memory-tight workloads. See PROSE_HEADING_DEFAULT.

Both surfaced from 0.3.3 onward on Python (from_text/from_file/from_bytes/ from_folder/from_chunks) and Node (as codeNeighborsDefault / proseHeadingDefault on the Options struct).

# Disable code-neighbor auto-pull (memory-tight code search)
doc = redhop.Document.from_file("src/auth.rs", options=redhop.DocumentOptions(code_neighbors_default=0))

# Disable heading auto-attach (categorical headings that don't help the query)
doc = redhop.Document.from_text(text, options=redhop.DocumentOptions(prose_heading_default=False))

const doc = Document.fromFile("src/auth.rs", { codeNeighborsDefault: 0 });
const dt = Document.fromText(text, { proseHeadingDefault: false });

// Set on DocumentConfig directly, or via redhop::LoadOptions
let mut cfg = redhop::DocumentConfig::default();
cfg.code_neighbors_default = 0;
cfg.prose_heading_default = false;

What `context(...)` returns

doc.context(query) returns a built context with everything you need to prompt a model and show your work:

on the result	what it gives
`ctx.text()`	the assembled context as one prompt string
`ctx.chunks`	the selected chunks’ text, in order
`ctx.citations`	provenance per selected chunk: `{source, page, heading, line, text}` (details)
`ctx.report`	the Decision Report: `auto_decision`, `total_tokens`, `retained_evidence_ratio`, …

Lower-level: `redhop.build_context(query, chunks, ...)`

Already have chunks and want to tune the assembly thresholds Document keeps fixed? The low-level API exposes them (Python. The same build_context / ContextConfig surface is re-exported from the redhop Rust crate):

param	default	what it does
`strategy`	`"reasoning_preserving"`	same strategy values as above
`token_budget`	`8192`	assembled-context budget
`distractor_min_grounding`	`0.10`	grounding bar below which a chunk is “junk”
`link_min_jaccard`	`0.12`	linkage at/above which a low-relevance chunk is rescued as a second hop
`auto_passthrough_max_tokens`	`1500`	`auto`: pass through at/below this size, prune above
`redundancy_max_cosine`	`0.92`	dedup threshold

analyze_context(...) and context_economics(...) give the diagnostics without assembling a context.

Next: Retrieval & context tips (the operational laws behind these defaults) · Benchmarks (every number, reproducible).

Options

Document.from_text(text, ...)

Document.from_chunks(chunks, ...)

from_file / from_bytes / from_folder

doc.context(query, budget=None, neighbors=0, include_heading=False)