feat: add origin ordering to brain search by EtanHey · Pull Request #518 · EtanHey/brainlayer

EtanHey · 2026-06-19T07:25:46Z

Summary

Add optional order for brain_search, with default relevance preserving existing hybrid relevance/recency behavior.
Add order="origin" to fetch a relevance-matched candidate set, sort by created_at ascending, and return the earliest requested chunks with date/order surfaced in output.
Thread the parameter through brain_search -> brain_recall(mode="search") -> handler dispatch, while bypassing warm helper routing for origin mode.

Test plan

pytest tests/test_search_handler.py::test_brain_search_origin_order_returns_oldest_matching_chunks_without_changing_default tests/test_3tool_aliases.py::TestBrainRecallSearchMode::test_search_mode_passes_order tests/test_search_filter_params.py::TestInputSchemaPresence::test_order_enum_values tests/test_search_filter_params.py::TestAliasResolution::test_order_passes_from_call_tool -q
pytest tests/test_search_handler.py tests/test_3tool_aliases.py tests/test_search_filter_params.py tests/test_mcp_labeled_field_output.py -q
pytest tests/test_mcp_warm_route.py tests/test_hybrid_search.py tests/test_phase3_qa.py -q
ruff check src/brainlayer/mcp/search_handler.py src/brainlayer/mcp/__init__.py src/brainlayer/mcp/_format.py tests/test_search_handler.py tests/test_3tool_aliases.py tests/test_search_filter_params.py
pytest --ignore=tests/test_vector_store.py --ignore=tests/test_engine.py --ignore=tests/test_enrichment_quality_benchmark.py --ignore=tests/test_eval_framework.py --ignore=tests/test_abcde_report.py --ignore=tests/test_enrichment_graders.py --ignore=tests/test_enrichment_llm_judge.py --ignore=tests/test_enrichment_gold.py --ignore=tests/test_experiment_store.py --ignore=tests/test_abcde_enrich_runner.py --ignore=tests/eval/phoenix_gate/test_phoenix_gate.py --ignore=tests/test_abcde_variants.py --ignore=tests/test_enrichment_judge.py -q (system Python broad non-eval; eval collection blocked by local NumPy 2.4 / numba compatibility)
BRAINLAYER_PREPUSH=1 BRAINLAYER_PREPUSH_SCOPE=changed-only ./scripts/run_tests.sh
Push-side pre-push hook: passed

Review notes

Local coderabbit review --agent hit the free OSS rate limit before producing findings; PR-level bot review requested after creation.

Note

Medium Risk
Touches MCP search dispatch and hybrid/KG ranking paths; default relevance behavior is preserved, but order=origin changes routing and result ordering in ways callers may rely on for “first mention” semantics within a bounded candidate pool.

Overview
Adds optional order on brain_search and brain_recall (mode=search): default relevance is unchanged; origin re-ranks an expanded hybrid candidate set (at least 100 matches) by created_at ascending and returns the earliest num_results, with order / order_scope in structured output and a short label in formatted results.

Origin skips warm helper routing and the smart/file shortcuts (timeline, regression, think/recall/current-context) so search stays on the hybrid path; file_path is applied as a source_file SQL LIKE filter via new source_file_filter_like plumbing in search_repo. Entity/KG hybrid search uses the same expansion and origin sort when filters allow entity routing.

Tests cover schema forwarding, default vs origin ordering, file-path filtering, entity route, and route bypasses.

^{Reviewed by Cursor Bugbot for commit 1419e37. Bugbot is set up for automated code reviews on this repo. Configure here.}

Note

Add origin ordering to brain search results

Adds an order parameter to brain_search and brain_recall (mode=search) accepting 'relevance' (default) or 'origin', where origin sorts by earliest created_at across an expanded candidate pool.
When order='origin', smart/file-based routing is bypassed, an expanded candidate set is fetched (up to _ORIGIN_CANDIDATE_LIMIT=100), and results are re-sorted by ascending created_at before truncating to the requested count.
Adds source_file_filter_like SQL LIKE filtering to hybrid_search, binary_knn_search, and kg_hybrid_search in search_repo.py, used to constrain origin-ordered results when a file path is inferred from the query.
Formatted output and structured responses include order and order_scope fields when origin ordering is active.
Behavioral Change: order='origin' disables helper/smart routing paths that are used for relevance ordering, so route selection behavior differs between the two modes.

^{Macroscope summarized 1419e37.}

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T07:26:01Z

@coderabbitai review

EtanHey · 2026-06-19T07:26:01Z

@codex review

EtanHey · 2026-06-19T07:26:01Z

@cursor @BugBot review

cursor · 2026-06-19T07:26:04Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

coderabbitai · 2026-06-19T07:26:08Z

Warning

Review limit reached

@EtanHey, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 48 minutes and 58 seconds. Learn how PR review limits work.

Your organization has used up its prepaid credits, and credit purchases are no longer available. Enable the review add-on in the billing tab to keep reviews running — you're only billed for reviews past your plan's rate limits ($0.25/file).

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based credits.

🚦 How do rate limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan refill rate.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, the refill rate gradually slows as usage increases. The highest same-day bursts are limited more strictly.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 8fd1a705-2a05-4844-8b9c-5fa3dabecf4c

📥 Commits

Reviewing files that changed from the base of the PR and between 829029a and 1419e37.

📒 Files selected for processing (7)

src/brainlayer/mcp/__init__.py
src/brainlayer/mcp/_format.py
src/brainlayer/mcp/search_handler.py
src/brainlayer/search_repo.py
tests/test_3tool_aliases.py
tests/test_search_filter_params.py
tests/test_search_handler.py

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/brain-search-find-origin

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai · 2026-06-19T07:26:10Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 214cc6a9b0

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-19T07:29:59Z

        date_to=date_to,
        sentiment=sentiment,
        detail=detail,
+        order=order,


Honor origin ordering for KG entity hits

When a query detects a known KG entity, _brain_search_dispatch returns from the entity branch before reaching this _search(..., order=order) fallback. In that scenario order="origin" is accepted by validation, but kg_hybrid_search still receives only n_results=num_results and the formatted chunks stay in KG/relevance order, so entity queries such as brain_search(query="Avi Simon", order="origin") silently ignore the requested ordering instead of returning the earliest matching memories.

Useful? React with 👍 / 👎.

Fixed in 7db7533. The KG/entity route now expands origin candidates, sorts chunk hits by created_at before formatting, and has regression coverage in tests/test_search_handler.py::test_brain_search_origin_order_sorts_entity_route_chunks.

chatgpt-codex-connector · 2026-06-19T07:29:59Z



+def _origin_candidate_count(num_results: int) -> int:
+    return min(_MAX_PUBLIC_NUM_RESULTS, max(num_results, _ORIGIN_CANDIDATE_LIMIT))


Select origin results from all matches

For order="origin", this caps the candidate set at 100 hybrid-ranked results before sorting by created_at. On broad queries with more than 100 matching chunks, older exact/filtered matches outside the top relevance candidates are never considered, so the tool can return later memories while claiming to return the earliest matching chunks; this is especially likely for common project or entity terms in a large BrainLayer DB.

Useful? React with 👍 / 👎.

Intentionally kept this scoped to the relevance-matched hybrid candidate set. The task explicitly asked for a thin layer over existing hybrid_search candidates and not an unbounded/date-indexed search path. In 38a1704 I tightened the schema wording to "earliest hybrid-matched chunks" so clients do not read origin mode as an exhaustive all-DB oldest-match scan.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T07:35:57Z

@cursor @BugBot re-review

EtanHey · 2026-06-19T07:35:57Z

@codex review

cursor · 2026-06-19T07:36:00Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7db753371f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-19T07:39:23Z



+def _origin_candidate_count(num_results: int) -> int:
+    return min(_MAX_PUBLIC_NUM_RESULTS, max(num_results, _ORIGIN_CANDIDATE_LIMIT))


Return origin order from all matching candidates

When order="origin" is used on a query with more than 100 matching chunks, this cap means _search asks hybrid_search for only 100 relevance/MMR-ranked results and then _sort_hybrid_results_by_origin sorts that already-truncated set. Older matching chunks outside the top 100 relevance candidates are never considered, so the advertised “earliest matching chunks by created_at” can skip the actual origin memories on large projects; origin mode needs a date-ordered lookup or a larger/untruncated candidate path before applying num_results.

Useful? React with 👍 / 👎.

Intentionally kept this scoped to the relevance-matched hybrid candidate set. The task explicitly asked for a thin layer over existing hybrid_search candidates and not an unbounded/date-indexed search path. In 38a1704 I tightened the schema wording to "earliest hybrid-matched chunks" so clients do not read origin mode as an exhaustive all-DB oldest-match scan.

chatgpt-codex-connector · 2026-06-19T07:39:23Z

                entity_id=arguments.get("entity_id"),
                agent_id=arguments.get("agent_id"),
                num_results=arguments.get("num_results", 5),
+                order=arguments.get("order", "relevance"),


Expose order in the brain_recall schema

This forwards order through the direct brain_recall(mode="search") path, but the brain_recall tool schema above still has no order property. MCP clients build and often validate calls from list_tools, so brain_recall(mode="search", order="origin") is undiscoverable or rejected even though the handler and tests now support it; add the same enum/default schema entry used by brain_search.

Useful? React with 👍 / 👎.

Fixed in 38a1704. brain_recall(mode="search") now exposes the same order enum/default as brain_search, with schema coverage in tests/test_3tool_aliases.py::TestToolsListBackwardCompat::test_brain_recall_search_schema_has_order_param.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T08:03:29Z

@cursor @BugBot re-review

EtanHey · 2026-06-19T08:03:29Z

@codex review

cursor · 2026-06-19T08:03:32Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 38a170465b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-19T08:08:32Z

    try:
        if (
            allow_helper_route
+            and order == "relevance"


Honor origin order through smart routing

When order='origin' is requested, this only bypasses the warm helper; the normal dispatch path below still routes queries matching _query_signals_think/_query_signals_recall (for example, how did I implement auth or history of auth) to _think/_recall, where order is never applied. Those calls now accept order='origin' but silently return default-ordered non-origin output, so origin mode should either bypass these smart routes or implement equivalent ordering there.

Useful? React with 👍 / 👎.

Fixed in 86125a8. order="origin" now bypasses the current-context, think, and recall smart routes and falls through to the search/origin ordering path; regression coverage is in tests/test_search_handler.py::test_brain_search_origin_order_bypasses_smart_routes.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T08:22:55Z

@codex review

EtanHey · 2026-06-19T08:22:55Z

@cursor @BugBot re-review

cursor · 2026-06-19T08:22:57Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

EtanHey · 2026-06-19T08:23:23Z

@coderabbitai review

coderabbitai · 2026-06-19T08:26:14Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 86125a8f4c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-19T08:26:39Z



+def _origin_candidate_count(num_results: int) -> int:
+    return min(_MAX_PUBLIC_NUM_RESULTS, max(num_results, _ORIGIN_CANDIDATE_LIMIT))


Avoid truncating origin candidates before date ordering

When order="origin" is requested, this helper caps the hybrid query to 100 relevance-ranked results before _sort_hybrid_results_by_origin sorts by created_at. For queries with more than 100 matching chunks, any older match ranked outside the top 100 by relevance/MMR is never considered, so the tool can return a newer chunk even though the schema promises the earliest hybrid-matched chunks by created_at. This is most visible for broad project/history searches in the large BrainLayer DB; the origin path needs to order across the full filtered match set or clearly make the result approximate.

Useful? React with 👍 / 👎.

Clarified in a2f1989. The schema, text output, and structured response now say origin mode is scoped to the expanded relevance-ranked hybrid candidate set (order_scope="expanded_hybrid_candidates"), matching the requested thin layer over hybrid_search candidates rather than promising an exhaustive all-DB oldest-match scan.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T08:43:23Z

@codex review

EtanHey · 2026-06-19T08:43:28Z

@cursor @BugBot re-review

cursor · 2026-06-19T08:43:31Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

EtanHey · 2026-06-19T08:43:31Z

@coderabbitai review

coderabbitai · 2026-06-19T08:43:37Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

chatgpt-codex-connector · 2026-06-19T08:49:23Z

Codex Review: Didn't find any major issues. Chef's kiss.

Reviewed commit: a2f1989578

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

EtanHey · 2026-06-19T09:05:00Z

@codex review

EtanHey · 2026-06-19T09:05:04Z

@cursor @BugBot re-review

cursor · 2026-06-19T09:05:08Z

You need to increase your spend limit or enable usage-based billing to run background agents. Go to Cursor

EtanHey · 2026-06-19T09:05:08Z

@coderabbitai review

coderabbitai · 2026-06-19T09:05:15Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 1419e37. Configure here.}

cursor · 2026-06-19T09:06:25Z

                structured["kg_degrade_reason"] = kg_degrade_reason
            formatted_text = format_kg_search(entity_name, structured_results, fact_items, query)
+            if order == "origin":
+                formatted_text += f"\n{_ORIGIN_ORDER_LABEL}"


Origin label without chunk results

Low Severity

On the entity KG early-return path, order="origin" always adds order, order_scope, and the origin text banner whenever that mode is requested, even when structured_results is empty and the payload is only SQL facts. That implies hybrid chunks were origin-sorted when none were returned.

^{Reviewed by Cursor Bugbot for commit 1419e37. Configure here.}

Acknowledged. I am leaving this unchanged because the order fields/banner reflect the requested brain_search mode, while the KG facts-only payload still reports total: 0 and results: [], so there are no chunk dates or sorted chunk rows implied in the structured data. This is a low-severity presentation nuance, not a correctness issue, and the task explicitly asked origin mode to surface the chosen order in output.

chatgpt-codex-connector · 2026-06-19T09:10:52Z

Codex Review: Didn't find any major issues. Delightful!

Reviewed commit: 1419e37b03

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

feat: add origin ordering to brain search

214cc6a

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

cursor Bot reviewed Jun 19, 2026

View reviewed changes

Comment thread src/brainlayer/mcp/search_handler.py

chatgpt-codex-connector Bot reviewed Jun 19, 2026

View reviewed changes

fix: apply origin order to entity search

7db7533

macroscopeapp Bot reviewed Jun 19, 2026

View reviewed changes

Comment thread src/brainlayer/mcp/search_handler.py

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

test: format origin search tests

0da9385

chatgpt-codex-connector Bot reviewed Jun 19, 2026

View reviewed changes

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

fix: expose origin order in recall schema

38a1704

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Jun 19, 2026

View reviewed changes

fix: bypass smart routes for origin search

86125a8

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Jun 19, 2026

View reviewed changes

docs: clarify origin search candidate scope

a2f1989

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

cursor Bot reviewed Jun 19, 2026

View reviewed changes

Comment thread src/brainlayer/mcp/search_handler.py

fix: honor origin order for file routes

1419e37

greptile-apps Bot reviewed Jun 19, 2026

View reviewed changes

cursor Bot reviewed Jun 19, 2026

View reviewed changes

EtanHey merged commit 20c159e into main Jun 19, 2026
7 checks passed



		def _origin_candidate_count(num_results: int) -> int:
		return min(_MAX_PUBLIC_NUM_RESULTS, max(num_results, _ORIGIN_CANDIDATE_LIMIT))

Conversation

EtanHey commented Jun 19, 2026 • edited by macroscopeapp Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Review notes

Add origin ordering to brain search results

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

cursor Bot commented Jun 19, 2026

Uh oh!

coderabbitai Bot commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Uh oh!

coderabbitai Bot commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

EtanHey Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

EtanHey Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

cursor Bot commented Jun 19, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

EtanHey Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

EtanHey Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot left a comment

Choose a reason for hiding this comment

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

EtanHey commented Jun 19, 2026

Uh oh!

cursor Bot commented Jun 19, 2026

EtanHey commented Jun 19, 2026 •

edited by macroscopeapp Bot

Loading

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading

coderabbitai Bot commented Jun 19, 2026 •

edited

Loading