Adjust/Improve pipeline to be compatible with more models#4

Draft
yuqiannemo wants to merge 13 commits into Xtra-Computing:dev from yuqiannemo:dna_dev/run_more_models

Conversation

@yuqiannemo
Contributor

@yuqiannemo yuqiannemo commented Mar 22, 2026

  1. Add more dependencies to support more models
  2. Protect sensitive info such as the OpenRouter API key

@JerryLife
Collaborator

Thanks for working on broader model compatibility here. Supporting more HF model families is a good direction, and the ModelLoader change addresses a real misclassification issue.

That said, I don't think this is ready to merge as-is because the dependency changes would make the base package less portable.

Moving these packages into project.dependencies introduces hard platform requirements for every pip install llm-dna user, even if they never use those model families:

  • mlx and mlx-lm are Apple Silicon-specific — this will break installation on Linux/Windows, which is where most of our users run the package
  • optimum, compressed-tensors, etc. are architecture-specific/transitive runtime deps rather than core package requirements

I think the right direction is:

  • Keep the base install minimal
  • Move platform/model-specific packages into optional dependency groups (e.g. apple, quantization)
  • Document which extras are needed for which model families
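As a sketch of what that split might look like (group names and package lists here are illustrative, not the project's actual pins):

```toml
# Sketch only — versions and group names are placeholders, not the
# project's verified dependency list.
[project.optional-dependencies]
apple = [
    "mlx; sys_platform == 'darwin'",
    "mlx-lm; sys_platform == 'darwin'",
]
quantization = [
    "optimum",
    "compressed-tensors",
]
```

Users on Apple Silicon would then run `pip install "llm-dna[apple]"`, while the base `pip install llm-dna` stays portable.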

On ModelLoader, the current fix works for openai/gpt-oss, but it's brittle because it carves out exceptions from a broad openai/gpt- rule — every new HuggingFace model under the openai/ namespace would need another exclusion. Instead, I'd suggest tightening the OpenRouter prefixes to be more specific:

openrouter_prefixes = [
    "openrouter/",
    "openrouter:",
    "anthropic/claude-",
    "deepseek/",
    "openai/gpt-3",
    "openai/gpt-4",
    "google/gemini-",
    "x-ai/grok-",
    "cohere/command",
    "perplexity/",
]

This way openai/gpt-3.5-turbo and openai/gpt-4o still route to OpenRouter, but openai/gpt-oss-* naturally falls through to HuggingFace without needing any exclusion logic.
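A minimal sketch of how that prefix check could behave (the helper name `routes_to_openrouter` is hypothetical — only the prefix list comes from the suggestion above):

```python
# Sketch of prefix-based routing; not the actual ModelLoader API.
OPENROUTER_PREFIXES = [
    "openrouter/",
    "openrouter:",
    "anthropic/claude-",
    "deepseek/",
    "openai/gpt-3",
    "openai/gpt-4",
    "google/gemini-",
    "x-ai/grok-",
    "cohere/command",
    "perplexity/",
]

def routes_to_openrouter(model_name: str) -> bool:
    """Return True if the model name matches a known OpenRouter prefix."""
    return any(model_name.startswith(p) for p in OPENROUTER_PREFIXES)
```

With this, `routes_to_openrouter("openai/gpt-4o")` is true, while `"openai/gpt-oss-20b"` matches no prefix and falls through to the HuggingFace path.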

One smaller process note: the PR description looks incomplete (2. is empty). Please expand that before merge so the packaging/dependency rationale is clear.

Net: I like the goal of the PR, but the dependency strategy needs to change before this can be merged.

@yuqiannemo
Contributor Author


Fixed

Collaborator

@JerryLife JerryLife left a comment


Thanks for the PR! The optional extras and sensitive info protection are great additions. A few issues to address before this is merge-ready:

Correctness

  1. Hardcoded relative paths will break when installed as a package

    • ModelLoader.py uses Path(__file__).parents[3] / "configs" / "openrouter_llm_list.jsonl" — this assumes a source-tree layout. When installed via pip install llm-dna, the configs/ directory won't exist at that relative path. Consider using importlib.resources or including the file as package data.
    • Same issue in cli.py with Path(__file__).parents[2] / ".env" — this path won't resolve correctly in an installed package. The .env loading should probably just use load_dotenv() (which searches CWD and parent dirs) without the hardcoded project root path.
  2. Missing openrouter_llm_list.jsonl

    • The diff references configs/openrouter_llm_list.jsonl but the file doesn't appear in the diff. Is it committed on the branch? If so, it also needs to be included as package data in pyproject.toml to be available at install time.
  3. Redundant inference in single-model response caching (api.py)

    • The new block at L645-672 calls _generate_responses_for_model() again even though inference was already performed above for DNA extraction. This effectively doubles the computation cost for single-model mode. The responses from the earlier inference step should be reused instead of regenerated.
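For the path issue in point 1, a hedged sketch of the `importlib.resources` approach — the package name and resource layout below are assumptions about the project, not verified:

```python
# Sketch: resolve package data via importlib.resources instead of
# Path(__file__).parents[...]. Package/resource names are assumptions.
import json
from importlib import resources

def load_openrouter_models(package: str = "llm_dna") -> list[dict]:
    """Read a JSONL model list shipped as package data."""
    path = resources.files(package) / "configs" / "openrouter_llm_list.jsonl"
    text = path.read_text(encoding="utf-8")
    return [json.loads(line) for line in text.splitlines() if line.strip()]
```

This resolves correctly whether the package is installed from a wheel or run from a source tree, provided the JSONL file is declared as package data in `pyproject.toml`.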

Improvements

  1. [full] extras should reference sub-extras
    Instead of duplicating all packages:

    full = [
        "llm-dna[apple]",
        "llm-dna[quantization]",
        "llm-dna[model_families]",
    ]

    This avoids needing to keep version pins in sync across two places.

  2. Platform-specific markers
    mlx / mlx-lm only work on macOS Apple Silicon, and mamba-ssm requires CUDA. Consider adding environment markers:

    apple = [
        "mlx>=0.10.0; sys_platform == 'darwin'",
        "mlx-lm; sys_platform == 'darwin'",
    ]
  3. No tests for new model detection logic
    The OpenRouter model detection via JSONL lookup is a significant behavioral change — a unit test would help prevent regressions.
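As a starting point, a self-contained test sketch for the JSONL-lookup path — the JSONL schema (an `"id"` field per line) and the helper are assumptions, not the project's actual code:

```python
# Hypothetical regression test for JSONL-based OpenRouter detection.
# The schema and helper below are assumptions for illustration.
import io
import json

SAMPLE_JSONL = "\n".join([
    json.dumps({"id": "openai/gpt-4o"}),
    json.dumps({"id": "anthropic/claude-3-5-sonnet"}),
])

def load_openrouter_ids(fp) -> set[str]:
    """Parse a JSONL model list into a set of model ids."""
    return {json.loads(line)["id"] for line in fp if line.strip()}

def test_known_model_detected():
    ids = load_openrouter_ids(io.StringIO(SAMPLE_JSONL))
    assert "openai/gpt-4o" in ids

def test_hf_model_not_detected():
    ids = load_openrouter_ids(io.StringIO(SAMPLE_JSONL))
    assert "openai/gpt-oss-20b" not in ids
```

Even two cases like these (one routed, one falling through to HuggingFace) would catch most regressions in the detection logic.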

Overall the direction is solid. Happy to help if you have questions on any of these points!
