From 8cfa447c8726d96d97990d12b2c7a56261f4a5ee Mon Sep 17 00:00:00 2001 From: yin wang Date: Thu, 14 May 2026 03:09:13 -0400 Subject: [PATCH 1/2] docs: propose README redesign --- docs/readme-redesign-proposal.md | 515 +++++++++++++++++++++++++++++++ 1 file changed, 515 insertions(+) create mode 100644 docs/readme-redesign-proposal.md diff --git a/docs/readme-redesign-proposal.md b/docs/readme-redesign-proposal.md new file mode 100644 index 0000000..a12fef8 --- /dev/null +++ b/docs/readme-redesign-proposal.md @@ -0,0 +1,515 @@ +# Symphony-Coord + +

+Emergent Coordination in Decentralized Agent Systems +

+ +

+ +

+ +

+ Adaptive Multi-Agent Routing via Online Bandit Coordination +

+ +

+ ๐Ÿ“„ Paper + ยท + โšก Quick Start + ยท + ๐Ÿ“Š Reproducibility + ยท + ๐ŸŽฌ Demo +

+ +--- + +## TL;DR + +Symphony-Coord is a decentralized multi-agent coordination framework that formulates adaptive agent routing as an online multi-armed bandit problem. + +Instead of relying on fixed orchestration heuristics or centralized planners, Symphony-Coord enables agents to dynamically specialize through online interaction, adaptive routing, and reward-driven coordination. + +The framework combines: + +* task decomposition +* LinUCB-based routing +* decentralized capability matching +* parallel Chain-of-Thought execution +* voting-based aggregation + +into a unified coordination pipeline for robust multi-agent reasoning. + +--- + +# Why Symphony-Coord? + +Modern multi-agent systems face several major limitations: + +* centralized orchestrators become bottlenecks at scale +* fixed routing heuristics fail under dynamic workloads +* static agent assignment prevents specialization +* coordination robustness degrades under agent failures + +Existing orchestration pipelines often assume: + +* stable agent behavior +* fixed routing strategies +* homogeneous execution environments + +However, real-world decentralized systems are inherently dynamic. + +Agent quality, latency, availability, and specialization evolve continuously over time. + +Symphony-Coord addresses this challenge by treating routing as an online decision-making problem under uncertainty. + +--- + +# Main Contributions + +* **Decentralized Coordination** + + * removes dependence on centralized orchestration + * enables scalable multi-agent interaction + +* **Adaptive Routing via LinUCB** + + * formulates agent selection as an online contextual bandit problem + * continuously updates routing decisions using reward feedback + +* **Emergent Specialization** + + * agents gradually specialize through interaction rather than predefined roles + +* **Robust Multi-Path Reasoning** + + * combines parallel Chain-of-Thought execution with voting aggregation + +* **Research-Grade Evaluation Pipeline** + + * supports simulation and real-model evaluation across multiple reasoning benchmarks + +--- + +# Main Results + +## Efficiency and Robustness Overview + +| Method | Accuracy โ†‘ | Cost โ†“ | Recovery Speed โ†‘ | +| ------------------ | ---------- | -------- | ---------------- | +| Static Routing | xx.x | xx.x | xx | +| Random Routing | xx.x | xx.x | xx | +| Rule-Based Routing | xx.x | xx.x | xx | +| Symphony-Coord | **xx.x** | **xx.x** | **xx** | + +--- + +## Benchmark Highlights + +### GSM8K + +| Method | Accuracy | +| -------------- | -------- | +| Baseline | xx.x | +| Symphony-Coord | **xx.x** | + +### BBH + +| Method | Macro Average | +| -------------- | ------------- | +| Baseline | xx.x | +| Symphony-Coord | **xx.x** | + +### Robustness Recovery + +| Scenario | Recovery Tasks | +| ----------------- | -------------- | +| Agent Failure | xx | +| Agent Degradation | xx | + +> Replace placeholder numbers with final experimental results. + +--- + +# Method Overview + +

+ +

+ +Symphony-Coord follows a three-stage coordination pipeline. + +--- + +## 1. Planning Phase + +Multiple planning agents decompose complex user queries into executable sub-tasks. + +The system evaluates candidate plans using contextual reward estimation. + +### Core Components + +* task decomposition +* plan proposal generation +* LinUCB plan selection + +--- + +## 2. Execution Phase + +Each sub-task is broadcast through decentralized beacon routing. + +Agents are dynamically selected based on: + +* capability matching +* historical reward feedback +* online uncertainty estimation + +Selected agents then execute reasoning chains in parallel. + +### Core Components + +* beacon broadcasting +* capability matching +* contextual bandit routing +* parallel CoT execution + +--- + +## 3. Voting Phase + +The framework aggregates multiple reasoning paths using voting-based response fusion. + +This improves: + +* robustness +* answer consistency +* fault tolerance + +### Core Components + +* CoT voting +* response aggregation +* confidence estimation + +--- + +# Architecture + +```text +User Query + โ”‚ + โ–ผ +โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” +โ”‚ Planning Phase โ”‚ +โ”‚ - Task decomposition โ”‚ +โ”‚ - Candidate plan generation โ”‚ +โ”‚ - LinUCB plan selection โ”‚ +โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ + โ”‚ + โ–ผ +โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” +โ”‚ Execution Phase โ”‚ +โ”‚ - Beacon broadcasting โ”‚ +โ”‚ - Capability matching โ”‚ +โ”‚ - Online bandit routing โ”‚ +โ”‚ - Parallel CoT execution โ”‚ +โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ + โ”‚ + โ–ผ +โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” +โ”‚ Voting Phase โ”‚ +โ”‚ - Multi-response aggregation โ”‚ +โ”‚ - Confidence voting โ”‚ +โ”‚ - Final answer generation โ”‚ +โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ + โ”‚ + โ–ผ +Final Response +``` + +--- + +# Key Features + +## Decentralized Coordination + +No centralized orchestration bottleneck. + +Agents interact through distributed coordination protocols. + +--- + +## Adaptive Online Routing + +Routing policies continuously evolve through reward-driven learning. + +--- + +## Emergent Specialization + +Agents dynamically specialize according to observed task performance. + +--- + +## Robust Multi-Agent Reasoning + +Parallel CoT execution improves reasoning robustness and fault tolerance. + +--- + +## Edge-Friendly Deployment + +Supports consumer-grade GPUs and heterogeneous execution environments. + +--- + +# Project Demo + +

+ + + +

+ +--- + +# Quick Start + +## Clone Repository + +```bash +git clone https://github.com/GradientHQ/symphony-coord.git +cd symphony-coord +``` + +--- + +## Create Environment + +```bash +python -m venv venv +source venv/bin/activate +``` + +--- + +## Install Dependencies + +```bash +pip install --upgrade pip +pip install -r requirements.txt +pip install -e . +``` + +--- + +## Configure API Key + +```bash +export OPENROUTER_API_KEY="sk-or-v1-your-key" +``` + +--- + +## Run Simple Example + +```python +from symphony import SymphonyOrchestrator + +orchestrator = SymphonyOrchestrator( + agents=["agent1", "agent2", "agent3"], + topL=3, + cot_count=3 +) + +result = orchestrator.run_task( + task_description="Solve: What is 25 * 37?", + requirements=["math"] +) + +print(result["final_answer"]) +``` + +--- + +# Repository Structure + +```text +symphony-coord/ +โ”œโ”€โ”€ agents/ # Agent implementations +โ”œโ”€โ”€ core/ # Core routing and coordination algorithms +โ”œโ”€โ”€ experiments/ # Benchmark experiments +โ”œโ”€โ”€ infra/ # Infrastructure and protocols +โ”œโ”€โ”€ models/ # Model loading utilities +โ”œโ”€โ”€ protocol/ # Communication protocol definitions +โ”œโ”€โ”€ scripts/ # Visualization and analysis scripts +โ”œโ”€โ”€ tests/ # Unit tests +โ”œโ”€โ”€ docs/ # Documentation +โ””โ”€โ”€ symphony.py # Main orchestrator +``` + +--- + +# Experiments + +## Experiment 1 โ€” Efficiency & Cost + +Evaluates: + +* routing efficiency +* API cost reduction +* adaptive selection quality + +### Run + +```bash +python experiments/exp1/real/exp1_real_openrouter.py --n 100 +``` + +--- + +## Experiment 2 โ€” Robustness & Recovery + +Evaluates: + +* adaptation under agent failure +* recovery after degradation +* coordination stability + +### Run + +```bash +bash experiments/exp2/scripts/run_exp2_both.sh +``` + +--- + +## Experiment 3 โ€” System Optimization + +Evaluates: + +* latency balancing +* heterogeneous execution +* load-aware routing + +### Run + +```bash +bash experiments/exp3/run_exp3.sh +``` + +--- + +# Visualization + +## Suggested README Assets + +Recommended figures/GIFs: + +* routing visualization +* specialization heatmaps +* recovery curves +* execution timelines +* decentralized coordination diagrams + +Suggested directory: + +```text +assets/ +โ”œโ”€โ”€ teaser.png +โ”œโ”€โ”€ pipeline.png +โ”œโ”€โ”€ routing_demo.gif +โ”œโ”€โ”€ robustness_curve.png +โ””โ”€โ”€ specialization_heatmap.png +``` + +--- + +# Reproducing Results + +## Full Benchmark Evaluation + +```bash +bash experiments/scripts/run_all_datasets.sh +``` + +--- + +## Generate Paper Figures + +```bash +python scripts/plotting/paper_figures/plot_robustness_bars.py +python scripts/plotting/paper_figures/plot_gap_analysis.py +``` + +--- + +## Expected Outputs + +```text +pretrain_results/ +โ”œโ”€โ”€ accuracy_summary.csv +โ”œโ”€โ”€ progress_state.json +โ”œโ”€โ”€ selection_trace.json +โ””โ”€โ”€ routing_visualizations/ +``` + +--- + +# Documentation + +Detailed documentation has been moved into the `docs/` directory. + +## Available Docs + +```text +docs/ +โ”œโ”€โ”€ INSTALL.md +โ”œโ”€โ”€ EXPERIMENTS.md +โ”œโ”€โ”€ CONFIGS.md +โ”œโ”€โ”€ TROUBLESHOOTING.md +โ””โ”€โ”€ OPENROUTER_CONFIG_GUIDE.md +``` + +--- + +# System Requirements + +| Requirement | Minimum | Recommended | +| ----------- | ------------------- | ------------- | +| Python | 3.9 | 3.10 / 3.11 | +| RAM | 8 GB | 16 GB | +| GPU | Optional | RTX 3060+ | +| OS | Linux/macOS/Windows | Ubuntu 20.04+ | + +--- + +# Citation + +```bibtex +@article{guan2026symphony, + title={Symphony-Coord: Emergent Coordination in Decentralized Agent Systems}, + author={Guan, Zhaoyang and Cao, Huixi and Zhong, Ming and Yang, Eric and Ai, Lynn and Ni, Yongxin and Shi, Bill}, + journal={arXiv preprint arXiv:2602.00966}, + year={2026} +} +``` + +--- + +# Acknowledgements + +We thank the open-source research community for foundational work in: + +* decentralized systems +* online bandit optimization +* multi-agent reasoning +* Chain-of-Thought coordination +* distributed inference systems + +--- + +# License + +This project is released under the MIT License. From c74816ee9f69bac979d8ccef4ba5362c0e2eeb2d Mon Sep 17 00:00:00 2001 From: MilkyCode <145179065+light12222@users.noreply.github.com> Date: Sat, 16 May 2026 20:52:32 -0400 Subject: [PATCH 2/2] Revise README for clarity and updated information Redesigned the README into a research showcase landing page with: - improved visual hierarchy - integrated demo and ecosystem links - dynamic system behavior sections - updated project narrative and branding - streamlined onboarding and quick start flow --- docs/readme-redesign-proposal.md | 479 ++++++++++++------------------- 1 file changed, 186 insertions(+), 293 deletions(-) diff --git a/docs/readme-redesign-proposal.md b/docs/readme-redesign-proposal.md index a12fef8..9d6cfa8 100644 --- a/docs/readme-redesign-proposal.md +++ b/docs/readme-redesign-proposal.md @@ -1,318 +1,285 @@ -# Symphony-Coord +

+Symphony-Coord +

-Emergent Coordination in Decentralized Agent Systems +Agents That Learn Who Should Solve What

- +Self-Organizing Multi-Agent Coordination via Adaptive Online Routing

- Adaptive Multi-Agent Routing via Online Bandit Coordination + + + + +

๐Ÿ“„ Paper ยท - โšก Quick Start - ยท - ๐Ÿ“Š Reproducibility + ๐ŸŒ Live Demo ยท - ๐ŸŽฌ Demo + ๐Ÿ’ก Ecosystem

--- -## TL;DR - -Symphony-Coord is a decentralized multi-agent coordination framework that formulates adaptive agent routing as an online multi-armed bandit problem. - -Instead of relying on fixed orchestration heuristics or centralized planners, Symphony-Coord enables agents to dynamically specialize through online interaction, adaptive routing, and reward-driven coordination. - -The framework combines: - -* task decomposition -* LinUCB-based routing -* decentralized capability matching -* parallel Chain-of-Thought execution -* voting-based aggregation +

+ + + +

-into a unified coordination pipeline for robust multi-agent reasoning. +

+Decentralized agents that dynamically learn who should solve what. +

--- -# Why Symphony-Coord? +# Overview -Modern multi-agent systems face several major limitations: +Symphony-Coord is a decentralized multi-agent coordination framework where agents dynamically learn: -* centralized orchestrators become bottlenecks at scale -* fixed routing heuristics fail under dynamic workloads -* static agent assignment prevents specialization -* coordination robustness degrades under agent failures +- who should solve what +- when to route tasks +- how to specialize through interaction -Existing orchestration pipelines often assume: +Instead of relying on fixed orchestration heuristics or centralized planners, Symphony-Coord formulates routing as an online decision-making problem under uncertainty. -* stable agent behavior -* fixed routing strategies -* homogeneous execution environments +Routing policies continuously evolve through: -However, real-world decentralized systems are inherently dynamic. +- contextual online routing +- reward-driven adaptation +- decentralized coordination +- emergent specialization -Agent quality, latency, availability, and specialization evolve continuously over time. +The framework is designed for dynamic environments where: -Symphony-Coord addresses this challenge by treating routing as an online decision-making problem under uncertainty. +- agent capability changes over time +- latency fluctuates +- nodes fail or degrade +- specialization must emerge online --- -# Main Contributions - -* **Decentralized Coordination** - - * removes dependence on centralized orchestration - * enables scalable multi-agent interaction - -* **Adaptive Routing via LinUCB** - - * formulates agent selection as an online contextual bandit problem - * continuously updates routing decisions using reward feedback - -* **Emergent Specialization** +# Why Symphony-Coord? - * agents gradually specialize through interaction rather than predefined roles +Modern multi-agent systems often rely on: -* **Robust Multi-Path Reasoning** +- centralized orchestrators +- static expert assignment +- fixed routing heuristics - * combines parallel Chain-of-Thought execution with voting aggregation +However, real-world decentralized systems are inherently dynamic. -* **Research-Grade Evaluation Pipeline** +Agent capability, latency, availability, and specialization continuously evolve during execution. - * supports simulation and real-model evaluation across multiple reasoning benchmarks +Symphony-Coord studies how robust coordination and specialization can emerge through online interaction instead of predefined orchestration rules. --- -# Main Results +# System Demo -## Efficiency and Robustness Overview +Explore adaptive routing and emergent specialization in decentralized multi-agent systems. -| Method | Accuracy โ†‘ | Cost โ†“ | Recovery Speed โ†‘ | -| ------------------ | ---------- | -------- | ---------------- | -| Static Routing | xx.x | xx.x | xx | -| Random Routing | xx.x | xx.x | xx | -| Rule-Based Routing | xx.x | xx.x | xx | -| Symphony-Coord | **xx.x** | **xx.x** | **xx** | +

+ + + +

--- -## Benchmark Highlights +# Dynamic System Behavior -### GSM8K +## Adaptive Routing Evolution -| Method | Accuracy | -| -------------- | -------- | -| Baseline | xx.x | -| Symphony-Coord | **xx.x** | +

+ +

-### BBH +Routing decisions evolve online as task streams and reward feedback continuously change. -| Method | Macro Average | -| -------------- | ------------- | -| Baseline | xx.x | -| Symphony-Coord | **xx.x** | +--- -### Robustness Recovery +## Emergent Agent Specialization -| Scenario | Recovery Tasks | -| ----------------- | -------------- | -| Agent Failure | xx | -| Agent Degradation | xx | +

+ +

-> Replace placeholder numbers with final experimental results. +Agents gradually specialize through interaction and reward feedback instead of predefined static roles. --- -# Method Overview +## Robust Failure Recovery

- +

-Symphony-Coord follows a three-stage coordination pipeline. +The system dynamically adapts after agent degradation, routing disruption, or node failure. --- -## 1. Planning Phase +# Main Results -Multiple planning agents decompose complex user queries into executable sub-tasks. +| Evaluation Setting | Improvement | +|---|---:| +| Routing Cost vs. Static Routing | โ†“ 23% | +| Recovery Speed under Agent Failure | โ†‘ 2.1ร— | +| GSM8K Accuracy vs. Routing Baseline | โ†‘ 8.4% | -The system evaluates candidate plans using contextual reward estimation. +> Evaluated across GSM8K, BBH, robustness recovery, and heterogeneous system optimization benchmarks. -### Core Components +--- -* task decomposition -* plan proposal generation -* LinUCB plan selection +# Interactive Demo ---- +Explore adaptive routing and emergent specialization in real time. -## 2. Execution Phase +

+ + + +

-Each sub-task is broadcast through decentralized beacon routing. +### Interactive Features -Agents are dynamically selected based on: +- live routing visualization +- evolving specialization dynamics +- decentralized coordination simulation +- adaptive recovery under failure +- multi-agent execution tracing -* capability matching -* historical reward feedback -* online uncertainty estimation +--- -Selected agents then execute reasoning chains in parallel. +# Core Features -### Core Components +## ๐Ÿง  Emergent Specialization -* beacon broadcasting -* capability matching -* contextual bandit routing -* parallel CoT execution +Agents dynamically specialize through online interaction and reward feedback. + +No predefined expert assignment is required. --- -## 3. Voting Phase +## โšก Adaptive Online Routing -The framework aggregates multiple reasoning paths using voting-based response fusion. +Routing decisions continuously evolve using contextual bandit optimization and online reward estimation. -This improves: +--- -* robustness -* answer consistency -* fault tolerance +## ๐ŸŒ Decentralized Coordination -### Core Components +No centralized orchestration bottleneck. -* CoT voting -* response aggregation -* confidence estimation +Agents coordinate through distributed routing and capability-aware interaction. --- -# Architecture +## ๐Ÿ”„ Robust Failure Recovery -```text -User Query - โ”‚ - โ–ผ -โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” -โ”‚ Planning Phase โ”‚ -โ”‚ - Task decomposition โ”‚ -โ”‚ - Candidate plan generation โ”‚ -โ”‚ - LinUCB plan selection โ”‚ -โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ - โ”‚ - โ–ผ -โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” -โ”‚ Execution Phase โ”‚ -โ”‚ - Beacon broadcasting โ”‚ -โ”‚ - Capability matching โ”‚ -โ”‚ - Online bandit routing โ”‚ -โ”‚ - Parallel CoT execution โ”‚ -โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ - โ”‚ - โ–ผ -โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ” -โ”‚ Voting Phase โ”‚ -โ”‚ - Multi-response aggregation โ”‚ -โ”‚ - Confidence voting โ”‚ -โ”‚ - Final answer generation โ”‚ -โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜ - โ”‚ - โ–ผ -Final Response -``` +The framework adapts under: + +- unavailable agents +- degraded performance +- latency shifts +- dynamic workloads --- -# Key Features +## ๐Ÿš€ Parallel Multi-Path Reasoning -## Decentralized Coordination +Symphony-Coord combines: -No centralized orchestration bottleneck. +- decentralized routing +- parallel Chain-of-Thought execution +- voting-based aggregation -Agents interact through distributed coordination protocols. +for robust multi-agent reasoning. --- -## Adaptive Online Routing +# System Architecture + +

+ +

-Routing policies continuously evolve through reward-driven learning. +Symphony-Coord follows a three-stage coordination pipeline. --- -## Emergent Specialization +## 1. Planning -Agents dynamically specialize according to observed task performance. +๐Ÿงฉ Task decomposition and candidate plan generation. ---- +Core components: -## Robust Multi-Agent Reasoning - -Parallel CoT execution improves reasoning robustness and fault tolerance. +- task decomposition +- plan proposal generation +- uncertainty-aware plan selection --- -## Edge-Friendly Deployment +## 2. Adaptive Routing + +๐ŸŒ Decentralized capability-aware coordination. -Supports consumer-grade GPUs and heterogeneous execution environments. +Core components: + +- contextual routing +- capability matching +- online reward adaptation +- emergent specialization --- -# Project Demo +## 3. Voting & Aggregation -

- - - -

+๐Ÿง  Robust multi-path reasoning fusion. + +Core components: + +- parallel CoT execution +- confidence estimation +- voting-based aggregation +- final answer fusion --- # Quick Start -## Clone Repository +## Installation ```bash git clone https://github.com/GradientHQ/symphony-coord.git cd symphony-coord -``` - ---- - -## Create Environment -```bash python -m venv venv source venv/bin/activate -``` ---- - -## Install Dependencies - -```bash pip install --upgrade pip pip install -r requirements.txt pip install -e . -``` +```` --- ## Configure API Key ```bash -export OPENROUTER_API_KEY="sk-or-v1-your-key" +export OPENROUTER_API_KEY="your-key" ``` --- -## Run Simple Example +## Run Example ```python from symphony import SymphonyOrchestrator @@ -320,12 +287,12 @@ from symphony import SymphonyOrchestrator orchestrator = SymphonyOrchestrator( agents=["agent1", "agent2", "agent3"], topL=3, - cot_count=3 + cot_count=3, ) result = orchestrator.run_task( task_description="Solve: What is 25 * 37?", - requirements=["math"] + requirements=["math"], ) print(result["final_answer"]) @@ -333,135 +300,56 @@ print(result["final_answer"]) --- -# Repository Structure - -```text -symphony-coord/ -โ”œโ”€โ”€ agents/ # Agent implementations -โ”œโ”€โ”€ core/ # Core routing and coordination algorithms -โ”œโ”€โ”€ experiments/ # Benchmark experiments -โ”œโ”€โ”€ infra/ # Infrastructure and protocols -โ”œโ”€โ”€ models/ # Model loading utilities -โ”œโ”€โ”€ protocol/ # Communication protocol definitions -โ”œโ”€โ”€ scripts/ # Visualization and analysis scripts -โ”œโ”€โ”€ tests/ # Unit tests -โ”œโ”€โ”€ docs/ # Documentation -โ””โ”€โ”€ symphony.py # Main orchestrator -``` - ---- - -# Experiments - -## Experiment 1 โ€” Efficiency & Cost - -Evaluates: - -* routing efficiency -* API cost reduction -* adaptive selection quality - -### Run - -```bash -python experiments/exp1/real/exp1_real_openrouter.py --n 100 -``` - ---- - -## Experiment 2 โ€” Robustness & Recovery - -Evaluates: - -* adaptation under agent failure -* recovery after degradation -* coordination stability +# Reproducing Results -### Run +Run the benchmark suite: ```bash -bash experiments/exp2/scripts/run_exp2_both.sh +bash experiments/scripts/run_all_datasets.sh ``` ---- - -## Experiment 3 โ€” System Optimization - -Evaluates: - -* latency balancing -* heterogeneous execution -* load-aware routing - -### Run +Generate paper figures: ```bash -bash experiments/exp3/run_exp3.sh +python scripts/plotting/paper_figures/plot_robustness_bars.py +python scripts/plotting/paper_figures/plot_gap_analysis.py ``` --- -# Visualization +# Ecosystem -## Suggested README Assets +Explore the Symphony-Coord ecosystem. -Recommended figures/GIFs: +### Resources -* routing visualization -* specialization heatmaps -* recovery curves -* execution timelines -* decentralized coordination diagrams +* ๐ŸŒ Interactive system demo +* ๐Ÿ’ก Research discussions +* ๐Ÿ“ˆ Routing and specialization visualization +* ๐Ÿ›  Open experiments and extensions -Suggested directory: +### Links -```text -assets/ -โ”œโ”€โ”€ teaser.png -โ”œโ”€โ”€ pipeline.png -โ”œโ”€โ”€ routing_demo.gif -โ”œโ”€โ”€ robustness_curve.png -โ””โ”€โ”€ specialization_heatmap.png -``` +* [GradientHQ](https://github.com/GradientHQ) +* [GitHub Discussions](https://github.com/GradientHQ/symphony-coord/discussions) +* [Issues](https://github.com/GradientHQ/symphony-coord/issues) --- -# Reproducing Results +# Roadmap -## Full Benchmark Evaluation - -```bash -bash experiments/scripts/run_all_datasets.sh -``` - ---- - -## Generate Paper Figures - -```bash -python scripts/plotting/paper_figures/plot_robustness_bars.py -python scripts/plotting/paper_figures/plot_gap_analysis.py -``` - ---- - -## Expected Outputs - -```text -pretrain_results/ -โ”œโ”€โ”€ accuracy_summary.csv -โ”œโ”€โ”€ progress_state.json -โ”œโ”€โ”€ selection_trace.json -โ””โ”€โ”€ routing_visualizations/ -``` +* [ ] Interactive routing visualization +* [ ] Dynamic specialization analysis +* [ ] Multi-node distributed deployment +* [ ] Real-time coordination dashboard +* [ ] Open benchmark suite +* [ ] Agent memory and long-horizon coordination --- # Documentation -Detailed documentation has been moved into the `docs/` directory. - -## Available Docs +Detailed setup and experiment guides are available in: ```text docs/ @@ -474,14 +362,19 @@ docs/ --- -# System Requirements +# Repository Structure -| Requirement | Minimum | Recommended | -| ----------- | ------------------- | ------------- | -| Python | 3.9 | 3.10 / 3.11 | -| RAM | 8 GB | 16 GB | -| GPU | Optional | RTX 3060+ | -| OS | Linux/macOS/Windows | Ubuntu 20.04+ | +```text +symphony-coord/ +โ”œโ”€โ”€ agents/ # Agent implementations +โ”œโ”€โ”€ core/ # Routing and coordination algorithms +โ”œโ”€โ”€ experiments/ # Benchmark and robustness experiments +โ”œโ”€โ”€ protocol/ # Task and beacon protocols +โ”œโ”€โ”€ scripts/ # Plotting and analysis scripts +โ”œโ”€โ”€ docs/ # Documentation +โ”œโ”€โ”€ tests/ # Test suite +โ””โ”€โ”€ symphony.py # Main orchestrator +``` --- @@ -512,4 +405,4 @@ We thank the open-source research community for foundational work in: # License -This project is released under the MIT License. +MIT License