Some checks failed
CI / CUDA type-check (push) Failing after 18s
CI / Format (push) Successful in 32s
build-prerelease / Resolve version stamps (push) Successful in 35s
CI / Test (push) Failing after 59s
CI / Clippy (push) Successful in 2m28s
CI / Build cortex SRPM (push) Has been skipped
CI / Publish cortex to COPR (push) Has been skipped
CI / Build neuron SRPM (push) Has been skipped
CI / Publish neuron to COPR (push) Has been skipped
CI / Bump version in source (push) Has been skipped
build-prerelease / Build cortex binary (push) Successful in 4m17s
build-prerelease / Build neuron-blackwell (push) Successful in 5m32s
build-prerelease / Package cortex RPM (push) Successful in 1m21s
build-prerelease / Build neuron-ampere (push) Successful in 7m50s
build-prerelease / Build neuron-ada (push) Successful in 5m55s
build-prerelease / Package helexa-neuron-ada RPM (push) Successful in 2m55s
build-prerelease / Package helexa-neuron-ampere RPM (push) Successful in 3m2s
build-prerelease / Package helexa-neuron-blackwell RPM (push) Successful in 3m52s
build-prerelease / Publish to rpm.lair.cafe (unstable) (push) Successful in 1m4s
Stage 6b. Third provider impl, completing the wire-format trio
(openai-chat, openai-responses, anthropic-messages). Lets a
helexa-acp endpoint configured with `wire_api = "anthropic-messages"`
drive Claude models — either against Anthropic directly or via
cortex's /v1/messages translation surface.
## Encoder (CompletionRequest → Anthropic body)
- System messages flatten to the top-level `system` field
(concatenated with blank lines when there are multiple).
- User text → `{role:"user", content:"..."}`.
- User MultiPart (text + images) → `content` array with Anthropic's
distinct image shape: `{type:"image", source:{type:"base64",
media_type, data}}` — structurally different from OpenAI's
`image_url` data URI.
- Assistant text → `{role:"assistant", content:"..."}`.
- Assistant tool_calls → `content` array with optional `{type:"text"}`
block plus one `{type:"tool_use", id, name, input:<parsed json>}`
per call. The internal arguments JSON string is parsed back to a
Value before encoding (Anthropic requires the parsed form);
malformed JSON falls back to a String input so the request body
still serialises.
- Tool result → `{role:"user", content:[{type:"tool_result",
tool_use_id, content}]}` per Anthropic's convention (no separate
`tool` role).
- `max_tokens` is required by Anthropic; defaults to 8192 when the
request doesn't specify.
## Decoder (Anthropic SSE → CompletionEvent)
Named SSE events:
- `message_start` → captures input_tokens from `usage` for the
eventual UsageStats.
- `content_block_start` (type=text) → TextDelta (initial text, if any).
- `content_block_start` (type=tool_use) → ToolCallStart; if a
pre-buffered `input` is present, also emits a single
ToolCallArgsDelta.
- `content_block_start` (type=thinking, for extended-thinking
models) → ReasoningDelta.
- `content_block_delta` (text_delta) → TextDelta.
- `content_block_delta` (input_json_delta) → ToolCallArgsDelta,
correlated by block index.
- `content_block_delta` (thinking_delta) → ReasoningDelta.
- `message_delta` → Usage (final output_tokens) + Finish with
stop_reason mapped: end_turn/stop_sequence → "stop", max_tokens
→ "length", tool_use → "tool_calls".
- `message_stop` → stream terminates.
- `ping` ignored (Anthropic's keep-alive).
- `error` → yields Err and ends the stream.
## Wiring
- Authentication: `x-api-key` + `anthropic-version: 2023-06-01`
headers (not Bearer). Both ship when api_key is configured;
servers that don't care (cortex) ignore them.
- `WireApi::AnthropicMessages` in build_provider now constructs
the provider instead of erroring "reserved for future".
- `provider::mod.rs` registers the new module.
18 new unit tests: encoder (system collapse, multi-system concat,
default max_tokens, multipart with image, tool_use blocks, tool
results, malformed JSON arg fallback), decoder (text streaming,
tool_use lifecycle, max_tokens→length mapping, empty deltas, ping
events, error events, cancellation, malformed payload skip,
thinking blocks).
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
146 lines
5.2 KiB
Rust
146 lines
5.2 KiB
Rust
//! helexa-acp — Agent Client Protocol bridge for multi-endpoint LLM
|
|
//! setups (helexa, LM Studio, Ollama, OpenRouter, OpenAI, Anthropic,
|
|
//! …) with a clean per-endpoint wire-format selector.
|
|
//!
|
|
//! Speaks ACP over stdio to an editor client (Zed today). Every
|
|
//! configured endpoint produces a wire-format-specific
|
|
//! [`provider::Provider`] implementation; the agent loop in
|
|
//! [`agent::Agent`] is provider-agnostic, so adding e.g. an Anthropic
|
|
//! /v1/messages provider doesn't touch `agent.rs`.
|
|
//!
|
|
//! Config: `$XDG_CONFIG_HOME/helexa-acp/config.toml` for the multi-
|
|
//! endpoint case; env vars (`HELEXA_ACP_BASE_URL`, etc.) for the
|
|
//! single-endpoint case when no config file exists.
|
|
|
|
use agent_client_protocol::{Result, Stdio};
|
|
use std::sync::Arc;
|
|
|
|
mod agent;
|
|
mod compaction;
|
|
mod config;
|
|
mod path_util;
|
|
mod prompt;
|
|
mod provider;
|
|
mod qwen3;
|
|
mod session;
|
|
mod store;
|
|
mod tool_runner;
|
|
mod tools;
|
|
|
|
use agent::Agent;
|
|
use config::{Config, EndpointConfig, WireApi};
|
|
use provider::{
|
|
Provider, anthropic_messages::AnthropicMessagesProvider, openai_chat::OpenAIChatProvider,
|
|
openai_responses::OpenAIResponsesProvider,
|
|
};
|
|
|
|
/// Set up tracing. Logs go to stderr by default — stdout is
|
|
/// reserved for the JSON-RPC stream. Setting `HELEXA_ACP_LOG_FILE`
|
|
/// to an absolute path appends logs to that file instead, which is
|
|
/// the practical way to capture debug output when the agent runs
|
|
/// under an editor (Zed, etc.) that doesn't surface stderr.
|
|
///
|
|
/// `RUST_LOG` still controls levels (e.g. `helexa_acp=debug`).
|
|
/// ANSI colours are auto-stripped when writing to a file so the log
|
|
/// is plain text.
|
|
fn init_tracing() {
|
|
let env_filter = tracing_subscriber::EnvFilter::try_from_default_env()
|
|
.unwrap_or_else(|_| tracing_subscriber::EnvFilter::new("info"));
|
|
|
|
let log_file = std::env::var("HELEXA_ACP_LOG_FILE")
|
|
.ok()
|
|
.filter(|s| !s.is_empty());
|
|
|
|
match log_file {
|
|
Some(path) => match std::fs::OpenOptions::new()
|
|
.create(true)
|
|
.append(true)
|
|
.open(&path)
|
|
{
|
|
Ok(file) => {
|
|
tracing_subscriber::fmt()
|
|
.with_writer(std::sync::Mutex::new(file))
|
|
.with_env_filter(env_filter)
|
|
.with_ansi(false)
|
|
.init();
|
|
}
|
|
Err(e) => {
|
|
// Fall back to stderr and shout. We don't want a
|
|
// typo'd log path to silence the agent entirely.
|
|
tracing_subscriber::fmt()
|
|
.with_writer(std::io::stderr)
|
|
.with_env_filter(env_filter)
|
|
.init();
|
|
tracing::warn!(
|
|
path = %path,
|
|
error = %e,
|
|
"HELEXA_ACP_LOG_FILE could not be opened; using stderr"
|
|
);
|
|
}
|
|
},
|
|
None => {
|
|
tracing_subscriber::fmt()
|
|
.with_writer(std::io::stderr)
|
|
.with_env_filter(env_filter)
|
|
.init();
|
|
}
|
|
}
|
|
}
|
|
|
|
/// Build a provider for `endpoint` according to its declared
|
|
/// `wire_api`. Future wire types (OpenAI Responses, Anthropic
|
|
/// /v1/messages, Ollama native) slot in here without changing the
|
|
/// caller.
|
|
fn build_provider(endpoint: EndpointConfig) -> anyhow::Result<Arc<dyn Provider>> {
|
|
match endpoint.wire_api {
|
|
WireApi::OpenAiChat => Ok(Arc::new(OpenAIChatProvider::new(endpoint)?)),
|
|
WireApi::OpenAiResponses => Ok(Arc::new(OpenAIResponsesProvider::new(endpoint)?)),
|
|
WireApi::AnthropicMessages => Ok(Arc::new(AnthropicMessagesProvider::new(endpoint)?)),
|
|
}
|
|
}
|
|
|
|
#[tokio::main]
|
|
async fn main() -> Result<()> {
|
|
init_tracing();
|
|
|
|
let cfg = Config::load()
|
|
.map_err(|e| agent_client_protocol::util::internal_error(format!("config: {e:#}")))?;
|
|
tracing::info!(
|
|
endpoints = cfg.endpoints.len(),
|
|
default_endpoint = %cfg.default_endpoint().name,
|
|
default_model = ?cfg.default_endpoint().default_model,
|
|
"helexa-acp starting"
|
|
);
|
|
|
|
// Build a provider for each configured endpoint up-front. Cheap —
|
|
// just sets up a reqwest::Client and resolves the API key — and
|
|
// surfaces config mistakes (missing API key env var, unsupported
|
|
// wire_api) before the editor even sends an initialize request.
|
|
let mut providers: Vec<Arc<dyn Provider>> = Vec::with_capacity(cfg.endpoints.len());
|
|
for endpoint in &cfg.endpoints {
|
|
match build_provider(endpoint.clone()) {
|
|
Ok(p) => {
|
|
tracing::info!(
|
|
endpoint = %endpoint.name,
|
|
base_url = %endpoint.base_url,
|
|
wire_api = ?endpoint.wire_api,
|
|
"registered provider"
|
|
);
|
|
providers.push(p);
|
|
}
|
|
Err(e) => {
|
|
tracing::warn!(
|
|
endpoint = %endpoint.name,
|
|
error = %format!("{e:#}"),
|
|
"skipping endpoint with invalid config"
|
|
);
|
|
}
|
|
}
|
|
}
|
|
|
|
let agent = Agent::new(&cfg, providers)
|
|
.await
|
|
.map_err(|e| agent_client_protocol::util::internal_error(format!("agent: {e:#}")))?;
|
|
agent.serve(Stdio::new()).await
|
|
}
|