cortex

helexa/cortex

rob thijssen 302ccfb982

build-prerelease / Resolve version stamps (push) Successful in 31s
CI / Format (push) Successful in 38s
CI / Clippy (push) Successful in 3m28s
build-prerelease / Build neuron-blackwell (push) Failing after 6m4s
build-prerelease / Build neuron-ampere (push) Failing after 7m20s
CI / Test (push) Successful in 7m29s
CI / Build cortex SRPM (push) Has been skipped
CI / Publish cortex to COPR (push) Has been skipped
CI / Build neuron SRPM (push) Has been skipped
CI / Publish neuron to COPR (push) Has been skipped
CI / Bump version in source (push) Has been skipped
build-prerelease / Build neuron-ada (push) Failing after 4m57s
build-prerelease / Package helexa-neuron-ada RPM (push) Has been skipped
build-prerelease / Package helexa-neuron-ampere RPM (push) Has been skipped
build-prerelease / Package helexa-neuron-blackwell RPM (push) Has been skipped
build-prerelease / Build cortex binary (push) Successful in 4m19s
build-prerelease / Package cortex RPM (push) Successful in 1m24s
build-prerelease / Publish to rpm.lair.cafe (unstable) (push) Has been skipped
refactor(neuron): introduce InferenceEvent + wire projection layer

Step 1 of the OpenAI Responses API rollout. Pure refactor — no new
endpoints, no behaviour change on the wire. Lays the seam for
emitting Responses-shaped streaming events from the same harness
output as chat completions in Step 2.

- New `neuron::wire` module tree:
  - `wire::event::InferenceEvent` — format-agnostic enum
    (Start, TextDelta, ReasoningDelta, Finish) the candle harness
    now emits as its native streaming currency.
  - `wire::event::FinishReason` — typed reason that maps cleanly
    onto OpenAI `finish_reason`, OpenAI Responses `status`, and
    Anthropic `stop_reason` strings.
  - `wire::openai_chat::project_chat_stream` — async task that
    consumes an InferenceEvent receiver and produces a
    ChatCompletionChunk receiver, stamping per-request metadata
    (id, created, model_id) onto every chunk. Output matches the
    pre-refactor wire shape bit-for-bit.

- candle.rs refactored to emit InferenceEvent on its internal
  channel through all three streaming paths (CPU
  run_inference_streaming, CUDA single-GPU stream_inference_via_worker,
  CUDA TP chat_completion_tp_stream). The streaming functions lost
  their id/created/model_id parameters since wire-format metadata
  now lives in the projector.

- emit_delta + emit_delta_blocking simplified to single-purpose
  TextDelta emitters with no wire-format coupling.

- chat_completion_stream wraps the InferenceEvent receiver in
  wire_chat::project_chat_stream before returning so the
  /v1/chat/completions HTTP handler keeps consuming
  ChatCompletionChunks unchanged. External signature preserved.
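
The event/projector split described in the bullets above can be
sketched roughly as follows. This is a minimal illustration, not the
code in this commit: it uses blocking `std::sync::mpsc` channels and a
thread instead of an async task, and `ChatChunk`, its field names, the
`FinishReason` variants, and the three mapping helpers are all
assumptions made for the example. Only the event variant names and the
`project_chat_stream` name come from the message itself.

```rust
use std::sync::mpsc::{channel, Receiver};
use std::thread;

/// Format-agnostic stream events. Variant names follow the commit
/// message; the payload types are assumed.
#[derive(Debug, Clone, PartialEq)]
enum InferenceEvent {
    Start,
    TextDelta(String),
    ReasoningDelta(String),
    Finish(FinishReason),
}

/// Typed finish reason (the variant set is an assumption).
#[derive(Debug, Clone, Copy, PartialEq)]
enum FinishReason {
    Stop,
    Length,
}

impl FinishReason {
    /// OpenAI chat completions `finish_reason`.
    fn as_openai_chat(self) -> &'static str {
        match self {
            FinishReason::Stop => "stop",
            FinishReason::Length => "length",
        }
    }
    /// OpenAI Responses `status`.
    fn as_openai_responses(self) -> &'static str {
        match self {
            FinishReason::Stop => "completed",
            FinishReason::Length => "incomplete",
        }
    }
    /// Anthropic `stop_reason`.
    fn as_anthropic(self) -> &'static str {
        match self {
            FinishReason::Stop => "end_turn",
            FinishReason::Length => "max_tokens",
        }
    }
}

/// Simplified stand-in for ChatCompletionChunk (hypothetical shape).
#[derive(Debug, Clone, PartialEq)]
struct ChatChunk {
    id: String,
    created: u64,
    model: String,
    role: Option<&'static str>,
    content: Option<String>,
    reasoning_content: Option<String>,
    finish_reason: Option<&'static str>,
}

/// Consume events and emit chunks, stamping the per-request metadata
/// onto every chunk so the producer never has to know about it.
fn project_chat_stream(
    events: Receiver<InferenceEvent>,
    id: String,
    created: u64,
    model: String,
) -> Receiver<ChatChunk> {
    let (tx, rx) = channel();
    thread::spawn(move || {
        for event in events {
            let mut chunk = ChatChunk {
                id: id.clone(),
                created,
                model: model.clone(),
                role: None,
                content: None,
                reasoning_content: None,
                finish_reason: None,
            };
            match event {
                InferenceEvent::Start => chunk.role = Some("assistant"),
                InferenceEvent::TextDelta(t) => chunk.content = Some(t),
                InferenceEvent::ReasoningDelta(t) => chunk.reasoning_content = Some(t),
                InferenceEvent::Finish(r) => chunk.finish_reason = Some(r.as_openai_chat()),
            }
            // Stop projecting once the consumer has gone away.
            if tx.send(chunk).is_err() {
                break;
            }
        }
    });
    rx
}

fn main() {
    let (tx, rx) = channel();
    let chunks = project_chat_stream(rx, "chatcmpl-demo".into(), 0, "demo-model".into());
    tx.send(InferenceEvent::Start).unwrap();
    tx.send(InferenceEvent::ReasoningDelta("thinking".into())).unwrap();
    tx.send(InferenceEvent::TextDelta("hello".into())).unwrap();
    tx.send(InferenceEvent::Finish(FinishReason::Stop)).unwrap();
    drop(tx); // closing the event channel ends the projection
    for chunk in chunks {
        println!("{chunk:?}");
    }
    let r = FinishReason::Length;
    println!("{} / {}", r.as_openai_responses(), r.as_anthropic());
}
```

Because the producer only ever sends `InferenceEvent`, a second
projector for another wire format could consume the same stream
without touching the harness.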

Also fixes a pre-existing helexa-acp test race: three modules each
declared their own static LOCK for HOME mutation, so tests in
different modules could still mutate HOME concurrently and flake any
test that read it at runtime. Consolidated onto a single crate-wide
path_util::ENV_LOCK.
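
A single shared lock of this kind might look like the sketch below.
Only the `ENV_LOCK` name comes from this message; `lock_env` and
`with_home` are hypothetical helpers added for illustration, and the
real `path_util` module may be structured differently. The `unsafe`
blocks are there because `std::env::set_var` and `remove_var` are
unsafe in the 2024 edition.

```rust
use std::env;
use std::sync::{Mutex, MutexGuard};

/// One process-wide lock for every test that mutates environment
/// variables. A per-module lock only serializes tests in that module.
pub static ENV_LOCK: Mutex<()> = Mutex::new(());

/// Hypothetical helper: take the lock and ignore poisoning, so one
/// panicking test does not cascade into every later test.
fn lock_env() -> MutexGuard<'static, ()> {
    ENV_LOCK.lock().unwrap_or_else(|poisoned| poisoned.into_inner())
}

/// Hypothetical helper: run `f` with HOME set to `home`, then restore
/// the previous value. If `f` panics, HOME is not restored.
fn with_home<T>(home: &str, f: impl FnOnce() -> T) -> T {
    let _guard = lock_env();
    let previous = env::var_os("HOME");
    // SAFETY: all env mutation in this sketch happens under ENV_LOCK.
    unsafe { env::set_var("HOME", home) };
    let out = f();
    match previous {
        // SAFETY: still holding ENV_LOCK.
        Some(value) => unsafe { env::set_var("HOME", value) },
        None => unsafe { env::remove_var("HOME") },
    }
    out
}

fn main() {
    let seen = with_home("/tmp/fake-home", || env::var("HOME").unwrap());
    println!("HOME inside closure: {seen}");
}
```

Tests that only read HOME would need to take the same lock for the
duration of the read to be fully protected.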

122 helexa-acp tests + 44 neuron tests pass (5 new wire projection
tests). fmt + clippy --workspace -- -D warnings clean. Ran helexa-acp
suite 3x to confirm the env race is closed.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-05-29 11:30:17 +03:00

src         refactor(neuron): introduce InferenceEvent + wire projection layer  2026-05-29 11:30:17 +03:00
Cargo.toml  feat(helexa-acp): model picker + session/set_model handler          2026-05-29 09:10:16 +03:00