cortex

helexa/cortex

Fork 0

Files

History

rob thijssen 9b8bd146f6

CI / Format (push) Successful in 36s

Details

build-prerelease / Resolve version stamps (push) Successful in 38s

Details

CI / Clippy (push) Successful in 2m19s

Details

CI / Test (push) Successful in 4m32s

Details

CI / Build cortex SRPM (push) Has been skipped

Details

CI / Publish cortex to COPR (push) Has been skipped

Details

build-prerelease / Build neuron-blackwell (push) Successful in 3m43s

Details

CI / Build neuron SRPM (push) Has been skipped

Details

CI / Publish neuron to COPR (push) Has been skipped

Details

CI / Bump version in source (push) Has been skipped

Details

build-prerelease / Build cortex binary (push) Successful in 4m16s

Details

build-prerelease / Package cortex RPM (push) Successful in 1m23s

Details

build-prerelease / Build neuron-ampere (push) Successful in 4m56s

Details

build-prerelease / Build neuron-ada (push) Successful in 5m1s

Details

build-prerelease / Package helexa-neuron-ampere RPM (push) Successful in 2m51s

Details

build-prerelease / Package helexa-neuron-ada RPM (push) Successful in 3m0s

Details

build-prerelease / Package helexa-neuron-blackwell RPM (push) Successful in 3m39s

Details

build-prerelease / Publish to rpm.lair.cafe (unstable) (push) Successful in 59s

Details

feat(tp): --tp-smoke CLI subcommand + remote validation script

Adds a one-shot diagnostic that exercises the lower half of the TP
stack — WorkerPool::spawn, init_nccl, nccl_sanity_check — in isolation
from model load and inference. Runs N-1 worker subprocesses (rank 0
stays in this process), joins them in an NCCL communicator on the
specified CUDA devices, all_reduces a sentinel 1u32 per rank, verifies
the observed_sum equals world_size on every rank, then shuts down.

Output is `status=ok` on stdout (plus key=value lines for tp_size and
cuda_devices) when every check passes, non-zero exit + tracing on
stderr otherwise. The smoke command is diagnostic-only and not exposed
through the daemon HTTP API.

script/tp-smoke.sh wraps it with an ssh invocation against a fleet
host (default beast — the only host with 2 GPUs) and asserts the
status line, mirroring the validate-neuron.sh ergonomics.

This is step 1 of the TP test plan. A failure here means TP cannot
work on the host at all; step 2 (Stage 7b-iv) wires real model load
and inference through the same WorkerPool primitives.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-19 19:40:25 +03:00

cortex-cli

feat(neuron): OpenAI-compatible non-streaming chat completion

2026-05-18 16:47:58 +03:00

cortex-core

refactor(neuron): cut mistralrs/llamacpp, scaffold candle harness

2026-05-18 15:53:04 +03:00

cortex-gateway

feat(neuron): OpenAI-compatible non-streaming chat completion

2026-05-18 16:47:58 +03:00

neuron

feat(tp): --tp-smoke CLI subcommand + remote validation script

2026-05-19 19:40:25 +03:00