This website requires JavaScript.
Explore
Help
Register
Sign In
helexa
/
cortex
Watch
1
Star
0
Fork
0
You've already forked cortex
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
All Workflows
build-prerelease.yml
ci.yml
Actor
All actors
grenade
Status
All status
Success
Failure
Waiting
Running
fix(gateway): full observability + stop leaking upstream bodies
#160
:
Commit
aa88d37509
pushed by
grenade
main
2026-05-22 04:37:03 +00:00
19m27s
View workflow file
fix(router,handlers): strip trailing slash from rewritten URL + log upstream failures
#158
:
Commit
0f00f72b47
pushed by
grenade
main
2026-05-22 04:17:29 +00:00
6m38s
View workflow file
fix(router): rewrite loopback inference URLs to use neuron's host
#156
:
Commit
9b0ed0b57f
pushed by
grenade
main
2026-05-22 03:43:50 +00:00
19m49s
View workflow file
fix(rpm): migrate legacy helexa-cortex firewalld service to `cortex`
#154
:
Commit
dc2a803266
pushed by
grenade
main
2026-05-22 03:23:52 +00:00
10m46s
View workflow file
feat(stage-8e-3): quantize lm_head in TP Qwen3-Next
#152
:
Commit
e71181499e
pushed by
grenade
main
2026-05-21 19:14:09 +00:00
20m42s
View workflow file
fix(stage-8e-2e): bump quant prefill threshold to M > 64
#150
:
Commit
ee663e5e99
pushed by
grenade
main
2026-05-21 18:53:19 +00:00
2m23s
View workflow file
feat(stage-8e-2d): route quantized matmul by M (prefill vs decode)
#148
:
Commit
34f9b77d9d
pushed by
grenade
main
2026-05-21 18:36:26 +00:00
20m40s
View workflow file
fix(stage-8e-2c): cast bf16/f16 activations to f32 around QMatMul
#146
:
Commit
f084aaab8e
pushed by
grenade
main
2026-05-21 17:25:24 +00:00
19m56s
View workflow file
fix(stage-8e-2b): allow quant on the TP load path
#144
:
Commit
68a606a79c
pushed by
grenade
main
2026-05-21 16:46:09 +00:00
28m46s
View workflow file
feat(stage-8e-2): plumb quant config from ModelSpec to TP load path
#142
:
Commit
4aa71902d0
pushed by
grenade
main
2026-05-21 15:44:17 +00:00
40m32s
View workflow file
feat(stage-8e-1): MaybeQuantLinear primitive + parallel-linear quant variants
#140
:
Commit
bef159b21c
pushed by
grenade
main
2026-05-21 15:03:41 +00:00
8m10s
View workflow file
feat(stage-8d-7): direct safetensors fused-region loader
#138
:
Commit
8d7b099b36
pushed by
grenade
main
2026-05-21 14:55:30 +00:00
5m41s
View workflow file
diag(stage-8d-6): per-layer VRAM logging in TP load path
#136
:
Commit
89d98d1fb2
pushed by
grenade
main
2026-05-21 10:14:05 +00:00
19m51s
View workflow file
feat(stage-8d-5b): wire fused_gdn_gating CUDA kernel
#134
:
Commit
cc95fe28d9
pushed by
grenade
main
2026-05-21 09:14:29 +00:00
21m43s
View workflow file
feat(stage-8d-4): dispatch chunked_gated_delta_rule_recurrence at prefill
#132
:
Commit
09c945f81e
pushed by
grenade
main
2026-05-21 08:52:42 +00:00
1m43s
View workflow file
feat(stage-8d-3): wire causal_conv1d_update/full CUDA kernels
#130
:
Commit
05dc0bad18
pushed by
grenade
main
2026-05-21 08:50:35 +00:00
47s
View workflow file
feat(stage-8d-5): wire gated_delta_rule_recurrence kernel into tp_qwen3_5
#128
:
Commit
10c151efa5
pushed by
grenade
main
2026-05-21 08:49:46 +00:00
5m21s
View workflow file
feat(stage-8d-2): wire gated_delta_rule_recurrence kernel into qwen3_5
#126
:
Commit
44ae927e38
pushed by
grenade
main
2026-05-21 08:44:18 +00:00
4m41s
View workflow file
feat(stage-8d-1): import mistralrs GDN CUDA kernels — build infra only
#124
:
Commit
1ebbe87651
pushed by
grenade
main
2026-05-21 08:39:35 +00:00
4m46s
View workflow file
feat(tp): cancellation-safe inference + structured tracing
#122
:
Commit
70eb6af42b
pushed by
grenade
main
2026-05-21 05:41:50 +00:00
19m41s
View workflow file
fix(tp): always drain worker responses on leader failure
#120
:
Commit
d1a4aad91d
pushed by
grenade
main
2026-05-21 04:58:46 +00:00
18m53s
View workflow file
feat(stage-8c): TP-aware Qwen3-Next (tp_qwen3_5)
#118
:
Commit
95dc8745eb
pushed by
grenade
main
2026-05-20 19:24:58 +00:00
22m6s
View workflow file
fix(qwen3_5): promote beta to F32 alongside q/k/v in delta rule
#116
:
Commit
495d3f7c05
pushed by
grenade
main
2026-05-20 18:39:29 +00:00
25m57s
View workflow file
fix(qwen3_5): tensor names are under `model.language_model.*`, not `model.*`
#114
:
Commit
5c4c8e0eba
pushed by
grenade
main
2026-05-20 14:12:06 +00:00
23m45s
View workflow file
fix(qwen3_5): tensor names are under `model.language_model.*`, not `model.*`
#112
:
Commit
a77f19686e
pushed by
grenade
main
2026-05-20 13:48:20 +00:00
17s
View workflow file
fix(qwen3_5): nested rope_parameters + partial_rotary_factor=0.25
#110
:
Commit
07c44d5db1
pushed by
grenade
main
2026-05-20 13:39:23 +00:00
20m21s
View workflow file
feat(stage-8c): full-attention layer + decoder + Model + ForCausalLM for qwen3_5
#108
:
Commit
e7eb3dab6a
pushed by
grenade
main
2026-05-20 13:13:12 +00:00
20m25s
View workflow file
feat(stage-8c): linear-attention layer (Qwen3-Next GatedDeltaNet)
#106
:
Commit
180274548d
pushed by
grenade
main
2026-05-20 06:49:42 +00:00
19m40s
View workflow file
feat(stage-8c): scaffold qwen3_5 (Qwen3.6) — dispatch + stubs + TP gate
#104
:
Commit
a70f317729
pushed by
grenade
main
2026-05-20 06:19:33 +00:00
21m22s
View workflow file
feat(stage-8b): Llama + Qwen3 MoE families on the candle harness
#102
:
Commit
c6022aa6b9
pushed by
grenade
main
2026-05-20 05:55:49 +00:00
19m18s
View workflow file
First
Previous
1
2
Next
Last