-
Notifications
You must be signed in to change notification settings - Fork 64
Pull requests: hw-native-sys/simpler
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix: strace.h must include profiling_config.h (STRACE silently no-op'd in c_api_shared)
#1268
opened Jul 3, 2026 by
ChaoWao
Collaborator
Loading…
3 of 4 tasks
update(distributed): convert all_to_all distributed to push-based lowering
#1267
opened Jul 3, 2026 by
georgebisbas
Contributor
Loading…
4 of 5 tasks
pto-isa version of spmd_paged_attention_highperf
#1266
opened Jul 3, 2026 by
MirkoDeVita98
Contributor
•
Draft
Normalize TPUSH/TPOP usage and add interim onboard lane-offset bridge
#1265
opened Jul 3, 2026 by
yanghaoran29
Contributor
Loading…
Refactor: move fanout wiring to orchestrator, drop wiring queue
#1264
opened Jul 3, 2026 by
ChaoWao
Collaborator
Loading…
3 of 4 tasks
optimize: prewire task dependencies on orchestrator side
#1263
opened Jul 3, 2026 by
Crane-Liu
Loading…
Refactor dep gen and l2 swimlane collectors to common
#1262
opened Jul 3, 2026 by
vegetabledoww
Contributor
Loading…
perf: prefetch dispatch buffers and inline timestamp reads
#1243
opened Jul 1, 2026 by
TaoZQY
Contributor
Loading…
[WIP] feat(kernels): thread runtime lane id into TPipe via setSubBlockId
#1241
opened Jul 1, 2026 by
yanghaoran29
Contributor
Loading…
Add: L2 input window support to L3-L2 message queue
#1236
opened Jul 1, 2026 by
ccyywwen
Contributor
Loading…
perf(runtime): overlap AICore handshake wakeups; batch the release barrier
#1214
opened Jun 30, 2026 by
ChaoWao
Collaborator
Loading…
2 of 3 tasks
feat(runtime): L3 post-fork host-buffer registration
#1190
opened Jun 29, 2026 by
doraemonmj
Contributor
Loading…
Add: run latency optimization assessment
#1186
opened Jun 29, 2026 by
puddingfjz
Contributor
Loading…
Add: host_build_graph runtime (host-orchestration variant of tensormap)
#1185
opened Jun 29, 2026 by
ChaoWao
Collaborator
Loading…
4 of 5 tasks
Add: SDMA workspace overlay + async completion demo on a5 onboard
#1179
opened Jun 27, 2026 by
jvjhfhg
Collaborator
Loading…
Integrate TraCR as the runtime/kernel profiler for Simpler
#1173
opened Jun 26, 2026 by
noabauma
Contributor
Loading…
Qwen3 test shapes for spmd paged attention highperf
#1172
opened Jun 26, 2026 by
MirkoDeVita98
Contributor
Loading…
test(simt): unify scatter kernel on templated MSCATTER, drop __CPU_SIM fork
#1160
opened Jun 25, 2026 by
ChaoZheng109
Collaborator
Loading…
Add: fully_distributed_within_core runtime — SPMD on-core orchestration
#1142
opened Jun 24, 2026 by
hengliao1972
Loading…
[Optimization] Replace wiring with polling-based task readiness test (~17% median device speedup)
#1137
opened Jun 24, 2026 by
SergioMartin86
Loading…
6 of 7 tasks
docs: architecture review + Rust-suitability analysis with SVG diagrams
#1136
opened Jun 24, 2026 by
yijunyu
Loading…
Add async chip callable register/run overlap
#1090
opened Jun 18, 2026 by
puddingfjz
Contributor
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.