Skip to content

Pull requests: THUDM/slime

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Disk-level delta weight sync
#2089 opened Jun 16, 2026 by nanjiangwill Collaborator Loading…
Add rollout_data_transport nixl run-ci-changed
#2088 opened Jun 16, 2026 by zhuzilin Contributor Loading…
fix(opd): score teacher logprobs at rollout temperature, not 0
#2085 opened Jun 15, 2026 by EazyReal Contributor Loading…
feat(rl): add REINFORCE advantage estimator
#2083 opened Jun 15, 2026 by EazyReal Contributor Loading…
feat(coding_agent_rl): add SWE-bench harness evaluation path
#2079 opened Jun 15, 2026 by aoshen02 Contributor Loading…
3 tasks
fix(rollout): isolate per-trajectory exceptions in generate_and_rm_group
#2078 opened Jun 15, 2026 by aoshen02 Contributor Loading…
fix(script): correct GLM-4.7 expert_model_parallel_size for single-node 8 GPU
#2077 opened Jun 15, 2026 by aoshen02 Contributor Loading…
1 task
Support Qwen3.5-VL (dense + MoE) via Megatron-Bridge
#2075 opened Jun 14, 2026 by demouo Contributor Loading…
feat(rollouts) external rollouts endpoint with publish-only weight sync
#2071 opened Jun 12, 2026 by jvmncs Loading…
4 tasks done
fix(sglang): authenticate engine control-plane and router calls
#2068 opened Jun 12, 2026 by EazyReal Contributor Loading…
fix(metrics): make compute_pass_rate ragged-safe for over-sampled batches
#2064 opened Jun 12, 2026 by EazyReal Contributor Loading…
fix(rollout): apply rollout sample filter in the rollout manager
#2061 opened Jun 12, 2026 by EazyReal Contributor Loading…
[DON'T MERGE] run CI run-ci-megatron
#2053 opened Jun 11, 2026 by zhuzilin Contributor Loading…
support --num-workers for dataset parallel loading
#2048 opened Jun 10, 2026 by demouo Contributor Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.