Pull requests: sgl-project/sglang
Modify the kernel test path & add it to the CI process.
#3044
opened Jan 22, 2025 by
sleepcoo
[Doc] Update doc of profiling with PyTorch Profiler
#3038
opened Jan 22, 2025 by
Fridge003
2 of 4 tasks
Allow local cutlass directory to be used in sgl-kernel build
#3037
opened Jan 22, 2025 by
trevor-m
4 tasks
More efficient minmax-text-01 lightning_attention_decode with cuda
#3030
opened Jan 21, 2025 by
BBuf
Refactor recursive helper methods to iterative approach to prevent s…
#3029
opened Jan 21, 2025 by
luzengxiangcn
•
Draft
4 tasks
[Fix] Address remaining issues of supporting MiniCPMV
#2977
opened Jan 19, 2025 by
mickqian
3 tasks done
[MOE] try to optimize cu kernel single block execution - distribute cumsum workload from thread 0 to other threads
#2970
opened Jan 19, 2025 by
yiakwy-xpu-ml-framework-team
3 of 4 tasks
Support distributed tensor when updating weights
#2831
opened Jan 10, 2025 by
fzyzcjy
3 tasks done
Support custom device mesh for tensor parallel workers
#2827
opened Jan 10, 2025 by
fzyzcjy
3 tasks done
Use CUDA_VISIBLE_DEVICES instead of gpu_id variables everywhere.
#2824
opened Jan 10, 2025 by
heiner
1 task done
Improve the mixed chunk prefill by launching two kernels
#2811
opened Jan 9, 2025 by
libratiger
•
Draft
1 of 3 tasks