News

Before the PR that added extend CUDA graph support (#6606), the server could be started and sglang was able to load the draft CUDA graphs. With the addition of the extend CUDA graphs, the draft is no longer ...