llm-serving

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
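A minimal sketch of offline batch inference with vLLM's Python API, assuming vLLM is installed and the model checkpoint is downloadable; the model name and sampling settings here are illustrative, not a recommendation.

```python
from vllm import LLM, SamplingParams

# Load a small model (illustrative choice) into the vLLM engine.
llm = LLM(model="facebook/opt-125m")

# Sampling parameters for generation.
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# Generate completions for a batch of prompts in one call.
outputs = llm.generate(["Hello, my name is", "The capital of France is"], params)
for out in outputs:
    print(out.prompt, "->", out.outputs[0].text)
```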

ray

Ray is an AI compute engine consisting of a core distributed runtime and a set of AI libraries for accelerating ML workloads.
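A minimal sketch of Ray Core's task API, assuming Ray is installed and run on a single machine; the function and its arguments are illustrative.

```python
import ray

# Start a local Ray runtime (connects to a cluster if one is configured).
ray.init()

# Mark a plain Python function as a remote task.
@ray.remote
def square(x):
    return x * x

# Launch tasks in parallel; each .remote() call returns a future.
futures = [square.remote(i) for i in range(4)]

# Block until all results are ready.
print(ray.get(futures))  # [0, 1, 4, 9]

ray.shutdown()
```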