1 items across the graph — tagged with Fastertransformer.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.