Blog posts tagged
"Model Serving"

Rob Gibbon
27 April 2026

Understanding disaggregated GenAI model serving with llm-d

AI Article

What is llm-d? llm-d is an open source solution for managing high-scale, high-performance Large Language Model (LLM) deployments. LLMs are at the heart of generative AI – so when you chat with ChatGPT or Gemini, you’re talking to an LLM. Simple LLM deployments – where an LLM is deployed to a single server – can ...

Rui Vasconcelos
23 April 2021

What is KFServing?

AI Article

TL;DR: KFServing is a novel cloud-native multi-framework model serving tool for serverless inference. A bit of history KFServing was born as part of the Kubeflow project, a joint effort between AI/ML industry leaders to standardize machine learning operations on top of Kubernetes. It aims at solving the difficulties of model deployment to ...

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Quick links

Categories

Industries

Partner programs

Quick links

Roles by department

Working here

Explore Canonical

Latest updates

Company highlights ›

Blog posts tagged
"Model Serving"

Understanding disaggregated GenAI model serving with llm-d

What is KFServing?

Blog posts tagged "Model Serving"

Understanding disaggregated GenAI model serving with llm-d

What is KFServing?

Blog posts tagged
"Model Serving"