Skip to main content

Blog posts tagged
"Model Serving"


Rob Gibbon
27 April 2026

Understanding disaggregated GenAI model serving with llm-d

AI Article

What is llm-d? llm-d is an open source solution for managing high-scale, high-performance Large Language Model (LLM) deployments. LLMs are at the heart of generative AI – so when you chat with ChatGPT or Gemini, you’re talking to an LLM. Simple LLM deployments – where an LLM is deployed to a single server – can ...


Rui Vasconcelos
23 April 2021

What is KFServing?

AI Article

TL;DR: KFServing is a novel cloud-native multi-framework model serving tool for serverless inference. A bit of history KFServing was born as part of the Kubeflow project, a joint effort between AI/ML industry leaders to standardize machine learning operations on top of Kubernetes. It aims at solving the difficulties of model deployment to ...