@bartoszmajsak
Created November 27, 2025 13:00
# kustomization.yaml
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization

metadata:
  name: facebook-opt-125m-simulated

namespace: llm

namePrefix: facebook-opt-125m-

resources:
- model.yaml
# model.yaml
apiVersion: serving.kserve.io/v1alpha1
kind: LLMInferenceService
metadata:
  name: simulated-non-maas
spec:
  model:
    uri: hf://facebook/opt-125m
    name: facebook/opt-125m
  replicas: 1
  router:
    route: {}
  template:
    containers:
      - name: main
        image: "ghcr.io/llm-d/llm-d-inference-sim:v0.5.1"
        imagePullPolicy: Always
        command: ["/app/llm-d-inference-sim"]
        args:
          - --port
          - "8000"
          - --model
          - facebook/opt-125m
          - --mode
          - random
          - --ssl-certfile
          - /var/run/kserve/tls/tls.crt
          - --ssl-keyfile
          - /var/run/kserve/tls/tls.key
        env:
          - name: POD_NAME
            valueFrom:
              fieldRef:
                apiVersion: v1
                fieldPath: metadata.name
          - name: POD_NAMESPACE
            valueFrom:
              fieldRef:
                apiVersion: v1
                fieldPath: metadata.namespace
        ports:
          - name: https
            containerPort: 8000
            protocol: TCP
        livenessProbe:
          httpGet:
            path: /health
            port: https
            scheme: HTTPS
        readinessProbe:
          httpGet:
            path: /ready
            port: https
            scheme: HTTPS