LLMs in Action: A Cloud Native Story, running on microshift

Forked from https://github.com/cncf/llm-in-action

Using Ollama UBI as internal k8s service

https://github.com/williamcaban/ollama-ubi

with Ollama UBI pod, pull the required model

This command will download the mistral model for ollama to use

~~ $ ollama pull mistral ~~

$ oc get pods -A |egrep "keynote|ollama"
ollama                     ollama-serve-6b77c4df5-rq8k8               1/1     Running                    0               3h6m
test                       keynote-66c595b94b-6z6zn                   1/1     Running                    2               2d12h
$ microshift version
MicroShift Version: 4.14.18
Base OCP Version: 4.14.18
$ oc get nodes
NAME         STATUS   ROLES                         AGE     VERSION
node-nvidia   Ready    control-plane,master,worker   3d17h   v1.27.11
$ oc version
Client Version: 4.14.0-202401111553.p0.g286cfa5.assembly.stream-286cfa5
Kustomize Version: v5.0.1
Kubernetes Version: v1.27.11

$ oc get routes -A
NAMESPACE   NAME           HOST                            ADMITTED   SERVICE      TLS
chat        chat-route     chat.apps.example.com           True       chat-svc     
ollama      ollama-route   ollama.apps.example.com         True       ollama-svc   
test        keynote        keynote-test.apps.example.com   True       keynote      

$ oc get pods
NAME                           READY   STATUS    RESTARTS   AGE
ollama-serve-6b77c4df5-rq8k8   1/1     Running   0          3h9m
$ oc rsh ollama-serve-6b77c4df5-rq8k8 
sh-5.1$ ollama list
NAME              	ID          	SIZE  	MODIFIED   
falcon:7b-instruct	4280f7257e73	4.2 GB	2 days ago	
llava:latest      	8dd30f6b0cb1	4.7 GB	2 days ago	
mistral:latest    	61e88e884507	4.1 GB	2 days ago	
sh-5.1$

Chat front end

https://github.com/arthur-r-oliveira/chat_application_k8s

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
img		img
kubernetes		kubernetes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cluster.yaml		cluster.yaml
docker-compose.yaml		docker-compose.yaml
requirements.txt		requirements.txt
shutdown.sh		shutdown.sh
startup.sh		startup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLMs in Action: A Cloud Native Story, running on microshift

Forked from https://github.com/cncf/llm-in-action

Using Ollama UBI as internal k8s service

with Ollama UBI pod, pull the required model

This command will download the mistral model for ollama to use

Chat front end

About

Releases

Packages

Languages

License

arthur-r-oliveira/llm-in-action_k8s

Folders and files

Latest commit

History

Repository files navigation

LLMs in Action: A Cloud Native Story, running on microshift

Forked from https://github.com/cncf/llm-in-action

Using Ollama UBI as internal k8s service

with Ollama UBI pod, pull the required model

This command will download the mistral model for ollama to use

Chat front end

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages