
TELCODOCS-2134 updating AI pattern #507

Open: wants to merge 21 commits into base: main
Conversation

kquinn1204
Collaborator

@kquinn1204 kquinn1204 commented Dec 5, 2024

[TELCODOCS-2134]: Implement updates based on audit to LLM and RAG generation pattern

Issue: https://issues.redhat.com/browse/TELCODOCS-2134

Link to docs preview: http://507.docs-pr.validatedpatterns.io/

@mbaldessari
Contributor

This is an automated message:

You can preview this docs PR at http://507.docs-pr.validatedpatterns.io
Note that previews are regenerated every five minutes, so please wait a bit.

@openshift-ci openshift-ci bot added the size/L label Dec 5, 2024
@kquinn1204
Collaborator Author

@hbisht-RH-ccs, I would appreciate your review of this PR.

Collaborator

@hbisht-RH-ccs hbisht-RH-ccs left a comment


@kquinn1204, I've just added a few comments; the rest LGTM. Great work. Thanks!


Use these instructions to add GPU nodes to an OpenShift cluster running in the AWS cloud. GPU nodes are tainted so that only pods that require a GPU are scheduled to them.
By default the GPU nodes deployed are of instance type `g5.2xlarge`. If for some reason you want to change this maybe due to performance issues carry out the following steps:
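The taint-and-toleration mechanism described above can be sketched roughly as follows. This is a minimal illustration, not the pattern's actual manifests; the taint key `nvidia.com/gpu`, the pod name, and the image are all illustrative assumptions:

```yaml
# Hypothetical taint applied to a GPU node (node name is a placeholder):
#   oc adm taint nodes <gpu-node-name> nvidia.com/gpu=true:NoSchedule
#
# A pod that needs a GPU then declares a matching toleration:
apiVersion: v1
kind: Pod
metadata:
  name: gpu-workload              # illustrative name
spec:
  tolerations:
  - key: "nvidia.com/gpu"         # must match the taint key above
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
  containers:
  - name: inference
    image: example.com/tgi:latest # placeholder image
    resources:
      limits:
        nvidia.com/gpu: 1         # request one GPU
```

Pods without this toleration are kept off the tainted GPU nodes, which is what reserves those nodes for GPU workloads.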
Collaborator


"The GPU nodes deployed are of instance type" is in the passive voice.
"maybe due to performance issues" is informal and could be more precise.
We can rewrite it as:
By default, GPU nodes use the instance type g5.2xlarge. If you need to change the instance type, such as to address performance requirements, follow these steps:
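Assuming the pattern provisions GPU nodes through an OpenShift MachineSet on AWS, the change suggested above might look roughly like this sketch; the MachineSet name is an illustrative placeholder, not taken from the pattern:

```yaml
# Sketch of the relevant portion of a GPU MachineSet on AWS.
# Edit with, e.g.:  oc -n openshift-machine-api edit machineset <gpu-machineset-name>
apiVersion: machine.openshift.io/v1beta1
kind: MachineSet
metadata:
  name: cluster-gpu-us-west-2a        # illustrative name
  namespace: openshift-machine-api
spec:
  template:
    spec:
      providerSpec:
        value:
          instanceType: g5.2xlarge    # change to another GPU instance type, e.g. g5.4xlarge
```

Existing machines are not updated in place; scaling the MachineSet down and back up replaces the nodes with ones of the new instance type.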

- Red Hat OpenShift cluster running in AWS. Supported regions are us-west-2 and us-east-1.
- A GPU node to run the Hugging Face Text Generation Inference server on the Red Hat OpenShift cluster.
- A fork of the [rag-llm-gitops](https://github.com/validatedpatterns/rag-llm-gitops.git) git repository.

## Demo Description & Architecture

The goal of this demo is to demonstrate a Chatbot LLM application augmented with data from Red Hat product documentation running on [Red Hat OpenShift AI](https://www.redhat.com/en/technologies/cloud-computing/openshift/openshift-ai). It deploys an LLM application that connects to multiple LLM providers such as OpenAI, Hugging Face, and NVIDIA NIM.
Collaborator


IMO we should use some other word here instead of "demonstrate", such as "showcase".

## Deploying the demo
## Prerequisites

- Podman
Collaborator


Instead of just "Podman", maybe we can write:
"Podman is installed on your system."
