Run a simple RAG-enabled chatbot in OKE using NVIDIA NIM, Qdrant, and Gradio

About This Workshop

Running AI workloads on Oracle Kubernetes Engine (OKE) is easier than you think. Oracle Cloud Infrastructure (OCI) provides a wide variety of GPU-enabled shapes, both virtual machine and bare metal, that can serve as worker nodes in your Kubernetes cluster. These nodes come with the NVIDIA drivers and the NVIDIA device plugin DaemonSet pre-installed, simplifying setup.
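Because the device plugin already advertises GPUs to the scheduler, a workload only needs to request the `nvidia.com/gpu` resource to land on a GPU node. A minimal smoke-test pod might look like the following sketch (the pod name and image tag are illustrative placeholders, not part of the workshop):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-smoke-test        # placeholder name
spec:
  restartPolicy: Never
  containers:
    - name: cuda
      image: nvidia/cuda:12.3.1-base-ubuntu22.04   # illustrative CUDA base image
      command: ["nvidia-smi"]                      # prints the GPUs visible to the container
      resources:
        limits:
          nvidia.com/gpu: 1   # resource advertised by the NVIDIA device plugin DaemonSet
```

If the pod completes and its logs show the GPU, the node pool is ready for the workshop's AI workloads.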
With OCI, you can harness the power of Large Language Models (LLMs) securely within your own tenancy, enabling them to answer questions based on your enterprise's data. By implementing a Retrieval Augmented Generation (RAG) pipeline, you can extend an LLM's capabilities, allowing it to give accurate answers about data it was never trained on. This approach improves accuracy while avoiding the cost of training or fine-tuning a model.
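The core RAG idea can be sketched in a few lines: embed the enterprise documents, retrieve the passages closest to the question, and prepend them to the LLM prompt. The sketch below uses a toy hashing embedding and in-memory cosine search purely for illustration; in the workshop, embeddings and generation are served by NVIDIA NIM and vector search is handled by Qdrant.

```python
import math


def embed(text, dim=64):
    """Toy hashing-based embedding (illustration only; a real
    pipeline would call an embedding model, e.g. one served by NIM)."""
    vec = [0.0] * dim
    for token in text.lower().split():
        vec[hash(token) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def retrieve(query, corpus, k=1):
    """Return the k passages most similar to the query by cosine
    similarity (the role Qdrant plays at scale)."""
    q = embed(query)
    scored = sorted(
        corpus,
        key=lambda doc: -sum(a * b for a, b in zip(q, embed(doc))),
    )
    return scored[:k]


def build_prompt(query, corpus):
    """Augment the LLM prompt with retrieved enterprise context."""
    context = "\n".join(retrieve(query, corpus))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Hypothetical two-document corpus for illustration.
corpus = [
    "OKE worker nodes can use A10 GPU shapes.",
    "Gradio provides the chat user interface.",
]
prompt = build_prompt("Which GPU shapes do OKE worker nodes use?", corpus)
```

The augmented `prompt` is what gets sent to the LLM, so the model answers from the retrieved context rather than only from its training data.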

Workshop Info

Estimated time: 3 hours
  • Lab 1 - Provision resources to run the JupyterHub notebook
  • Lab 2 - Run the JupyterHub notebook to chat with an LLM
  • Lab 3 - Retrieval Augmented Generation (RAG) Application

Prerequisites

- Administrative access to an OCI tenancy.
- Ability to spin up A10 instances in OCI.
- Ability to create resources with public IP addresses (Load Balancer, Instances, OKE API Endpoint).
- Access to HuggingFace.
- Acceptance of the selected HuggingFace model's license agreement.
