1. Building LLMs like ChatGPT from scratch and Cloud Deployment

  2. Introduction

    • Course Introduction
    • What you'll learn
    • Colab Notebooks
  3. Pre-requisites

    • RNNs and Attention Models
    • How the transformer works
    • Difference in training and inference
  4. Building Mistral from scratch

    • Global Architecture of Mistral
    • Tokenization
    • Rotary Positional Encoding (RoPE)
    • RoPE Practice
    • Group Query Attention
    • Sliding Window Attention
    • KV caching
    • Transformer Block
    • Full Transformer Model
  5. Deploying Mistral to the cloud (Runpod)

    • Deployment

Difference in training and inference
