33:39
YouTube
AI Engineer
Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou
LLM inference is not your normal deep learning model deployment nor is it trivial when it comes to managing scale, performance and COST. Understanding how to effectively size a production grade LLM deployment requires understanding of the model(s), the compute hardware, quantization and parallelization methods, KV Cache budgets, input and ...
34.9K views
Jan 1, 2025
Large Language Models as Optimizers Language Models for NLP
19:15
Large Language Models Explained! How LLMs Work for Beginners!
YouTube
The Data Guy
15.9K views
Feb 21, 2025
25:20
Everything You Need To Know About Large Language Models (LLMs)
YouTube
Matthew Berman
473.3K views
Mar 7, 2024
7:58
Large Language Models explained briefly
YouTube
3Blue1Brown
5.5M views
Nov 20, 2024
Top videos
36:12
Deep Dive: Optimizing LLM inference
YouTube
Julien Simon
46.4K views
Mar 11, 2024
20:18
LLM Inference Optimization #2: Tensor, Data & Expert Parallelism (TP, DP, EP, MoE)
YouTube
Faradawn Yang
2.5K views
5 months ago
40:56
LLM Optimization Secrets: Speed Up, Shrink Cost, and Scale Smarter in 2025!
YouTube
HustlerCoder
694 views
8 months ago
Large Language Models as Optimizers Neural Network Optimization
38:24
Proximal Policy Optimization (PPO) - How to train Large Language Models
YouTube
Serrano.Academy
80.3K views
Jan 24, 2024
1:00:15
Xavier Bresson - Integrating Large Language Models and Graph Neural Networks - LoG 2024 Keynote
YouTube
Learning on Graphs
2.1K views
Nov 29, 2024
11:42
Day 4/75 Large Language Models Top 2 Optimizers [Explained] Why Softmax is used in Transformers
YouTube
FreeBirds Crew - Data
1.3K views
Jan 25, 2024
17:52
AI Optimization Lecture 01 - Prefill vs Decode - Mastering LLM Techni
…
11.4K views
9 months ago
YouTube
Faradawn Yang
57:55
Optimization of LLM Systems with DSPy and LangChain/LangSmith
25.2K views
Apr 6, 2024
YouTube
LangChain
12:56
LLM System Design: Top 10 Optimization Techniques for Effici
…
741 views
11 months ago
YouTube
The AI Layers
12:24
Demystifying LLM Optimization: LoRA, QLoRA, and Fine-Tuning Ex
…
200 views
10 months ago
YouTube
aikoel
10:36
How to Scale LLMs: Flash Attention, ZeRO, & Parallelism | The Enginee
…
143 views
2 months ago
YouTube
The Savvy Scholar
36:08
LLM Optimization: What's Real & What's B.S. | Gordon Meagher
974 views
4 months ago
YouTube
Gerrid Smith
26:06
LLM Optimization Lecture 5: Continuous Batching and Piggyba
…
994 views
3 months ago
YouTube
Faradawn Yang
9:02
Prompt engineering essentials: Getting better results from LLMs |
…
165.1K views
11 months ago
YouTube
GitHub
45:32
A Survey of Techniques for Maximizing LLM Performance
221.2K views
Nov 13, 2023
YouTube
OpenAI
10:31
LLM Optimization vs Context Optimization: Which is Better for AI?
883 views
Feb 21, 2025
YouTube
IBM
22:02
EASIEST Way to Fine-Tune a LLM and Use It With Ollama
286.5K views
8 months ago
YouTube
Tech With Tim
2:37:05
Fine Tuning LLM Models – Generative AI Course
391.5K views
May 21, 2024
YouTube
freeCodeCamp.org
2:08:57
LangSmith | From Prompt to Production | LLM Development wit
…
37.7K views
3 months ago
YouTube
Naresh i Technologies
16:14
LLM as a Judge Prompt Optimization
2.8K views
11 months ago
YouTube
Arize AI
31:31
The complete TextGrad Tutorial - Easily optimize LLM prompts, mat
…
6.6K views
Sep 20, 2024
YouTube
Neural Breakdown with AVB
5:13
What is LLM quantization?
25.6K views
Nov 6, 2023
YouTube
Airtrain AI
15:19
vLLM: Easily Deploying & Serving LLMs
34.5K views
6 months ago
YouTube
NeuralNine
22:51
How LLMs Work: A Visual Guide
3.7K views
6 months ago
YouTube
HashLips Academy
14:55
What Is a Large Language Model (LLM)? Key Concepts Explained |
…
1.8K views
3 months ago
YouTube
WhiteboardDoodles
6:31
How do LLMs Work? | LLM Explained | Intellipaat
2.9K views
5 months ago
YouTube
Intellipaat
19:35
AEO, GEO & LLM SEO Optimisations with Practical Exam
…
7.6K views
8 months ago
YouTube
Learn From Top Rated Freelancer
12:13
How to Efficiently Serve an LLM?
4.8K views
Aug 5, 2024
YouTube
Ahmed Tremo
3:35
LangWatch LLM Optimization Studio
88.9K views
Nov 27, 2024
YouTube
LangWatch
45:11
LLM inference optimization: Model Quantization and Distillation
1.3K views
Sep 22, 2024
YouTube
YanAITalk
10:02
DsPy Tutorial - optimizing LLM pipelines with DsPy (part 2)
2.3K views
Apr 11, 2024
YouTube
AI Bites
0:32
Master LLM Optimization: Boost AI Performance & Efficiency
1 view
6 months ago
YouTube
Tutorials Time