All
Search
Images
Videos
Shorts
Maps
News
Copilot
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
48:46
YouTube
Umar Jamil
Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math
In this video I will explain Direct Preference Optimization (DPO), an alignment technique for language models introduced in the paper "Direct Preference Optimization: Your Language Model is Secretly a Reward Model". I start by introducing language models and how they are used for text generation. After briefly introducing the topic of AI ...
34.1K views
Apr 14, 2024
Direct Preference Optimization Tutorial
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tuning example
YouTube
Simeon Emanuilov
786 views
Dec 26, 2024
9:10
Direct Preference Optimization: Forget RLHF (PPO)
YouTube
Discover AI
16.1K views
Jun 6, 2023
53:03
DPO - Part1 - Direct Preference Optimization Paper Explanation | DPO an alternative to RLHF??
YouTube
Neural Hacks with Vasanth
2K views
Aug 12, 2023
Top videos
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning
YouTube
Serrano.Academy
30.7K views
Jun 21, 2024
16:57
Direct Preference Optimization (DPO) | Paper Explained
YouTube
Outlier
1.4K views
2 months ago
36:25
Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model Explained
YouTube
Gabriel Mongaras
19.2K views
Aug 10, 2023
Direct Preference Optimization Applications
18:44
W12L53: Direct Preference Optimization (DPO)
YouTube
IIT Madras - B.S. Degree
1.1K views
6 months ago
8:55
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained
YouTube
AI Coffee Break with Letitia
39.1K views
Dec 22, 2023
19:39
Reinforcement Learning, RLHF, & DPO Explained
YouTube
Mark Hennings
16.2K views
Jun 12, 2024
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs dir
…
30.7K views
Jun 21, 2024
YouTube
Serrano.Academy
16:57
Direct Preference Optimization (DPO) | Paper Explained
1.4K views
2 months ago
YouTube
Outlier
36:25
Direct Preference Optimization (DPO): Your Language Model is S
…
19.2K views
Aug 10, 2023
YouTube
Gabriel Mongaras
8:55
Direct Preference Optimization: Your Language Model is Secretly
…
39.1K views
Dec 22, 2023
YouTube
AI Coffee Break with Letitia
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tu
…
786 views
Dec 26, 2024
YouTube
Simeon Emanuilov
18:44
W12L53: Direct Preference Optimization (DPO)
1.1K views
6 months ago
YouTube
IIT Madras - B.S. Degree Programme
2:45
Direct Preference Optimization (DPO) Explained: AI Alignment
7 views
3 months ago
YouTube
VLR Software Training
37:16
Hands-on 10: Large Language Model Alignment with Direct Prefe
…
3.7K views
7 months ago
YouTube
BrainOmega
59:40
Direct Preference Optimization (DPO) in 1 hour
2.1K views
5 months ago
YouTube
Zachary Huang
53:03
DPO - Part1 - Direct Preference Optimization Paper Explanation |
…
2K views
Aug 12, 2023
YouTube
Neural Hacks with Vasanth
12:55
DPO Coding | Direct Preference Optimization (DPO) Code impleme
…
384 views
11 months ago
YouTube
AILinkDeepTech
9:10
Direct Preference Optimization: Forget RLHF (PPO)
16.1K views
Jun 6, 2023
YouTube
Discover AI
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
14 views
3 months ago
YouTube
LOADING_
48:46
Direct Preference Optimization (DPO) explained: Bradley-Terry
…
197 views
10 months ago
bilibili
yaojingguo
19:39
Reinforcement Learning, RLHF, & DPO Explained
16.2K views
Jun 12, 2024
YouTube
Mark Hennings
58:07
Aligning LLMs with Direct Preference Optimization
34.1K views
Feb 8, 2024
YouTube
DeepLearningAI
14:15
Direct Preference Optimization
820 views
Apr 9, 2024
YouTube
Data Science Gems
31:31
Lecture 40 : Aligning to User Preferences via Direct Preference
…
275 views
6 months ago
YouTube
NPTEL IIT Kharagpur
7:55
[Paper Review] Direct preference optimization(DPO) : Your languag
…
8 views
5 months ago
YouTube
LOADING_
14:16
Diffusion Model Alignment Using Direct Preference Optimization
44 views
2 months ago
bilibili
dalaska的欢愉
5:32
This AI Breakthrough Changes Everything (DPO Explained)
1 views
1 month ago
YouTube
CollapsedLatents
8:05
DPO : L'Alternative RLHF qui Révolutionne l'Alignement IA
26 views
3 months ago
YouTube
Deep Learner, One Step at a Time
13:01
DPO (Direct Preference Optimization)についてNotebookL
…
2 views
3 months ago
YouTube
Ai情報Note
6:18
4 Ways to Align LLMs: RLHF, DPO, KTO, and ORPO
4.1K views
Jul 10, 2024
YouTube
Snorkel AI
1:27:21
RLHF, PPO and DPO for Large language models
3.6K views
Feb 18, 2024
YouTube
Arvind N
1:10:29
[인공지능,머신러닝,딥러닝] (심화) Direct preference optimization (DP
…
2.7K views
Mar 18, 2024
YouTube
컴달인 - 컴퓨터 달인
41:28
LLMs | Alignment of Language Models: Contrastive Learning | Le
…
1.6K views
Sep 26, 2024
YouTube
LCS2
14:30
【双语】Direct Preference Optimization [NeurIPS 2023]
826 views
9 months ago
bilibili
Sa神带你学AI
48:46
【Umar Jamil】DPO: Direct Preference Optimization 详解 中英
…
63 views
Feb 11, 2025
bilibili
阳冰NaN
See more videos
More like this
Feedback