All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
substack.com
Direct Preference Optimization (DPO) explained
A Simpler Way to Fine-Tune Language Models than with RLHF
100 views
Dec 27, 2024
Direct Preference Optimization Tutorial
論文紹介:Direct Preference Optimization: Your Language Model is Secretly a Reward Model
speakerdeck.com
Aug 19, 2024
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
YouTube
LOADING_
14 views
3 months ago
1:05
DeepLearning.AI on Instagram: "Our course recommendation of the day is “Post-training of LLMs, ” where you’ll learn how to customize pre-trained language models using Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Online Reinforcement Learning (RL). You'll learn when to use each method, how to curate training data, and implement them in code to shape model behavior effectively. Enroll at the link in bio or comment "LLM" to receive the link in your inbox."
Instagram
deeplearningai
8.1K views
4 months ago
Top videos
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tuning example
YouTube
Simeon Emanuilov
786 views
Dec 26, 2024
48:46
Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math
YouTube
Umar Jamil
34.1K views
Apr 14, 2024
58:07
Aligning LLMs with Direct Preference Optimization
YouTube
DeepLearningAI
34.1K views
Feb 8, 2024
Direct Preference Optimization Applications
DPT: Dynamic Preference Transfer for Cross-Domain Sequential Recommendation | Proceedings of the 34th ACM International Conference on Information and Knowledge Management
acm.org
3 months ago
12:13
Model Predictive Control
YouTube
Steve Brunton
334K views
Jun 11, 2018
14:23
Intro to Linear Programming
YouTube
Dr. Trefor Bazett
296.3K views
Apr 6, 2021
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tu
…
786 views
Dec 26, 2024
YouTube
Simeon Emanuilov
48:46
Direct Preference Optimization (DPO) explained: Bradley-Terry m
…
34.1K views
Apr 14, 2024
YouTube
Umar Jamil
58:07
Aligning LLMs with Direct Preference Optimization
34.1K views
Feb 8, 2024
YouTube
DeepLearningAI
2:45
Direct Preference Optimization (DPO) Explained: AI Alignment
7 views
3 months ago
YouTube
VLR Software Training
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs dir
…
30.7K views
Jun 21, 2024
YouTube
Serrano.Academy
16:57
Direct Preference Optimization (DPO) | Paper Explained
1.4K views
2 months ago
YouTube
Outlier
Direct Nash Optimization: Teaching language models to self-improve
…
Sep 3, 2024
Microsoft
論文紹介:Direct Preference Optimization: Your Language Mod
…
Aug 19, 2024
speakerdeck.com
8:55
Direct Preference Optimization: Your Language Model is Secretly
…
39.1K views
Dec 22, 2023
YouTube
AI Coffee Break with Letitia
12:55
DPO Coding | Direct Preference Optimization (DPO) Code impleme
…
384 views
11 months ago
YouTube
AILinkDeepTech
41:28
LLMs | Alignment of Language Models: Contrastive Learning | Le
…
1.6K views
Sep 26, 2024
YouTube
LCS2
6:18
4 Ways to Align LLMs: RLHF, DPO, KTO, and ORPO
4.1K views
Jul 10, 2024
YouTube
Snorkel AI
36:25
Direct Preference Optimization (DPO): Your Language Model is S
…
19.2K views
Aug 10, 2023
YouTube
Gabriel Mongaras
7:55
[Paper Review] Direct preference optimization(DPO) : Your languag
…
8 views
5 months ago
YouTube
LOADING_
53:03
DPO - Part1 - Direct Preference Optimization Paper Explanation |
…
2K views
Aug 12, 2023
YouTube
Neural Hacks with Vasanth
19:39
Reinforcement Learning, RLHF, & DPO Explained
16.2K views
Jun 12, 2024
YouTube
Mark Hennings
13:01
DPO (Direct Preference Optimization)についてNotebookL
…
2 views
3 months ago
YouTube
Ai情報Note
37:16
Hands-on 10: Large Language Model Alignment with Direct Prefe
…
3.7K views
7 months ago
YouTube
BrainOmega
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
14 views
3 months ago
YouTube
LOADING_
8:05
DPO : L'Alternative RLHF qui Révolutionne l'Alignement IA
26 views
3 months ago
YouTube
Deep Learner, One Step at a Time
14:16
DPO (Direct Preference Optimization) 算法讲解
50.6K views
Mar 3, 2024
bilibili
RethinkFun
5:08
LLM Alignment Methods - DPO vs IPO vs KTO vs PCL
1.6K views
Jan 27, 2024
YouTube
Fahd Mirza
18:44
W12L53: Direct Preference Optimization (DPO)
1.1K views
6 months ago
YouTube
IIT Madras - B.S. Degree Programme
1:10:29
[인공지능,머신러닝,딥러닝] (심화) Direct preference optimization (DP
…
2.7K views
Mar 18, 2024
YouTube
컴달인 - 컴퓨터 달인
2:00
Diffusion Model Alignment Using Direct Preference Optimization
1.5K views
Nov 24, 2023
bilibili
PaperWeekly
1:06:31
UMass CS685 S24 (Advanced NLP) #12: Direct preference optimizatio
…
3.1K views
Mar 13, 2024
YouTube
Mohit Iyyer
42:49
Direct Preference Optimization (DPO)
7.3K views
Nov 13, 2023
YouTube
Trelis Research
1:27:21
RLHF, PPO and DPO for Large language models
3.6K views
Feb 18, 2024
YouTube
Arvind N
40:55
Fast Fine Tuning and DPO Training of LLMs using Unsloth
5.9K views
Mar 25, 2024
YouTube
AI Anytime
See more videos
More like this
Feedback