All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
[Interesting content] InstructGPT, RLHF and SFT
1 views
Jan 24, 2023
substack.com
3:27
1.1K views · 101 reactions | A new short course on Reinforcement...
1.1K views
1 month ago
Facebook
DeepLearning.AI
2:44
What is Reinforcement Learning from Human Feedback (RLHF)? |
…
Apr 20, 2023
techtarget.com
2:33
基于人类反馈微调大语言模型:RLHF与DPO方法详解 第九部分
154 views
1 month ago
bilibili
光子AI
3:20
基于人类反馈微调大语言模型:RLHF与DPO方法详解 第四部分
196 views
1 month ago
bilibili
光子AI
3:56
基于人类反馈微调大语言模型:RLHF与DPO方法详解
239 views
1 month ago
bilibili
光子AI
19:23
手把手带你快速弄懂SFT、RLHF、DPO !从定义到适用边界全流程解
…
1.5K views
1 month ago
bilibili
爱学大模型的柒柒
3:51
基于人类反馈微调大语言模型:RLHF与DPO方法详解(第二部
…
185 views
1 month ago
bilibili
光子AI
Generating Conversation: RLHF and LLM Evaluations with Nathan Lam
…
1.3K views
Sep 6, 2023
YouTube
RunLLM
🐐Llama 3 Fine-Tune with RLHF [Free Colab 👇🏽]
20.5K views
Aug 6, 2023
YouTube
Whispering AI
2:24
Deep-Hole Drilling Technique
857.8K views
Aug 3, 2012
YouTube
VEQTER Ltd.
6:49
Deep Tendon Reflexes (Stanford Medicine 25)
3.5M views
Mar 17, 2014
YouTube
Stanford Medicine 25
10:47
(Sponsored) High-Speed PCB Design Tips - Phil's Lab #25
98.8K views
Jun 28, 2021
YouTube
Phil’s Lab
2:17
SPEED GANG - 10 LINES DEEP (LYRIC VIDEO)
601.1K views
Oct 29, 2018
YouTube
Speed Gang
2:00
2-Minute Neuroscience: Deep Brain Stimulation
106.9K views
Sep 24, 2020
YouTube
Neuroscientifically Challenged
14:18
The Fastest Ship in the U.S. Navy: Boeing Pegasus-Class Hydrofoils
10.1M views
Aug 3, 2015
YouTube
DOCUMENTARY TUBE
25:40
Python Reinforcement Learning Tutorial for Beginners in 25 Minutes
67.4K views
Mar 10, 2021
YouTube
Nicholas Renotte
3:32
Deep Tendon Reflexes: Explained by Ortho Eval Pal
94.6K views
Aug 19, 2019
YouTube
Ortho Eval Pal with Paul Marquis PT
20:52
Depth First Search (DFS) Explained: Algorithm, Examples, and Code
508.1K views
Jul 5, 2020
YouTube
Reducible
3:01:58
Reinforcement Learning in 3 Hours | Full Course using Python
521.3K views
Jun 6, 2021
YouTube
Nicholas Renotte
1:00:04
'Deep Relaxation' Delta Binaural Beat - 0.5Hz (1h Pure)
338.6K views
Feb 18, 2016
YouTube
Samuel Schüpbach
14:37
Simple Explanation of LSTM | Deep Learning Tutorial 36 (Tensorflow,
…
563.8K views
Feb 6, 2021
YouTube
codebasics
1:21:50
Stranded Deep World Record - First Ever Glitchless Speedrun! - 1hr 20
…
88.2K views
Apr 28, 2021
YouTube
Speedy Deep
36:26
A friendly introduction to deep reinforcement learning, Q-network
…
138.6K views
May 24, 2021
YouTube
Serrano.Academy
6:04
Hubble's UItra Deep Field in 3D is an amazing journey through space a
…
1.2M views
Jan 12, 2021
YouTube
VideoFromSpace
11:31
Reinforcement Learning in DeepSeek-R1 | Visually Explained
42.7K views
Feb 1, 2025
YouTube
AGI Lambda
6:34
W2 9 How LLMs follow instructions, Instruction tuning and RLHF
7.7K views
Dec 22, 2023
YouTube
AI Thought
4:24
Speed King
393.3K views
Feb 19, 2017
YouTube
Deep Purple - Topic
1:22
Deep (Sped Up)
396.1K views
Oct 31, 2022
YouTube
Summer Walker - Topic
42:40
State of GPT | BRK216HFS
751.5K views
May 25, 2023
YouTube
Microsoft Developer
See more videos
More like this
Feedback