Practical Guide to Reinforcement Learning from Human Feedback

Free shipping

Title: Practical Guide to Reinforcement Learning from Human Feedback

Subtitle: Using Human Signals to Align AI Models

Author: K, Sandip

Publisher: Packt Publishing Limited

Publication date: 03/2026

Cover: Paperback

Language: English

ISBN13: 9781835880500

Delivery time: Pre-release, ships 15 to 20 days after publication


Table of Contents

Introduction to Reinforcement Learning
Role of Human Feedback in Reinforcement Learning
Reward Modeling
Policy Training Based on Reward Model
Introduction to Language Models and Fine Tuning
Parameter Efficient Fine Tuning
Reward Modeling for Language Model Tuning
Reinforcement Learning for Tuning Language Models
Challenges of Reinforcement Learning with Human Feedback
Direct Preference Optimization
RLHF and Model Evaluations
Other Applications
