Build advanced, job-ready skills in fine-tuning using Hugging Face, reinforcement learning, and direct preference optimization for language models.
Build advanced, job-ready skills in fine-tuning using Hugging Face, reinforcement learning, and direct preference optimization for language models.
This intermediate-level course addresses the high demand for AI engineers skilled in advanced fine-tuning techniques for large language models (LLMs). Designed for AI specialists looking to enhance their career prospects, the curriculum focuses on cutting-edge methods to customize pre-trained models for specific applications like chatbots, translation, and content generation. Students will explore instruction-tuning with Hugging Face, reward modeling, and training specialized reward models to evaluate LLM outputs. The course delves into reinforcement learning from human feedback (RLHF), teaching participants how to view LLMs as probability distributions and policies that can be optimized. Key techniques covered include proximal policy optimization (PPO) with Hugging Face configuration and direct preference optimization (DPO) using partition functions. Through hands-on labs, students gain practical experience implementing reward modeling, PPO, and DPO approaches. This focused two-week program delivers the specialized knowledge employers are actively seeking, enabling AI professionals to tailor large language models with sophisticated techniques that improve accuracy and relevance in real-world applications.
Instructors:
English
English
What you'll learn
Master instruction-tuning techniques using Hugging Face frameworks
Implement and train reward models for evaluating LLM-generated content
Understand how to conceptualize large language models as probability distributions
Transform LLMs into policies for reinforcement learning applications
Apply reinforcement learning from human feedback (RLHF) to improve model outputs
Implement proximal policy optimization (PPO) with Hugging Face frameworks
Skills you'll gain
This course includes:
PreRecorded video
Graded quizzes,5 labs,Practice quizzes
Access on Mobile, Tablet, Desktop
Limited Access access
Shareable certificate
Closed caption
Top companies offer this course to their employees
Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.





Fee Structure
Payment options
Financial Aid
Instructor
Pioneering Data Scientist Bridging AI Research and Education
Dr. Joseph Santarcangelo, a Data Scientist at IBM, brings a unique blend of academic excellence and practical expertise to the field of data science and artificial intelligence. With a Ph.D. in Electrical Engineering, his groundbreaking research focused on the intersection of machine learning, signal processing, and computer vision to understand how video content influences human cognitive processes. At IBM, he has established himself as a prominent educator and course developer, creating comprehensive learning materials that have reached hundreds of thousands of students worldwide. His teaching portfolio encompasses a wide range of technical subjects, from foundational Python programming to advanced topics in artificial intelligence, machine learning, and computer vision. Santarcangelo's ability to translate complex technical concepts into accessible learning experiences has made him an influential figure in data science education, maintaining consistently high ratings from learners while continuing to push the boundaries of applied machine learning and artificial intelligence research.
Testimonials
Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.
Frequently asked questions
Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.