Master high-performance GPU programming with CUDA framework. Transform CPU algorithms into parallel GPU implementations.
Master high-performance GPU programming with CUDA framework. Transform CPU algorithms into parallel GPU implementations.
This course cannot be purchased separately - to access the complete learning experience, graded assignments, and earn certificates, you'll need to enroll in the full GPU Programming Specialization program. You can audit this specific course for free to explore the content, which includes access to course materials and lectures. This allows you to learn at your own pace without any financial commitment.
2.2
(11 ratings)
5,906 already enrolled
Instructors:
English
What you'll learn
Develop C/C++ software for CPUs and NVIDIA GPUs using CUDA
Transform sequential algorithms into parallel GPU implementations
Manage different types of GPU memory for optimal performance
Implement multi-dimensional thread and block configurations
Optimize code for massive parallel execution
Skills you'll gain
This course includes:
2.1 Hours PreRecorded video
5 quizzes, 5 programming assignments
Access on Mobile, Tablet, Desktop
FullTime access
Shareable certificate
Get a Completion Certificate
Share your certificate with prospective employers and your professional network on LinkedIn.
Created by
Provided by

Top companies offer this course to their employees
Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.





There are 5 modules in this course
This comprehensive course focuses on parallel programming using NVIDIA's CUDA framework. Students learn to transform sequential CPU algorithms into parallel GPU implementations that can execute hundreds to thousands of times simultaneously. The curriculum covers essential concepts including thread management, memory hierarchies (global, shared, constant, and register), and performance optimization techniques. Through hands-on programming assignments, students develop practical skills in GPU computing for large-scale data processing.
Course Overview
Module 1 · 3 Hours to complete
Threads, Blocks and Grids
Module 2 · 5 Hours to complete
Host and Global Memory
Module 3 · 5 Hours to complete
Shared and Constant Memory
Module 4 · 3 Hours to complete
Register Memory
Module 5 · 3 Hours to complete
Fee Structure
Instructor
Innovator in Computer Science Education
Chancellor Pascale has been a faculty member at Johns Hopkins University's Whiting School of Engineering for over 10 years, specializing in the Computer Science department. He earned his undergraduate degree in Computer Science from Drexel University and a Master’s degree from Johns Hopkins University. With over 15 years of experience in software and service development, he focuses on web applications, AI/ML for image and language translation, and network management. Chancellor Pascale was awarded the Fulbright-Nehru Fellowship, where he led a three-week workshop on Linux API development at Stella Maris College in Chennai, India.
Testimonials
Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.
2.2 course rating
11 ratings
Frequently asked questions
Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.