Learn essential data wrangling techniques using R and tidyverse to clean, process, and prepare data for advanced analysis.
Learn essential data wrangling techniques using R and tidyverse to clean, process, and prepare data for advanced analysis.
This comprehensive course teaches the critical process of data wrangling, focusing on transforming raw data into analysis-ready formats. Students learn to import data from various sources, use the tidyverse package for data cleaning, and master techniques for string processing, HTML parsing, and text mining. The course covers practical skills in handling different file formats, working with dates and times, and using regular expressions for data manipulation. Emphasis is placed on real-world applications and common data cleaning challenges.
4.4
(26 ratings)
25,247 already enrolled
Instructors:
English
English
What you'll learn
Master data import techniques for various file formats in R
Apply tidyverse tools for efficient data cleaning and organization
Implement web scraping methods for data collection
Develop expertise in string processing using regular expressions
Utilize dplyr for advanced data manipulation tasks
Perform text mining and temporal data analysis
Skills you'll gain
This course includes:
PreRecorded video
Graded assignments, exams
Access on Mobile, Tablet, Desktop
Limited Access access
Shareable certificate
Closed caption
Get a Completion Certificate
Share your certificate with prospective employers and your professional network on LinkedIn.
Created by
Provided by

Top companies offer this course to their employees
Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.





Module Description
This course focuses on the essential data wrangling skills needed for effective data science. The curriculum covers importing data from various formats, cleaning and tidying data using tidyverse, web scraping, string processing with regular expressions, and text mining. Students learn to handle real-world data challenges, including working with dates and times, parsing HTML, and preparing data for analysis. The course emphasizes practical applications and common data transformation scenarios encountered in data science projects.
Fee Structure
Instructor

1 Course
Senior Lecturer in Public Leadership at Harvard Kennedy School
Ronald Heifetz is among the world's foremost authorities on the practice and teaching of leadership. He speaks extensively and advises heads of governments, businesses, and nonprofit organizations across the globe. In 2016, President Juan Manuel Santos of Colombia highlighted Heifetz's advice in his Nobel Peace Prize Lecture. Heifetz founded the Center for Public Leadership at Harvard Kennedy School where he has taught for nearly four decades. He is the King Hussein bin Talal Senior Lecturer in Public Leadership. Heifetz played a pioneering role in establishing leadership as an area of study and education in the United States and at Harvard. His research addresses two challenges: developing a conceptual foundation for the analysis and practice of leadership; and developing transformative methods for leadership education, training, and consultation. Heifetz co-developed the adaptive leadership framework with Riley Sinder and Marty Linsky to provide a basis for leadership research and practice. His first book, Leadership Without Easy Answers (1994), is a classic in the field and one of the ten most assigned course books at Harvard and Duke Universities. Heifetz co-authored the best-selling Leadership on the Line: Staying Alive through the Dangers of Change with Marty Linsky, which serves as one of the primary go-to books for practitioners across sectors (2002, revised 2017). He then co-authored the field book, The Practice of Adaptive Leadership: Tools and Tactics for Changing your Organization and the World with Alexander Grashow and Marty Linsky (2009). Heifetz began his focus on transformative methods of leadership education and development in 1983. Drawing students from throughout Harvard's graduate schools and neighboring universities, his courses on leadership are legendary; his core course is considered the most influential in their career by Kennedy School alumni. His teaching methods have been studied extensively in doctoral dissertations and in Leadership Can Be Taught by Sharon Daloz Parks (2005). A graduate of Columbia University, Harvard Medical School, and the Kennedy School, Heifetz is both a physician and cellist. He trained initially in surgery before deciding to devote himself to the study of leadership in public affairs, business, and nonprofits. Heifetz completed his medical training in psychiatry, which provided a foundation to develop more powerful teaching methods and gave him a distinct perspective on the conceptual tools of political psychology and organizational behavior. As a cellist, he was privileged to study with the great Russian virtuoso, Gregor Piatigorsky.
Testimonials
Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.
Frequently asked questions
Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.