RiseUpp Logo
Educator Logo

IBM: Data Science and Machine Learning - Capstone Course

This course is part of Ciencia de datos con Python Professional Certificate.

In this capstone course, you'll apply your data science and machine learning knowledge to a real-world business scenario, transforming theory into practical skills that employers value. You'll explore datasets from New York City's 311 system, where residents report non-urgent problems. Focusing on complaints submitted to the Housing Preservation and Development Department, you'll analyze trends, visualize data, and build predictive models to answer crucial questions that help authorities address these complaints effectively. Throughout this project, you'll utilize Python, data ingestion techniques, exploratory data analysis, visualization, feature engineering, probabilistic modeling, and model validation. Upon completion, you'll have created a comprehensive data science project that demonstrates your job-readiness to potential employers, complete with a digital badge credential to verify your newly acquired skills.

Spanish

Español

Powered by

Provider Logo
IBM: Data Science and Machine Learning - Capstone Course

This course includes

6 Weeks

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

Free course

What you'll learn

  • Apply data science and machine learning knowledge to real-world business scenarios

  • Analyze and visualize data using Python programming

  • Perform feature engineering exercises to prepare data for modeling

  • Create and validate predictive machine learning models

  • Develop actionable insights from real-life data problems to showcase to potential employers

  • Earn a verified digital badge credential demonstrating your practical data science skills

Skills you'll gain

Data Science
Machine Learning
Python Programming
Data Visualization
Predictive Modeling
Feature Engineering
Data Analysis
Probability
Model Validation
Housing Data Analysis

This course includes:

PreRecorded video

Assignments, projects

Access on Mobile, Tablet, Desktop

Limited Access access

Shareable certificate

Closed caption

Get a Completion Certificate

Share your certificate with prospective employers and your professional network on LinkedIn.

Created by

Provided by

Certificate

Top companies offer this course to their employees

Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.

icon-0icon-1icon-2icon-3icon-4

Module Description

This course covers essential data science skills applied to real-world scenarios using New York City's 311 system data. Students will explore datasets focusing on housing complaints, learning to perform comprehensive data analysis and build predictive models. The curriculum includes data ingestion and cleaning techniques, exploratory data analysis, data visualization methods, feature engineering approaches, probabilistic modeling, and model validation. Students will analyze the increasing volume of complaints to the Housing Preservation and Development Department, using Python and machine learning tools to develop actionable insights. The project-based approach emphasizes practical skills that employers value, culminating in a portfolio-ready project demonstrating the student's ability to solve complex data problems.

Instructors

Data Science Thought Leader

Sourav Mazumder is a Data Science Thought Leader and Distinguished Engineer at IBM, with over 22 years of experience in the IT industry and more than a decade specializing in artificial intelligence, data science, and big data. He has driven business innovation across sectors such as manufacturing, insurance, telecom, banking, media, healthcare, and retail in regions including the USA, Europe, Australia, Japan, and India. Sourav has deep expertise in a wide range of technologies, including Watson APIs, Spark, Hadoop, BigSQL, HBase, MongoDB, Cognos, R, Python, and Java, and has contributed to projects from MVP development to full-scale production. He is a prolific author and speaker, regularly publishing articles and presenting at industry conferences on AI, data science, and big data. Sourav holds certifications such as The Open Group Distinguished Data Scientist and IBM Master Inventor, and his work includes research and patents in areas like trustworthy AI, ML Ops, and AI governance

Linda Liu
Linda Liu

2 Courses

Data Science Architect & Evangelist

Linda Liu is a data science consultant in IBM's Data Science and Cloud Expert Lab, specializing in data analytics projects and technical enablement. With experience in DevOps management and consulting for leading technology companies, she focuses on implementing data-driven solutions and training programs for IBM's internal and external IT audiences. Her work spans developing analytics frameworks and collaborating on professional certification courses like "Ciencia de datos con Python" for IBM on edX.

IBM: Data Science and Machine Learning - Capstone Course

This course includes

6 Weeks

Of Self-paced video lessons

Intermediate Level

Completion Certificate

awarded on course completion

Free course

Testimonials

Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.

Frequently asked questions

Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.