RiseUpp Logo
Educator Logo

Managing Big Data in Clusters and Cloud Storage

This course is part of Modern Big Data Analysis with SQL Specialization.

This course cannot be purchased separately - to access the complete learning experience, graded assignments, and earn certificates, you'll need to enroll in the full Modern Big Data Analysis with SQL Specialization program. You can audit this specific course for free to explore the content, which includes access to course materials and lectures. This allows you to learn at your own pace without any financial commitment.

4.7

(295 ratings)

12,175 already enrolled

English

پښتو, বাংলা, اردو, 3 more

Powered by

Provider Logo
Managing Big Data in Clusters and Cloud Storage

This course includes

19 Hours

Of Self-paced video lessons

Beginner Level

Completion Certificate

awarded on course completion

Free course

What you'll learn

  • Browse and manage databases in big data systems

  • Explore files in distributed filesystems and cloud storage

  • Create and manage big data databases using Apache Hive and Impala

  • Choose appropriate data types and file formats

  • Optimize database performance and query execution

Skills you'll gain

Big Data Management
Cloud Storage
Apache Hive
Apache Impala
HDFS
SQL
Data Types
File Systems
Database Management
Data Architecture

This course includes:

2.8 Hours PreRecorded video

9 assignments

Access on Mobile, Tablet, Desktop

FullTime access

Shareable certificate

Closed caption

Get a Completion Certificate

Share your certificate with prospective employers and your professional network on LinkedIn.

Created by

Provided by

Certificate

Top companies offer this course to their employees

Top companies provide this course to enhance their employees' skills, ensuring they excel in handling complex projects and drive organizational success.

icon-0icon-1icon-2icon-3icon-4

There are 5 modules in this course

This comprehensive course focuses on managing and analyzing big data in cluster environments and cloud storage systems. Students learn to work with distributed SQL engines like Apache Hive and Apache Impala, understand different data types and file formats, and manage databases in cloud environments. The course covers practical aspects of data management, including creating and managing tables, choosing appropriate data types, and optimizing query performance.

Orientation to Data in Clusters and Cloud Storage

Module 1 · 2 Hours to complete

Defining Databases, Tables, and Columns

Module 2 · 5 Hours to complete

Data Types and File Types

Module 3 · 2 Hours to complete

Managing Datasets in Clusters and Cloud Storage

Module 4 · 5 Hours to complete

Optimizing Hive and Impala (Honors)

Module 5 · 5 Hours to complete

Fee Structure

Individual course purchase is not available - to enroll in this course with a certificate, you need to purchase the complete Professional Certificate Course. For enrollment and detailed fee structure, visit the following: Modern Big Data Analysis with SQL Specialization

Instructors

Ian Cook
Ian Cook

4.8 rating

526 Reviews

29,923 Students

2 Courses

Staff Curriculum Developer and Data Science Expert

Ian Cook is a Staff Curriculum Developer at Cloudera, where he leverages his deep expertise in data science and statistics to create impactful learning materials. Ian has authored several R packages and has held data scientist roles at TIBCO Software and Advanced Micro Devices. In addition to his work at Cloudera, he is the cofounder of Research Triangle Analysts, the largest data science meetup group in Raleigh, North Carolina. Ian holds a master’s degree in statistics from Lehigh University. His extensive background in data science and his commitment to education are reflected in his courses, including Analyzing Big Data with SQL and Managing Big Data in Clusters and Cloud Storage, where he helps learners develop the skills to analyze and manage large-scale data effectively.

Glynn Durham
Glynn Durham

4.7 rating

1,086 Reviews

53,385 Students

2 Courses

Senior Instructor and Expert in Big Data and Database Technologies

Glynn Durham, a Louisiana native, earned his Master’s Degree in Computer Science from Louisiana State University, where he studied under Peter Chen, the creator of entity-relationship modeling, a key method for relational database design. Glynn’s career has been largely dedicated to teaching technical subjects, with over five years of experience each at Oracle, MySQL, Sun Microsystems, and Cloudera. As a Senior Instructor, he brings a wealth of practical and theoretical knowledge to his students, specializing in big data and database technologies. His courses, Foundations for Big Data Analysis with SQL and Managing Big Data in Clusters and Cloud Storage, provide learners with essential skills to handle and analyze large datasets in modern data environments. Glynn’s extensive industry experience and instructional expertise make him a valuable educator in the field of data management.

Managing Big Data in Clusters and Cloud Storage

This course includes

19 Hours

Of Self-paced video lessons

Beginner Level

Completion Certificate

awarded on course completion

Free course

Testimonials

Testimonials and success stories are a testament to the quality of this program and its impact on your career and learning journey. Be the first to help others make an informed decision by sharing your review of the course.

Frequently asked questions

Below are some of the most commonly asked questions about this course. We aim to provide clear and concise answers to help you better understand the course content, structure, and any other relevant information. If you have any additional questions or if your question is not listed here, please don't hesitate to reach out to our support team for further assistance.