Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
Big Data Overview:
- Definition and concept of Big Data
- Reasons behind the growing popularity of Big Data
- Real-world Big Data case studies
- Key characteristics of Big Data
- Solutions available for working with Big Data
Hadoop & Its Components:
- Understanding what Hadoop is and identifying its core components
- Hadoop architecture and the types of data it can handle or process
- A brief history of Hadoop, including companies that adopted it and their motivations
- Detailed explanation of the Hadoop framework and its components
- Understanding HDFS and how read-write operations work within the Hadoop Distributed File System
- Procedures for setting up a Hadoop cluster in various modes: Stand-alone, Pseudo-distributed, or Multi-node cluster
(This section covers setting up a Hadoop cluster using VirtualBox, KVM, or VMware, including necessary network configurations, starting Hadoop Daemons, and testing the cluster functionality).
- Understanding the MapReduce framework and its operational mechanics
- Executing MapReduce jobs on a Hadoop cluster
- Comprehending replication, mirroring, and rack awareness within the context of Hadoop clusters
Hadoop Cluster Planning:
- Strategies for planning your Hadoop cluster
- Evaluating hardware and software requirements for effective cluster planning
- Analyzing workloads to plan a cluster that avoids failures and ensures optimal performance
What is MapR and Why Choose MapR:
- Overview of MapR and its architectural design
- Understanding and utilizing MapR Control System, MapR Volumes, snapshots, and mirrors
- Planning a cluster specifically for MapR environments
- Comparing MapR with other distributions and Apache Hadoop
- MapR installation and cluster deployment processes
Cluster Setup & Administration:
- Managing services, nodes, snapshots, mirror volumes, and remote clusters
- Understanding and managing cluster nodes
- Gaining insight into Hadoop components and installing them alongside MapR Services
- Accessing cluster data, including via NFS, and managing services and nodes
- Managing data through volumes, handling users and groups, assigning roles to nodes, and performing node commissioning and decommissioning. This also includes cluster administration, performance monitoring, configuring and analyzing metrics, and administering MapR security
- Understanding and working with M7 Native storage for MapR tables
- Configuring and tuning the cluster for maximum performance
Cluster Upgrade and Integration with Other Setups:
- Upgrading MapR software versions and exploring different upgrade types
- Configuring the MapR cluster to access HDFS clusters
- Deploying a MapR cluster on Amazon Elastic MapReduce
All the topics above include demonstrations and practice sessions, allowing learners to gain hands-on experience with the technology.
Requirements
- Fundamental knowledge of the Linux File System
- Basic Java programming skills
- Familiarity with Apache Hadoop (recommended)
28 Hours
Testimonials (1)
practical things of doing, also theory was served good by Ajay