Course Curriculum
Course Curriculum for Hadoop Developer [Hadoop 2.x]
Introduction to Big Data and Hadoop
- Big Data Introduction
- Hadoop Introduction
- What is Hadoop? Why Hadoop?
- Hadoop History
- Components of Hadoop
Deep Dive into HDFS (Storing the Data)
- Introduction to HDFS
- HDFS Design
- Role of HDFS in Hadoop
- Features of HDFS
- Hadoop Daemons and Their Functions
- NameNode
- Secondary NameNode
- JobTracker
- DataNode
- TaskTracker
- YARN Processing
- Network Topology
- Nodes
- Racks
- Data Center
- How to Store Data in HDFS
- How to Read Data from HDFS
MapReduce using Java (Processing the Data)
- Introduction to MapReduce
- MapReduce Architecture
- Data flow in MapReduce
- Splits
- Mapper
- Partitioning
- Sort and shuffle
- Combiner
- Reducer
- MapReduce life cycle
- How MapReduce Works
- Writing and Executing the Basic MapReduce Program using Java
- Submission and Initialization of a MapReduce Job
- File Input/Output Formats in MapReduce Jobs
- TextInputFormat
- KeyValueTextInputFormat
- SequenceFileInputFormat
- NLineInputFormat
- Example programming
- MapReduce Program with a Partitioner
PIG
- Introduction to Apache PIG
- Introduction to PIG Data Flow Engine
- MapReduce vs PIG in detail
- When should PIG be used?
- Data Types in PIG
- Basic PIG programming
- Modes of Execution in PIG
- Operators/Transformations in PIG
- PIG UDFs with an Example Program
- Word Count Example in PIG
- Differences between MapReduce and PIG
SQOOP
- Introduction to SQOOP
- Use of SQOOP
- Connecting to a MySQL Database
- SQOOP commands
- Import
- Detailed scenarios on importing data / performance tuning
- Import options such as -m, --split-by, and --boundary-query
- Export
- Eval
- SQOOP Job execution
HIVE
- Introduction to HIVE
- HIVE Metastore
- HIVE Architecture
- Tables in HIVE
- Managed Tables
- External Tables
- Hive Functions
- Hive Data Types
- Primitive Types
- Complex Types
- Partition
- Joins in HIVE
- HIVE UDFs
HBASE
- Introduction to HBASE
- Fundamentals of HBASE
- What is NoSQL?
- HBASE Data Model
- Table and Row
- Column Family and Column Qualifier
- Cell and its Versioning
- Categories of NoSQL Databases
- Key-Value Database
- Document Database
- Column Family Database
- HBASE Architecture
- HMaster
- Region Servers
- Regions
- MemStore
- Store
- SQL vs NoSQL
- How HBASE differs from an RDBMS
- HDFS vs HBASE
- Client-side buffering or bulk uploads
- Designing HBASE Tables
- Get
- Scan
- Put
- Delete
Zookeeper
- Introduction to Zookeeper
- Data Model
- Operations
OOZIE
- Introduction to OOZIE
- Use of OOZIE
- Where to use OOZIE?
- OOZIE workflows
- OOZIE coordinators
- OOZIE jobs
Flume
- Introduction to Flume
- Uses of Flume
- Flume Architecture
- Flume Master
- Flume Collectors
- Flume Agents
Kafka
- Introduction to Kafka
- Creating a Kafka topic
- Producers & consumers in Kafka
- Example