Hadoop Training 2 : Deep Dive In HDFS (What is Hadoop ?) | What is HDFS ? | What is Hive ?

114,008 views

Published on Jun 17, 2013

By http://www.HadoopExam.com

The full Hadoop training costs just $69 / 3500 INR. Visit : www.HadoopExam.com

Download the full training brochure from : http://hadoopexam.com/BigData_Hadoop_...

The Big Data and Hadoop trainings are used by learners from the US, UK, Europe, Spain, Germany, Singapore, Malaysia, Egypt, Saudi Arabia, Turkey, Dubai, India, Chicago, MA, etc.

The Hadoop Interview Questions PDF is available at:
http://HadoopExam.com/Hadoop_Intervie...

Module 1 : Introduction to BigData, Hadoop (HDFS and MapReduce) : Available (Length 35 Minutes)
1. BigData Introduction
2. Hadoop Introduction
3. HDFS Introduction
4. MapReduce Introduction

Video URL : http://www.youtube.com/watch?v=R-qjyE...

Module 2 : Deep Dive in HDFS : Available (Length 48 Minutes)


1. HDFS Design
2. Fundamentals of HDFS (Blocks, NameNode, DataNode, Secondary NameNode)
3. Rack Awareness
4. Read/Write from HDFS
5. HDFS Federation and High Availability
6. Parallel Copying using DistCp
7. HDFS Command Line Interface
Video URL : http://www.youtube.com/watch?v=PK6Im7...

Module 3 : Understanding MapReduce
1. JobTracker and TaskTracker
2. Hadoop Cluster Topology
3. Example of MapReduce
- Map Function
- Reduce Function
4. Java Implementation of MapReduce
5. DataFlow of MapReduce
6. Use of Combiner
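
As a rough illustration of the map function, reduce function, and combiner listed in Module 3, the sketch below counts words in plain Python. It is not taken from the training and does not use Hadoop. The names `map_fn`, `reduce_fn`, `shuffle`, and `run_job` are hypothetical, and the in-memory `shuffle` only stands in for the framework's shuffle-and-sort step.

```python
from collections import defaultdict
from itertools import chain


def map_fn(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def reduce_fn(word, counts):
    """Reduce: sum all counts that were grouped under one word."""
    return word, sum(counts)


def shuffle(pairs):
    """Group values by key (a stand-in for Hadoop's shuffle and sort)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def run_job(splits, use_combiner=True):
    """Run map, an optional per-split combine, shuffle, then reduce."""
    intermediate = []
    for split in splits:
        pairs = list(chain.from_iterable(map_fn(line) for line in split))
        if use_combiner:
            # Combiner: apply the reduce logic locally to shrink map output.
            pairs = [reduce_fn(k, v) for k, v in shuffle(pairs).items()]
        intermediate.extend(pairs)
    return dict(reduce_fn(k, v) for k, v in shuffle(intermediate).items())


if __name__ == "__main__":
    # Each inner list plays the role of one input split (hypothetical data).
    splits = [["the cat sat", "the dog"], ["the end"]]
    print(run_job(splits))
```

Because summing is associative and commutative, the same reduce logic can serve as the combiner, so `run_job(splits, use_combiner=False)` returns the same counts.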

Video URL : Watch Private Video

Module 4 : MapReduce Internals -1 (In Detail) : Available (Length 57 Minutes)

1. How MapReduce Works
2. Anatomy of MapReduce Job (MR-1)
3. Submission & Initialization of MapReduce Job (What Happens ?)
4. Assigning & Execution of Tasks
5. Monitoring & Progress of MapReduce Job
6. Completion of Job
7. Failure Handling of MapReduce Job
- Task Failure
- TaskTracker Failure
- JobTracker Failure

Video URL : Watch Private Video

Module 5 : MapReduce-2 (YARN : Yet Another Resource Negotiator) : Available (Length 52 Minutes)


1. Limitations of the Current Architecture (Classic)
2. What are the Requirements ?
3. YARN Architecture
4. Job Submission and Job Initialization
5. Task Assignment and Task Execution
6. Progress and Monitoring of the Job
7. Failure Handling in YARN
- Task Failure
- Application Master Failure
- Node Manager Failure
- Resource Manager Failure

Video URL : Watch Private Video

Module 6 : Advanced Topic for MapReduce (Performance and Optimization) : Available (Length 58 Minutes)

1. Job Scheduling
2. In Depth Shuffle and Sorting
3. Speculative Execution
4. Output Committers
5. JVM Reuse in MR1
6. Configuration and Performance Tuning

Video URL : Watch Private Video

Module 7 : Advanced MapReduce Algorithm : Available

File Based Data Structure
- Sequence File
- MapFile
Default Sorting In MapReduce
- Data Filtering (Map-only jobs)
- Partial Sorting
Data Lookup Strategies
- In MapFiles
Sorting Algorithm
- Total Sort (Globally Sorted Data)
- InputSampler
- Secondary Sort

Video URL : Watch Private Video

Module 8 : Advanced MapReduce Algorithm -2 : Available

1. MapReduce Joining
- Reduce Side Join
- MapSide Join
- Semi Join
2. MapReduce Job Chaining
- MapReduce Sequence Chaining
- MapReduce Complex Chaining

Module 9 : Features of MapReduce : Available

MapReduce Counters
Data Distribution Using Job Configuration and Distributed Cache

Module 11 : Apache Pig : Available (Length 52 Minutes)

1. What is Pig ?
2. Introduction to Pig Data Flow Engine
3. Pig and MapReduce in Detail
4. When Should Pig Be Used ?
5. Pig and Hadoop Cluster


Video URL : Watch Private Video

Module 12 : Fundamental of Apache Hive Part-1 : Available (Length 60 Minutes)

1. What is Hive ?
2. Architecture of Hive
3. Hive Services
4. Hive Clients
5. How Hive Differs from Traditional RDBMS
6. Introduction to HiveQL
7. Data Types and File Formats in Hive
8. File Encoding
9. Common problems while working with Hive

Module 13 : Apache Hive : Available (Length 73 Minutes)
1. HiveQL
2. Managed and External Tables
3. Understand Storage Formats
4. Querying Data
- Sorting and Aggregation
- MapReduce In Query
- Joins, SubQueries and Views
5. Writing User Defined Functions (UDFs)

Module 14 : Single Node Hadoop Cluster Set Up In Amazon Cloud : Available (Length 60 Minutes Hands On Practice Session)
1. How to create an instance on Amazon EC2
2. How to connect to that instance using PuTTY
3. Installing the Hadoop framework on this instance
4. Running the sample wordcount example that comes with the Hadoop framework
In 30 minutes you can create a single-node Hadoop cluster in the Amazon cloud. Does that interest you?


Module 15 : Hands On : Implementation of NGram algorithm : Available (Length 48 Minutes Hands On Practice Session)
1. Understand the NGram concept using Google Books NGram
2. Step-by-step process for creating and configuring Eclipse for writing MapReduce code
3. Deploying the NGram application on Hadoop installed in Amazon EC2
4. Analyzing the results of running the NGram application (UniGram, BiGram, TriGram, etc.)
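
For readers unfamiliar with the NGram concept named in Module 15, here is a minimal sketch in plain Python. It is not the training's MapReduce implementation. The helper names `ngrams` and `count_ngrams` are hypothetical, and whitespace tokenization is an assumption made to keep the example short.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_ngrams(text, n):
    """Count n-grams in text after lowercasing and splitting on whitespace."""
    tokens = text.lower().split()
    return Counter(ngrams(tokens, n))


if __name__ == "__main__":
    sentence = "to be or not to be"  # hypothetical sample input
    for n, label in [(1, "UniGram"), (2, "BiGram"), (3, "TriGram")]:
        print(label, count_ngrams(sentence, n).most_common(2))
```

In a MapReduce setting, `ngrams` would roughly correspond to what a mapper emits per line, and the `Counter` to the summing done by the reducer.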

Hadoop Learning Resources
Phone : 022-42669636
Mobile : +91-8879712614
www.HadoopExam.com
