Our Big Data and Hadoop training program is designed to ensure that you gain expertise in HDFS, Yarn, MapReduce, HBase, Oozie, Flume and Sqoop. The lectures, real-time use cases and hands-on exercises will make it easy for you to manage Hadoop 2.7 environment and perform data analytics using Pig and Hive. By the end of the course, you’ll gain confidence to build powerful data processing applications using Hadoop.
By the end of this program participants will have learnt to:
This course is designed for:
Introduction to Big Data & Hadoop What is Big Data Learn about the history and rise of Big Data Why did Big Data suddenly become so prominent Limitations of traditional large scale systems Who are the main vendors in the space - Cloudera - Hortonworks Introduction to Hadoop History of Hadoop Companies using Hadoop |
Hadoop Architecture / Introduction to HDFS Understanding Hadoop Master-Slave Architecture Understanding HDFS and MapReduce framework Regular file system vs HDFS Learn about NameNode, DataNode, Secondary Node Learn about JobTracker, TaskTracker Understand how data is written and read from HDFS |
Installing and setting up a Hadoop Cluster Understand the important configuration files in a Hadoop Cluster Deploy the Cloudera Hadoop distribution in a VM player Run HDFS and Linux commands Execute some examples to get a high level understanding Hadoop deployment - Single node, Multinode Learn how to setup and deploy a multinode Hadoop Cluster on AWS |
Understanding Hadoop MapReduce Framework Overview of the MapReduce Framework Understand the concept of Mappers, Reducers, Partitioners, Combiners Understand different Input Formats Understand different Output Formats Custom Data Types Writing MapReduce Mappers, and Reducers in Java using Eclipse Using writable interface JUnit and MRUnit Testing Frameworks Writing and running unit test |
PIG Introduction to PIG Setting up and running PIG Grunt Pig Latin Writing PIG Latin scripts |
Cloudera Impala Introduction to Impala Installing and using impala Create table using Impala Query the Impala table Impala SQL language reference Impala shell commands |
Hive and HiveQL Understand the Hive architecture Why need for another data warehousing system Installing, congifuring and running Hive HiveQL - Importing data, sorting and aggregating, joins, map joins Writing join queries and inserting data back into Hive Understand how queries are converted into MapReduce jobs Hive Tables and storage formats UDF and UDAF Choosing between PIG, Hive and Impala |
Zookeeper Overview of Zookeeper Uses of Zookeeper Zookeeper Service Zookeeper Data Model Building applications with Zookeeper |
Sqoop Overview of Sqoop Where is Sqoop used - import/export structured data Using Sqoop to import data from RDBMS into HDFS Using Sqoop to import data from RDBMS into Hive Using Sqoop to import data from RDBMS into HBase Using Sqoop to export data from HDFS into RDMBS Sqoop connectors |
Flume Overview of Flume Where is Flume used - import/export unstructured data Using Flume to load data into HDFS Using Flume to load data into HBase Using Flume to load data into Hive |
HBase Introduction to HBase Why use HBase HBase Architecture - read and write paths HBase vs RDBMS Installing and Configuration Schema design in HBase - column families, hotspotting Accessing data with HBase API - Reading, Adding, Updating data from the shell, JAVA API SCAN and Advanced API Using Zookeeper with HBase |
Cassandra and MongoDB Introduction to NoSQL database Advantage of NoSQL vs traditional RDBMS Introduction to Apache Cassandra Overview of Cassandra - data model, reading/writing data, CQL Introduction to MongoDB MongoDB vs Cassandra Introduction to Mahout |
Apache Oozie Introduction to Oozie Oozie workflow jobs Oozie coordinator jobs Creating Oozie Workflows Using HUE UI for Oozie Using CLI to run and track workflows |
Hadoop 2.0, YARN, MRv2 Understand new features in Hadoop 2.0 Learn advanced Hadoop concepts Introduction to YARN YARN architecture Upgrading MRv1 to MRv2 Developing application using MapReduce version 2 |
All our instructors are working professionals and experts in Big Data and Hadoop Development. They have real world experience in Big Data and Hadoop.
All our instructors are working professionals and experts in Big Data and Hadoop Development. They have real world experience in Big Data and Hadoop.
Yes, towards the end of the training, you will get a project to complete. Once you submit the project, the instructor will validate the project and then you will get the course completion certificate. This project will help you in understanding how the different components are related to each other and how is the data flow between different components.
Your system should have 4GB RAM, a processor better than core 2 duo. In case, your system falls short of these requirements, we can provide you remote access to our Hadoop Cluster.
Absolutely yes! One can always use Windows to work on Hadoop. You need to install Oracle Virtual Box on your Windows machine and then you can import our Virtual Machine in it, which we will provide you.
Yes, our Virtual Machine can be installed on Mac machine also.
1 Mbps of internet speed is preferable to attend the LIVE classes. However, we have seen people attending the classes from a much slower internet speed.
Once you join the course, you will get lifetime support. Even after the course completion, you can get back to the support team for any queries that you may have.
Yes, we provide our own Certification. At the end of your course, you will work on a real time Project. You will receive a Problem Statement along with a data-set to work. Once you are successfully through the project (Reviewed by an Expert), you will be awarded a certificate with a performance-based grading.
Hadoop is one of the hottest career options available today for Software Engineers. There are around 12,000 jobs currently in U.S. alone for Hadoop Developers and demand for Hadoop Developers is far more than the availability.
Excellent training by Signup Training trainer. Thank you.
Signup Training has amazing customer service and I really appreciated the continuous support before and after training. The prep tools are also excellent. Excellent overall.
Great training! Keep me updated on other training programs you offer, would love to attend. Thanks.
Course was good and I really appreciate all that I've learned. Thank you.
The training program was organized in an effective manner by Signup Training.
Overall good. Thanks.
Appreciated all the insight provided by the instructor and the additional material suggestions to help pass the certification.
Very good training. Excellent trainer and good follow up.
Trainer made it easy for us to follow. Training was organized in a professional manner. Excellent overall!!
Instructor was great. Very knowledgeable and thorough. Taking the practice tests and then walking through each question was very helpful in assimilating the theoretical knowledge we learned from the PMBOK.
Signup Training's coach was excellent! Professional and helpful. He offered suggestions on a game plan for studying for the PMP exam and passing the first time!
Excellent training overall
Best trainer!! Great communication and easy to follow. Great overall!!
Signup Training online training format was good. I enjoyed learning in this format.
Great course. Was well organized and easy to follow the training instructions.
The training was a great experience overall and I feel comfortable now going into the exam .
The course was well taught. I appreciated the test preparation tips. having the recorded classes have made test preparation easier.
Pro. Rodney was Great!, Support Team like "Jenny Thomas" was amazing, she sent me all the info asap, and even called, I'd like to take the future courses from Signup Training.
Signup Training's instructor is very professional and courteous. In my opinion, he goes over and above the typical instructor, with a strong focus on what to expect.
It was a good experience overall.
For me I have found that a short break every hour helps me learn better. Our instructor was a trooper - for teaching 4 days straight, otherwise it was a good experience.
Great training.
Good instructor. There were a few things that were rushed by but talking to the instructor during breaks helped fill in the areas rushed over. Great overall experience.
Great instructor, good follow up. May take up future courses.
Good instructor, easy to follow instructions. Great overall.
Great course. Excellent overall.
Great Instructor! Very interactive and easy to understand.
Really happy with the course. Thank you.
I wish we would have been able to have a hard copy of the slides to take notes as we took the class. I understand that the items are in the book, but it would've been nice to have to make notes on. With such an intense course, every little bit will help. Excellent overall though.
Instructor was great. Excellent overall.
I honestly think with the test changing next month, and trying to complete the application, and study this might be a little to much to accomplish. Good training.