Course Outline
Introduction to Hortonworks Data Platform (HDP)
Overview of Big Data and Apache Hadoop
Installing and Configuring HDP
Setting up, Deploying, and Managing Hadoop Cluster
Understanding and ConfiguringYARN and MapReduce
Overview of Job Scheduling
Ensuring Data Integrity
Understanding Enterprise Data Movement
Using HDFS Commands & Services
Transferring Data Using Flume
Working with Hive
Scheduling Workflow Using Oozie
Exploring Hadoop 2.x
Understanding Hbase Architecture
Monitoring HDP2 Services Using Ambari
New Features in HDP
Troubleshooting
Summary and Next Steps
Requirements
- An understanding of Hadoop and big data
- An understanding of Spark
- Familiarity with the command line
- System administration experience
Audience
- Hadoop administrators
Testimonials (5)
A lot of practical examples, different ways to approach the same problem, and sometimes not so obvious tricks how to improve the current solution
Rafal - Nordea
Course - Apache Spark MLlib
very interactive...
Richard Langford
Course - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
Course - A Practical Introduction to Stream Processing
practice tasks
Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo
Course - Python and Spark for Big Data (PySpark)
The VM I liked very much The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly I liked the facility in Dubai.