Course Outline
Basics Hadoop.
Introduction to Pig.
Basic data analysis using Pig.
Processing complex data with Pig.
Multiple Dataset Operations Using Pig.
Troubleshooting and optimizing Pig.
Introduction to Hive, Impala, ELK.
Querying in Hive, Impala, ELK.
Data management in Hive.
Data storage and performance.
Analyses using tools Hive and Impala.
Working with the tool Impala and ELK.
Analysis of text and complex data types.
Optimization Hive, Pig, Impala, ELK.
Interoperability and workflow.
Questions, tasks, certification.
Requirements
This course is suggested for all data scientists, business analysts, developers and administrators who have experience with SQL and/or scripting languages. No prior knowledge of Apache Hadoop is required before this course.
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €9120 online delivery, based on a group of 2 delegates, €2880 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (5)
The live examples
Ahmet Bolat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
During the exercises, James explained me every step whereever I was getting stuck in more detail. I was completely new to NIFI. He explained the actual purpose of NIFI, even the basics such as open source. He covered every concept of Nifi starting from Beginner Level to Developer Level.
Firdous Hashim Ali - MOD A BLOCK
Course - Apache NiFi for Administrators
Trainer's preparation & organization, and quality of materials provided on github.
Mateusz Rek - MicroStrategy Poland Sp. z o.o.
Course - Impala for Business Intelligence
That I had it in the first place.
Peter Scales - CACI Ltd
Course - Apache NiFi for Developers
practical things of doing, also theory was served good by Ajay