Course Outline
Introduction
- Apache Arrow vs Parquet
Installing and Configuring Apache Arrow
Overview of Apache Arrow Features and Architecture
Exploring Data with Pandas and Apache Arrow
Exploring Data with Spark and Apache Arrow
Exploring Data with R and Apache Arrow
Exploring Data with MapD and Apache Arrow
Other Data Analysis Integrations
- PySpark, Parquet files on S3, and Oracle tables and Elasticsearch indices
Troubleshooting
Summary and Conclusion
Requirements
- A basic undersanding of SQL
- Familiarity with Python or R
- Some familiarity with Apache Spark
Testimonials (5)
The trainer adapted the materials and contents to what he thought would be best for us and he succeeded. The quality of the training was excellent.
Jorge Sanchez Hernandez - CSMART - Carnival
Course - QGIS for Geographic Information System
Dużo cierpliwości
Mateusz - WestWind Energy Polska Sp. z o.o.
Course - ArcGIS for Spatial Analysis
Professional and very practical, usuefull in a daily work
Jozefin Rékasi - SC Automobile Dacia SA
Course - Advanced Data Analysis with TIBCO Spotfire
It covered the areas i said i was interested in before the course: data relationships, using python script. Connecting to databases will be covered in the advanced module.
Cristian Tudose - SC Automobile Dacia SA
Course - Introduction to Spotfire
I genuinely enjoyed the lots of labs and practices.