SQL For Data Science and Data Analysis Training Course
This instructor-led, live training (online or onsite) is aimed at software developers, managers, and business analyst who wish to use big data systems to store and retrieve large amounts of data.
By the end of this training, participants will be able to:
- Query large amounts of data efficiently.
- Understand how Big Data system store and retrieve data
- Use the latest big data systems available
- Wrangle data from data systems into reporting systems
- Learn to write SQL queries in:
- MySQL
- Postgres
- Hive Query Language (HiveQL/HQL)
- Redshift
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Lesson 1 - SQL basics:
- Select statements
- Join types
- Indexes
- Views
- Subqueries
- Union
- Creating tables
- Loading data
- Dumping data
- NoSQL
Lesson 2 - Data Modeling:
- Transaction based ER systems
- Data warehousing
- Data warehouse models
- Star schema
- Snowflake schemas
- Slowly changing dimensions (SCD)
- Structured and non-structured data
- Different table type storage engines:
- Column based
- Document-based
- In Memory
Lesson 3 - Index in the NoSQL/Data science world
- Constraints (Primary)
- Index-based scanning
- performance tuning
Lesson 4 - NoSQL and non-structured data
- When to use NoSQL
- Eventually consistent data
- Schema on read vs. Schema on write
Lesson 5 - SQL for data analytics
- Windowing function
- Lateral Joins
- Lead & Lag
Lesson 6 - HiveQL
- SQL Support
- External and Internal Tables
- Joins
- Partitions
- Correlated subqueries
- Nested queries
- When to use Hive
Lesson 7 - Redshift
- Design and structured
- Locks and shared resources
- Postgres differences
- When to use redshift
Requirements
- An understanding of databases
- Experience with SQL an asset.
Audience
- Business analysts
- Software developers
- Database developers
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €4560 online delivery, based on a group of 2 delegates, €1440 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Need help picking the right course?
opleidingen@nobleprog.com or +31 208 080 666
SQL For Data Science and Data Analysis Training Course - Enquiry
SQL For Data Science and Data Analysis - Consultancy Enquiry
Consultancy Enquiry
Testimonials (3)
Gunnar adjusted the content for the second day based on our feedback from day one. He checked in with us to find out what we liked, disliked, found hard and how we wanted to approach day 2. I liked Gunnar's style of teaching: Lecture, share examples, allowed us time to practice and answer questions before moving to the next subject. It meant we could fully understand a topic before moving onto the next subject. This reduced overload of information and gave us a chance to spend more time on the topics we struggled with and less time on the stuff we found easy.
Ffion - Complete Coherence
Course - SQL For Data Science and Data Analysis
Gunnar’s training technique is dynamic, thorough, and perfectly tailored to individual needs. In our group of five, he ensured everyone stayed on track and fully understood the material throughout the course. The knowledge and resources we gained will undoubtedly be valuable for years to come. Thank you, Gunnar!
Marcia - Complete Coherence
Course - SQL For Data Science and Data Analysis
Gunnar created a great rapport with the audience and was quick to identify our needs. He was engaging and highly knowledgeable throughout and we enjoyed his humour.
Kurt - Complete Coherence
Course - SQL For Data Science and Data Analysis
Provisional Upcoming Courses (Contact Us For More Information)
Related Courses
Advanced Data Analysis with TIBCO Spotfire
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Share visualizations among different team members.
- Secure access to software based on roles and access controls.
- Create visualizations such as map charts.
- Integrate statistical computing languages such as R with Spotfire.
ArcGIS for Spatial Analysis
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at field ecologists and conservation managers who wish to create data spatial projects in ArcGIS.
By the end of this training, participants will be able to:
- Output spatial data as visualizations.
- Conduct geostatics on actual data.
- Implement spatial data analysis, data processing, and mapping with ArcGIS.
- Analyze spatial data for projects in ArcGIS.
ArcMap in ArcGIS
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at intermediate-level environmental scientists and archaeologists who wish to learn how to use ArcMap in ArcGIS for collecting, organizing, managing, and analyzing geographic information.
By the end of this training, participants will be able to:
- Understand the fundamentals of ArcMap and ArcGIS.
- Collect, organize, manage, and analyze geographic information on the social and archaeological elements.
- Conduct spatial queries for impact evaluation.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at beginner-level to advanced-level GIS professionals and analysts who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilize ArcGIS tools for geospatial data management.
- Create and customize maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
ArcGIS Pro for Spatial Analysis
14 HoursBy the end of this training, participants will be able to:
- Understanding GIS concepts and spatial data types using the latest version of ArcGIS Pro.
- Investigate the user interface and evaluate the uses
- Explore ArcGIS* Pro and how to utilize online content
- Manage, manipulate and analyze your own data
- Display and share your data in a meaningful and creative way
Note: We will use the latest version of ArcGIS Pro.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at advanced-level GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modeling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
ArcGIS with Python Scripting
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at GIS analysts who wish to automate repetitive tasks in GIS processes.
By the end of this training, participants will be able to:
- Build GIS applications using Python and ArcGIS tools.
- Develop with the ArcGIS package ArcPy, using Python.
- Apply the ArcGIS modules for map automation using object classes in Python.
Automated Monitoring with Zabbix
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) covers the installation, planning and configuration of Zabbix, and focuses on practical implementation and tooling.
By the end of this training, participants will be able to:
- Install and configure Zabbix for monitoring IT infrastructure.
- Set up and manage hosts, items, triggers, and actions within Zabbix.
- Utilize Zabbix's features for data collection, alerting, and reporting.
- Integrate Zabbix with other tools and platforms for enhanced monitoring and automation.
Insurtech: A Practical Introduction for Managers
14 HoursInsurtech (a.k.a Digital Insurance) refers to the convergence of insurance + new technologies. In the field of Insurtech "digital insurers" apply technology innovations to their business and operating models in order to reduce costs, improve the customer experience and enhance the agility of their operations.
In this instructor-led training, participants will gain an understanding of the technologies, methods and mindset needed to bring about a digital transformation within their organizations and in the industry at large. The training is aimed at managers who need to gain a big picture understanding, break down the hype and jargon, and take the first steps in establishing an Insurtech strategy.
By the end of this training, participants will be able to:
- Discuss Insurtech and all its component parts intelligently and systematically
- Identify and demystify the role of each key technology within Insurtech.
- Draft a general strategy for implementing Insurtech within their organization
Audience
- Insurers
- Technologists within the insurance industry
- Insurance stakeholders
- Consultants and business analysts
Format of the course
- Part lecture, part discussion, exercises and case study group activities
QGIS for Geographic Information System
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
QGIS functions as geographic information system (GIS) software, allowing users to analyze and edit spatial information, in addition to composing and exporting graphical maps. QGIS supports both raster and vector layers; vector data is stored as either point, line, or polygon features. Multiple formats of raster images are supported, and the software can georeference images. To summarize it allows the users to Create, edit, visualise, analyse and publish geospatial information on Windows, Mac, Linux, BSD.
This program, in its first phase, introduces the QGIS interface for general usage. In the second phase, we introduce PyQGIS - the python libraries of QGIS that allows the integration of GIS functionalities in your python code or your python application, so that you may even create your own Python Plugin around a particular GIS functionality.
Introduction to Spotfire
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at business analysts and data analysts who wish to learn basic Spotfire Analyst techniques for analyzing data.
By the end of this training, participants will be able to:
- Install and configure TIBCO Spotfire.
- Combine data from different databases.
- Visualize large datasets.
- Create and share complex dashboards.
AI-Driven Data Analysis with TIBCO Spotfire X
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at business analysts and data analysts who wish to use TIBCO Spotfire X with its artificial intelligence capabilities to visualize, transform, and analyze data.
By the end of this training, participants will be able to:
- Install and configure TIBCO Spotfire X.
- Understand the features and architecture of TIBCO Spotfire X.
- Understand the concepts behind augmented and predictive analytics.
- Learn how to load, process, and visualize data using Spotfire X.
- Create interactive and enhanced data visualizations.
Data Analysis with SQL, Python and Spotfire
14 HoursIn this instructor-led, live training in the Netherlands, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDMS databases; the focus will be on accessing and querying an Oracle database using the SQL language. Then we look at strategies for accessing an RDMS database programmatically using the Python language. Finally, we look at how to visualize and present data graphically using TIBCO Spotfire.
Format of the Course
Interactive lecture and discussion.
Lots of exercises and practice.
Hands-on implementation in a live-lab environment.