Cluster Analysis with R and SAS Training Course
R is a programming language and software environment for statistical computing. SAS is a statistical software platform for predictive analysis, data management, advanced analytics, and more. With R in SAS, users can find natural groups of data for cluster analysis that are essential to data mining.
This instructor-led, live training (online or onsite) is aimed at data analysts who wish to program with R in SAS for cluster analysis.
By the end of this training, participants will be able to:
- Use cluster analysis for data mining
- Master R syntax for clustering solutions.
- Implement hierarchical and non-hierarchical clustering.
- Make data-driven decisions to help to improve business operations.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
Cluster Analysis
- What is cluster analysis?
- Types of cluster types
Cluster Analysis Continued
- Cluster analysis vs object segmentation
- Hierarchical vs non-hierarchical clustering
Preparing the Development Environment
- Installing and configuring SAS
- Installing and configuring R
Cluster Analysis with SAS
- Importing data
- Standardizing data
- Implementing hierarchical clustering
- Interpretting output
- Working with K means clustering for non-hierarchical
- Interpretting output
Cluster Analysis with R
- Using hierarchical clustering functions
- Working with non-hierarchical clustering functions
Summary and Conclusion
Requirements
- Experience with R programming
- SAS experience
Audience
- Data Analysts
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €4560 online delivery, based on a group of 2 delegates, €1440 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Need help picking the right course?
opleidingen@nobleprog.com or +31 208 080 666
Cluster Analysis with R and SAS Training Course - Enquiry
Cluster Analysis with R and SAS - Consultancy Enquiry
Consultancy Enquiry
Testimonials (5)
how the trainor shows his knowledge in the subject he's teachign
john ernesto ii fernandez - Philippine AXA Life Insurance Corporation
Course - Data Vault: Building a Scalable Data Warehouse
Open discussion with trainer
Tomek Danowski - GE Medical Systems Polska Sp. Z O.O.
Course - Process Mining
I genuinely enjoyed the hands passed exercises.
Yunfa Zhu - Environmental and Climate Change Canada
Course - Foundation R
The pace was just right and the relaxed atmosphere made candidates feel at ease to ask questions.
Rhian Hughes - Public Health Wales NHS Trust
Course - Introduction to Data Visualization with Tidyverse and R
Richard's training style kept it interesting, the real world examples used helped to drive the concepts home.
Jamie Martin-Royle - NBrown Group
Course - From Data to Decision with Big Data and Predictive Analytics
Provisional Upcoming Courses (Contact Us For More Information)
Related Courses
From Data to Decision with Big Data and Predictive Analytics
21 HoursAudience
If you try to make sense out of the data you have access to or want to analyse unstructured data available on the net (like Twitter, Linked in, etc...) this course is for you.
It is mostly aimed at decision makers and people who need to choose what data is worth collecting and what is worth analyzing.
It is not aimed at people configuring the solution, those people will benefit from the big picture though.
Delivery Mode
During the course delegates will be presented with working examples of mostly open source technologies.
Short lectures will be followed by presentation and simple exercises by the participants
Content and Software used
All software used is updated each time the course is run, so we check the newest versions possible.
It covers the process from obtaining, formatting, processing and analysing the data, to explain how to automate decision making process with machine learning.
Data Mining and Analysis
28 HoursObjective:
Delegates be able to analyse big data sets, extract patterns, choose the right variable impacting the results so that a new model is forecasted with predictive results.
Data Mining
21 HoursCourse can be provided with any tools, including free open-source data mining software and applications
Data Mining with R
14 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has a wide variety of packages for data mining.
Data Vault: Building a Scalable Data Warehouse
28 HoursIn this instructor-led, live training in the Netherlands, participants will learn how to build a Data Vault.
By the end of this training, participants will be able to:
- Understand the architecture and design concepts behind Data Vault 2.0, and its interaction with Big Data, NoSQL and AI.
- Use data vaulting techniques to enable auditing, tracing, and inspection of historical data in a data warehouse.
- Develop a consistent and repeatable ETL (Extract, Transform, Load) process.
- Build and deploy highly scalable and repeatable warehouses.
Data Visualization
28 HoursThis course is intended for engineers and decision makers working in data mining and knoweldge discovery.
You will learn how to create effective plots and ways to present and represent your data in a way that will appeal to the decision makers and help them to understand hidden information.
Data Mining & Machine Learning with R
14 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has a wide variety of packages for data mining.
Data Science for Big Data Analytics
35 HoursBig data is data sets that are so voluminous and complex that traditional data processing application software are inadequate to deal with them. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer, visualization, querying, updating and information privacy.
Foundation R
7 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at beginner-level professionals who wish to gain a mastery of the fundamentals of R and how to work with data.
By the end of this training, participants will be able to:
- Understand the R programming environment and RStudio interface.
- Import, manipulate, and explore datasets using R commands and packages.
- Perform basic statistical analysis and data summarization.
- Generate visualizations using both base R and ggplot2.
- Manage workspaces, scripts, and packages effectively.
Oracle SQL Intermediate - Data Extraction
14 HoursThe objective of the course is to enable participants to gain a mastery of how to work with the SQL language in Oracle database for data extraction at intermediate level.
Process Mining
21 HoursProcess mining, or Automated Business Process Discovery (ABPD), is a technique that applies algorithms to event logs for the purpose of analyzing business processes. Process mining goes beyond data storage and data analysis; it bridges data with processes and provides insights into the trends and patterns that affect process efficiency.
Format of the Course
- The course starts with an overview of the most commonly used techniques for process mining. We discuss the various process discovery algorithms and tools used for discovering and modeling processes based on raw event data. Real-life case studies are examined and data sets are analyzed using the ProM open-source framework.
Introductory R for Biologists
28 HoursR is an open-source free programming language for statistical computing, data analysis, and graphics. R is used by a growing number of managers and data analysts inside corporations and academia. R has also found followers among statisticians, engineers and scientists without computer programming skills who find it easy to use. Its popularity is due to the increasing use of data mining for various goals such as set ad prices, find new drugs more quickly or fine-tune financial models. R has a wide variety of packages for data mining.
SAS Programming
14 HoursThis instructor-led, live training in the Netherlands (online or onsite) is aimed at data analysts who wish to program in SAS for advanced data management, data analysis, and more.
By the end of this training, participants will be able to:
- Master macros to write efficient SAS programs.
- Train a model and make predictions on unseen data with predictive modeling.
- Create charts and plots for data visualization.
Statistics with SPSS Predictive Analytics Software
14 HoursGoal:
Learning to work with SPSS at the level of independence
The addressees:
Analysts, researchers, scientists, students and all those who want to acquire the ability to use SPSS package and learn popular data mining techniques.
Introduction to Data Visualization with Tidyverse and R
7 HoursThe Tidyverse is a collection of versatile R packages for cleaning, processing, modeling, and visualizing data. Some of the packages included are: ggplot2, dplyr, tidyr, readr, purrr, and tibble.
In this instructor-led, live training, participants will learn how to manipulate and visualize data using the tools included in the Tidyverse.
By the end of this training, participants will be able to:
- Perform data analysis and create appealing visualizations
- Draw useful conclusions from various datasets of sample data
- Filter, sort and summarize data to answer exploratory questions
- Turn processed data into informative line plots, bar plots, histograms
- Import and filter data from diverse data sources, including Excel, CSV, and SPSS files
Audience
- Beginners to the R language
- Beginners to data analysis and data visualization
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice