
Analyzing Big Data with Microsoft R

About This Course

Course Code
M20773

Course Type
Specialist

Vendor
Microsoft

Duration
3 Days

RRP
£1,656.00

Special Notices

Please note: Attend from Anywhere customers require an additional screen. The screen must be at least 19 inches in size with a minimum resolution of 1280x1024; the vertical resolution of 1024 is the most critical requirement.

Overview

The main purpose of this course is to enable students to use Microsoft R Server to create and run analyses on large datasets, and to show how to use it in Big Data environments such as a Hadoop or Spark cluster or a SQL Server database.

Target Audience

Objectives

After completing this course, students will be able to:

Course Outline

Module 1: Microsoft R Server and R Client

Explain how Microsoft R Server and Microsoft R Client work.

Lessons


Lab : Exploring Microsoft R Server and Microsoft R Client


Module 2: Exploring Big Data

At the end of this module, students will be able to use R Client with R Server to explore big data held in different data stores.

Lessons


Lab : Exploring Big Data


Module 3: Visualizing Big Data

Explain how to visualize data by using graphs and plots.

Lessons


Lab : Visualizing data


Module 4: Processing Big Data

Explain how to transform and clean big data sets.

Lessons


Lab : Processing big data


Module 5: Parallelizing Analysis Operations

Explain how to implement options for splitting analysis jobs into parallel tasks.

Lessons


Lab : Using rxExec and RevoPemaR to parallelize operations


Module 6: Creating and Evaluating Regression Models

Explain how to build and evaluate regression models generated from big data.

Lessons


Lab : Creating a linear regression model


Module 7: Creating and Evaluating Partitioning Models

Explain how to create and score partitioning models generated from big data.

Lessons


Lab : Creating and evaluating partitioning models


Module 8: Processing Big Data in SQL Server and Hadoop

Explain how to transform and clean big data sets.

Lessons


Lab : Processing big data in SQL Server and Hadoop

Prerequisites

In addition to their professional experience, students who attend this course should have:


It is recommended that delegates review this self-paced content to gain an introduction to the R language:

https://www.edx.org/course/introduction-r-data-science-microsoft-dat204x-5
