Skip to content

Pages

Courses

ALL COURSES

Resources

ALL RESOURCES

Blogs

ALL BLOGS
Use coupon code LEARN21 at checkout for 21% off any course. Valid through 10/31/21.
*Offer excludes Scaled Agile (SAFe®) courses, CSP-SM and private team training.

Big Data Fundamentals

This course is a survey of big data – the landscape, the technology behind it, business drivers and strategic possibilities. “Big data” is a hot buzzword, but most organizations are struggling to put it to practical use. Without assuming any prior knowledge of Apache Hadoop or big data management, this course teaches a wide range of professional roles how to tap and manage the potential benefits of big data, including:

  • Discovering customer insights buried in your existing data
  • Uncovering product opportunities from data insights
  • Pinpointing decision points and criteria
  • Scaling your existing workflows and operations
  • Learning to ask questions that drive tangible business value from Big Data tools 
Available formats for this course
In-Person
Live Online
Private Team Training
Duration
2 days/16 hours of instruction
Public Classroom Pricing

Starting at: $1295(USD)

$1650(CAD)

GSA Price: $1185

Group Rate: $1195

Get the full details on this course. Download the .PDF Brochure below:

Part 1: Introduction to Big Data

  1. Academic
  2. Early web
  3. Web-scale
    • 1994 – 2012
    • 2016
    • 2020

Part 2: Sources (Examples)

  1. Internet
  2. Transport systems
  3. Medical, healthcare
  4. Insurance
  5. Military and others

Part 3: Hadoop – the free platform for working with big data

  1. History
  2. Yahoo
  3. Platform fragmentation
  4. What usage looks like in the enterprise

Part 4: The concepts

  1. Load data how you find it
  2. Process it when you can
  3. Project it into various schemas on the fly
  4. Push it back to where you need it

Part 5: The basics

  1. What it’s good for
  2. What can’t it do / disadvantages
  3. Most common use cases for big data

Part 6: Introduction to HDFS

  1. Robustness
  2. Data Replication
  3. Gotchas

Part 7: MapReduce – the core big data function

  1. Map explained
  2. Sort and shuffle explained
  3. Reduce explained

Demonstration: Hadoop, HDFS, and MapReduce – Let’s try it!

Part 8: YARN

  1. How it fits
  2. How it works
  3. Resource Manager
  4. Application Master

Part 9: PIG

  1. What it is
  2. How it works
  3. Compatibilities
  4. Advantages
  5. Disadvantages

Demonstration: YARN and PIG – Let’s try it!

Part 10: Processing Data

  1. The Piggy Bank
  2. Loading and Illustrating the data
  3. Writing a Query
  4. Storing the Result

Part 11: HIVE

  1. Data warehousing
  2. What it is, what it’s not
  3. Language compatibilities
  4. Advantages

Demonstration: HIVE – Let’s try it!

Example demo walkthrough: Contextual advertising

Part 12: OOZIE

  1. What it is
  2. Complex workflow environments
  3. Reducing time-to-market
  4. Frequency execution
  5. How it works with other big data tools

Example demo walkthrough: How to run a job

Part 13: FLUME – stream, collect, store and analyze high-volume log data

  1. How it works: Event, source, sink, channel, agent and client
  2. How it works illustrated
  3. How it works demonstrated

Part 14: SPARK

  1. Move over 2012 Big Data tools: Apache SPARK is the new power tool
  2. The new open source cluster framework
  3. When SPARK performs 100 times faster
  4. Performance comparison of Spark and Hadoop
  5. What else can it do?

Part 15: HBASE

  1. What it is
  2. Common use cases

Part 16: Using External Tools

This class is for anyone involved in project, product, or IT work who is actively consuming or considering big data services. No specific technical experience or prerequisites are needed. 
•    Software Engineers and Team Leads
•    Project Managers
•    Business Analysts
•    DBAs and Data Engineering teams
•    Business Customers
•    System Analysts

  • Navigate the technology stacks and tools used to work with big data
  • Establish a common vocabulary on your teams for applying big data practices
  • Get an overview of how big data technologies work: Apache Hadoop, Spark, Pig, Hive, Sqoop, OOZIE, and FLUME
  • Design both functional and non-functional requirements for working with big data
  • Understand common business cases for big data
  • Differentiate between hype and what’s truly possible
  • Look at examples of real-world big data use cases
  • Select initiatives and projects that have high potential to benefit from big data applications
  • Understand what type of staffing, technical skills, and training is required for projects that incorporate or focus on big data

Big Data Fundamentals Schedule

Location
Date
Register
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000VNV3AQAX', 'startDate' => '2021-11-16', 'startTime' => '10:30 AM', 'endDate' => '2021-11-17', 'endTime' => '6:30 PM', 'name' => '571006VCL02', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Nov 16th - 17th
10:30 AM - 6:30 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000VNV45QAH', 'startDate' => '2021-12-07', 'startTime' => '8:30 AM', 'endDate' => '2021-12-08', 'endTime' => '4:30 PM', 'name' => '571006VCL03', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Dec 7th - 8th
8:30 AM - 4:30 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8kQAD', 'startDate' => '2022-02-24', 'startTime' => '8:30 AM', 'endDate' => '2022-02-25', 'endTime' => '4:30 PM', 'name' => '571007VCL', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Feb 24th - 25th
8:30 AM - 4:30 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8lQAD', 'startDate' => '2022-04-12', 'startTime' => '10:00 AM', 'endDate' => '2022-04-13', 'endTime' => '6:00 PM', 'name' => '571008VCL', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Apr 12th - 13th
10:00 AM - 6:00 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8mQAD', 'startDate' => '2022-06-07', 'startTime' => '12:00 PM', 'endDate' => '2022-06-08', 'endTime' => '8:00 PM', 'name' => '571008VCL02', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Jun 7th - 8th
12:00 PM - 8:00 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8nQAD', 'startDate' => '2022-08-15', 'startTime' => '8:30 AM', 'endDate' => '2022-08-16', 'endTime' => '4:30 PM', 'name' => '571009VCL', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Aug 15th - 16th
8:30 AM - 4:30 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8oQAD', 'startDate' => '2022-10-11', 'startTime' => '10:00 AM', 'endDate' => '2022-10-12', 'endTime' => '6:00 PM', 'name' => '571010VCL', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Oct 11th - 12th
10:00 AM - 6:00 PM ET
$1295(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000ZIs8pQAD', 'startDate' => '2022-12-05', 'startTime' => '8:30 AM', 'endDate' => '2022-12-06', 'endTime' => '4:30 PM', 'name' => '571010VCL02', 'standardCourseFee' => '1295', 'standardCourseFeeCAD' => '1650', 'standardCourseFeeEUR' => '1175', 'courseId' => 'BIGDATAFUNDS', 'course' => 'a00370000029uiMAAQ', 'courseSfId' => 'a00370000029uiMAAQ', 'courseName' => 'Big Data Fundamentals', 'instructorId' => '85388', 'instructorSfId' => 'a0237000001Y4RDAA0', 'instructorDisplayName' => 'TBD', 'instructorName' => 'TBD', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso3oQAB', 'subjectName' => 'Data & Analytics', 'specialitySfId' => 'a051G00000Jso43QAB', 'specialityName' => 'Big Data', 'expertiseSfId' => 'a051G00000JsoC4QAJ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '0', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Dec 5th - 6th
8:30 AM - 4:30 PM ET
$1295(usd)

Request Private Team Training