Skip to content

Pages

Courses

ALL COURSES

Resources

ALL RESOURCES

Blogs

ALL BLOGS
Use coupon code LEARN21 at checkout for 21% off any course. Valid through 9/30/21.
*Offer excludes Scaled Agile (SAFe®) courses, CSP-SM and private team training.

Python in Data Science: Intermediate Python

Intermediate Python with NumPy, Pandas, SciKit Learn, SciPy, Spark, Streaming & More

This course covers the essentials of using Python as a tool for data scientists to perform exploratory data analysis, complex visualizations, and large-scale distributed processing on “Big Data”. In this course, we cover essential mathematical and statistics libraries such as NumPy, Pandas, SciPy, SciKit-Learn, frameworks like TensorFlow and Spark, as well as visualization tools like matplotlib, PIL, and Seaborn. This course is ‘intermediate level’ as it assumes that attendees have solid data analytics and data science background and have basic Python knowledge.  Topics are introductory in nature but are covered in-depth, geared for experienced students.

This course is about 50% hands-on lab to 50% lecture ratio, combining engaging instructor presentations, demos, and practical group discussions with extensive machine-based student labs and project work. Throughout the course, students will learn to write Python scripts and apply them within a scientific framework working with the latest technologies listed on the agenda. This course provides indoctrination in the practical use of the umbrella of technologies that are on the leading edge of data science development. 

Available formats for this course
In-Person
Live Online
Private Team Training
Duration
5 days/40 hours of instruction
Public Classroom Pricing

Starting at: $2595(USD)

$3350(CAD)

GSA Price: $1945

Group Rate: $2495

Get the full details on this course. Download the .PDF Brochure below:

Part 1: Python for Data Science
1. Python Review (Optional)
•    Python Language
•    Essential Syntax
•    Lists, Sets, Dictionaries, and Comprehensions
•    Functions
•    Classes, Modules, and imports
•    Exceptions
2. iPython
•    iPython basics
•    Terminal and GUI shells
•    Creating and using notebooks
•    Saving and loading notebooks
•    Ad hoc data visualization
•    Web Notebooks (Jupyter)
3. numpy
•    numpy basics
•    Creating arrays
•    Indexing and slicing
•    Large number sets
•    Transforming data
•    Advanced tricks
4.scipy
•    What can scipy do?
•    Most useful functions
•    Curve fitting
•    Modeling
•    Data visualization
•    Statistics
5. A tour of scipy subpackages
•    Clustering
•    Physical and mathematical Constants
•    FFTs
•    Integral and differential solvers
•    Interpolation and smoothing
•    Input and Output
•    Linear Algebra
•    Image Processing
•    Distance Regression
•    Root-finding
•    Signal Processing
•    Sparse Matrices
•    Spatial data and algorithms
•    Statistical distributions and functions
•    C/C++ Integration
6. pandas
•    pandas overview
•    Dataframes
•    Reading and writing data
•    Data alignment and reshaping
•    Fancy indexing and slicing
•    Merging and joining data sets
7. matplotlib
•    Creating a basic plot
•    Commonly used plots
•    Ad hoc data visualization
•    Advanced usage
•    Exporting images
8. The Python Imaging Library (PIL)
•    PIL overview
•    Core image library
•    Image processing
•    Displaying images
9. seaborn
•    Seaborn overview
•    Bivariate and univariate plots
•    Visualizing Linear Regressions
•    Visualizing Data Matrices
•    Working with Time Series data
10. SciKit-Learn Machine Learning Essentials
•    SciKit overview
•    SciKit-Learn overview
•    Algorithms Overview
•    Classification, Regression, Clustering, and Dimensionality Reduction
•    SciKit Demo
11. TensorFlow Overview
•    TensorFlow overview
•    Keras
•    Getting Started with TensorFlow

Part 2: Python on Spark
1. PySpark Overview
•    Python and Spark
•    SciKit-Learn vs. Spark MLlib
•    Python at Scale
•    PySpark Demo
2. RDDs and DataFrames
•    DataFrames and Resilient Distributed Datasets (RDDs)
•    Partitions
•    Adding variables to a DataFrame
•    DataFrame Types
•    DataFrame Operations
•    Dependent vs. Independent variables
•    Map/Reduce with DataFrames
3. Spark SQL
•    Spark SQL Overview
•    Data stores: HDFS, Cassandra, HBase, Hive, and S3
•    Table Definitions
•    Queries
4. Spark MLib
•    MLib overview
•    MLib Algorithms Overview
•    Classification Algorithms
•    Regression Algorithms
•    Decision Trees and forests
•    Recommendation with ALS
•    Clustering Algorithms
•    Machine Learning Pipelines
•    Linear Algebra (SVD, PCA)
•    Statistics in MLib
5. Spark Streaming
•    Streaming overview
•    Integrating Spark SQL, MLlib, and Streaming

  • Experienced data analysts, developers, engineers or anyone tasked with utilizing Python for data analytics tasks. 
  • Attending students are required to have a background in basic Python development skills.

  • Work with Python in a Data Science Context
  • Use NumPy, Pandas, and MatPlotLib
  • Create and process images with PIL
  • Visualize with Seaborn
  • Interact with Spark using DataFrames
  • Use SparkSQL, MLlib, and Streaming in BigData

Python in Data Science: Intermediate Python Schedule

Location
Date
Register
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000VOqoHQAT', 'startDate' => '2021-09-13', 'startTime' => '10:00 AM', 'endDate' => '2021-09-17', 'endTime' => '6:00 PM', 'name' => 'TTPS487605VCL02', 'standardCourseFee' => '2595', 'standardCourseFeeCAD' => '3350', 'standardCourseFeeEUR' => '2300', 'courseId' => 'TTPS4876', 'course' => 'a001G00000EGy3xQAD', 'courseSfId' => 'a001G00000EGy3xQAD', 'courseName' => 'Python in Data Science: Intermediate Python', 'instructorId' => '151136', 'instructorSfId' => 'a023700000Z1UwcAAF', 'instructorDisplayName' => 'Trivera', 'instructorName' => 'Trivera', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso5fQAB', 'subjectName' => 'Software Development', 'specialitySfId' => 'a051G00000Jso5uQAB', 'specialityName' => 'Python', 'expertiseSfId' => 'a051G00000JsoBMQAZ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Sep 13th - 17th 10:00 AM - 6:00 PM ET
$2595(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000VOqoIQAT', 'startDate' => '2021-11-01', 'startTime' => '10:00 AM', 'endDate' => '2021-11-05', 'endTime' => '6:00 PM', 'name' => 'TTPS487606VCL', 'standardCourseFee' => '2595', 'standardCourseFeeCAD' => '3350', 'standardCourseFeeEUR' => '2300', 'courseId' => 'TTPS4876', 'course' => 'a001G00000EGy3xQAD', 'courseSfId' => 'a001G00000EGy3xQAD', 'courseName' => 'Python in Data Science: Intermediate Python', 'instructorId' => '151136', 'instructorSfId' => 'a023700000Z1UwcAAF', 'instructorDisplayName' => 'Trivera', 'instructorName' => 'Trivera', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso5fQAB', 'subjectName' => 'Software Development', 'specialitySfId' => 'a051G00000Jso5uQAB', 'specialityName' => 'Python', 'expertiseSfId' => 'a051G00000JsoBMQAZ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Nov 1st - 5th 10:00 AM - 6:00 PM ET
$2595(usd)
CPSFDC\Entity\Session::__set_state(array( 'entitySanitizer' => 'CPSFDC\\Utils\\Sanitizers\\SessionSanitizer', 'sfId' => 'a011G00000VOqoJQAT', 'startDate' => '2021-12-06', 'startTime' => '10:00 AM', 'endDate' => '2021-12-10', 'endTime' => '6:00 PM', 'name' => 'TTPS487606VCL02', 'standardCourseFee' => '2595', 'standardCourseFeeCAD' => '3350', 'standardCourseFeeEUR' => '2300', 'courseId' => 'TTPS4876', 'course' => 'a001G00000EGy3xQAD', 'courseSfId' => 'a001G00000EGy3xQAD', 'courseName' => 'Python in Data Science: Intermediate Python', 'instructorId' => '151136', 'instructorSfId' => 'a023700000Z1UwcAAF', 'instructorDisplayName' => 'Trivera', 'instructorName' => 'Trivera', 'locationCode' => 'VCL', 'sessionStatus' => 'O', 'city' => 'Live Online Training', 'state' => '', 'cityState' => 'Live, Online Training', 'locationSfId' => 'a0637000000tn2hAAA', 'subjectSfId' => 'a051G00000Jso5fQAB', 'subjectName' => 'Software Development', 'specialitySfId' => 'a051G00000Jso5uQAB', 'specialityName' => 'Python', 'expertiseSfId' => 'a051G00000JsoBMQAZ', 'expertiseName' => NULL, 'certificationSfId' => NULL, 'certificationBody' => '', 'certificationName' => '', 'pdus' => '', 'deliveryModalities' => 'In-Person;Live Online', 'fullFacilityAddress' => 'Live Instructor-Led Online Training
Please call Cprime for location details
at 1-877-800-5221
, LOL', 'gtr' => '', ))
Live, Online Training
Dec 6th - 10th 10:00 AM - 6:00 PM ET
$2595(usd)

Request Private Team Training