Haskell Data Analysis
This course is geared for experienced Python developers, analysts, or other data-driven professionals who want to utilize Haskell to work and generate publication-ready visualizations in no time.
Every business and organization that collects data is capable of tapping into its own data to gain insights on how to improve. Haskell is a purely functional programming language, well-suited to handling significant data analysis problems. This course will take you through the more complex data analysis problems in a hands-on manner. This course will help you get up-to-speed with the basics of data analysis and approaches in the Haskell language.
You'll learn about statistical computing, file formats (CSV and SQLite3), descriptive statistics, charts, and progress to more advanced concepts such as understanding the importance of normal distribution. While mathematics is a big part of data analysis, we've tried to keep this course approachable and straightforward so that you can apply what you learn to the real world. By the end of this course, you will have a thorough understanding of data analysis and the different ways of analyzing data. You will have a mastery of Haskell's tools and techniques for effective data analysis.
Duration
2 days/16 hours of instructionPublic Classroom Pricing
Private Group Pricing
Have a group of 5 or more students? Request special pricing for private group training today.
Download the Course Brochure
Part 1: Descriptive Statistics
- Descriptive Statistics
- The CSV library – working with CSV files
- Data range
- Data mean and standard deviation
- Data median
- Data mode
Part 2: SQLite3
- SQLite3
- SQLite3 command line
- Working with SQLite3 and Haskell
- Slices of data
- Working with SQLite3 and descriptive statistics
Part 3: Regular Expressions
- Regular Expressions
- Dots and pipes
- Atom and Atom modifiers
- Character classes
- Regular expressions in CSV files
- SQLite3 and regular expressions
Part 4: Visualizations
- Visualizations
- Line plots of a single variable
- Plotting a moving average
- Creating publication-ready plots
- Feature scaling
- Scatter plots
Part 5: Kernel Density Estimation
- Kernel Density Estimation
- The central limit theorem
- Normal distribution
- Introducing kernel density estimation
- Application of the KDE
Part 6: Course Review
- Course Review
- Converting CSV variation files into SQLite3
- Using SQLite3 SELECT and the DescriptiveStats module for descriptive statistics
- Creating compelling visualizations using EasyPlot
- Reintroducing kernel density estimation
Professionals who would benefit from this training include:
- Developers
- Analysts
- IT Managers
- Product Managers
A working knowledge of Python is required to attend this course.
- Learn to parse a CSV file and read data into the Haskell environment
- Create Haskell functions for common descriptive statistics functions
- Create an SQLite3 database using an existing CSV file
- Learn the versatility of SELECT queries for slicing data into smaller chunks
- Apply regular expressions in large-scale datasets using both CSV and SQLite3 files
- Create a Kernel Density Estimator visualization using normal distribution