Posted on October 17, 2020 by Yash Sutrave -
Part 1: The Value and Challenges of Data-Driven Disruption
- Objectives and expectations
- Hurdles to becoming a data-driven organization
- Data empowerment
- Instilling data practices in the organization
- The CRISP-DM model of data projects
Part 2: Tying Data to Business Value
- What constitutes data-driven value
- Requirements gathering: How to approach it
- Kanban for data analysis
- Know your customers
- Stakeholder cheat sheets
- EXERCISE: Data-driven project checklist
- LAB: Data analysis techniques: Aggregations
Part 3: Understanding Your Data
- Data defined
- Data versus information
- Types of data
- Unstructured vs. Structured
- Time scope of data
- Sources of data
- Data in the real world
- The 3 V’s of data
- Data Quality
- Cleansing
- Duplicates
- SSOT
- Field standardization
- Identify sparsely populated fields
- How to fix common issues
- LAB: Prioritizing data quality
Part 4: Analyzing Data
- Analysis foundations
- Comparing programs and tools
- Words in English vs. data
- Concepts specific to data analysis
- Domains of data analysis
- Descriptive statistics
- Inferential statistics
- Analytical mindset
- Describing and solving problems
- Averages in data
- Mean
- Median
- Mode
- Range
- Central tendency
- Variance
- Standard deviation
- Sigma values
- Percentiles
- Demystifying statistical models
- Data analysis techniques
- LAB: Central tendency
- LAB: Variability
- LAB: Distributions
- LAB: Sampling
- LAB: Feature engineering
- LAB: Univariate linear regression
- LAB: Prediction
- LAB: Multivariate linear regression
- LAB: Monte Carlo simulation
Part 5: Thinking Critically About Your Analysis
- Descriptive analysis
- Diagnostic analysis
- Predictive analysis
- Prescriptive analysis
Part 6: Data Analysis in the Real World
- Deployment of analyses
- Best practices for BI
- Technology ecosystems
- Relational databases
- NoSQL databases
- Big data tools
- Statistical tools
- Machine learning
- Visualization and reporting tools
- Making data useable
Part 7: Data Visualization & Reporting
- Best practices for data visualizations
- Visualization essentials
- Users and stakeholders
- Stakeholder cheat sheet
- Common presentation mistakes
- Goals of visualization
- Communication and narrative
- Decision enablement
- Critical characteristics
- Communicating data-driven knowledge
- Formats and presentation tools
- Design considerations
Part 8: Hands-On Introduction to R and R Studio
- What is R?
- LAB: Intro to R Studio
- LAB: Univariate linear regression in R
- LAB: Multivariate linear regression in R