Data Analytics Cheat Sheet
The core ideas of Data Analytics distilled into a single, scannable reference — perfect for review or quick lookup.
Quick Reference
Descriptive Analytics
The foundational tier of analytics that summarizes historical data to answer the question 'what happened.' It uses aggregation, data visualization, and reporting to present past trends and key performance indicators in an understandable form.
Predictive Analytics
The use of statistical models, machine learning algorithms, and historical data to forecast future outcomes. It identifies patterns in past behavior to estimate probabilities and trends that have not yet occurred.
Prescriptive Analytics
The most advanced tier of analytics, which recommends specific actions by combining predictive models with optimization and simulation techniques. It answers the question 'what should we do' by evaluating trade-offs among possible decisions.
ETL (Extract, Transform, Load)
A data integration process that extracts data from source systems, transforms it into a consistent format through cleaning and enrichment, and loads it into a target data warehouse or data lake. ETL pipelines are essential for making raw data analysis-ready.
Data Visualization
The graphical representation of data and information using charts, maps, dashboards, and other visual elements. Effective visualization makes complex datasets accessible to non-technical audiences and helps reveal patterns that are difficult to detect in raw numbers.
Statistical Significance
A measure of whether an observed result is unlikely to have occurred by chance alone, typically evaluated using a p-value threshold (often 0.05). It helps analysts distinguish genuine effects from random variation in data.
Data Warehousing
The practice of collecting and managing data from multiple sources into a centralized repository optimized for analytical querying and reporting. Data warehouses use schemas designed for read-heavy workloads rather than transactional processing.
Regression Analysis
A set of statistical methods for estimating the relationships between a dependent variable and one or more independent variables. It is widely used for prediction, forecasting, and understanding which factors most influence an outcome.
A/B Testing
A controlled experiment in which two variants (A and B) are compared to determine which performs better on a defined metric. Participants are randomly assigned to each group to isolate the effect of a single change.
Data Governance
The framework of policies, processes, roles, and standards that ensures data is managed consistently, securely, and in compliance with regulations across an organization. Good governance addresses data quality, access control, lineage, and privacy.
Key Terms at a Glance
Get study tips in your inbox
We'll send you evidence-based study strategies and new cheat sheets as they're published.
We'll notify you about updates. No spam, unsubscribe anytime.