Home page Courses

Search

Search courses or pages...

Learn Statistics | Zoonk

Statistics

Statistics is the science of collecting, analyzing, and interpreting data to make better decisions. Covers how to spot patterns, measure uncertainty, and turn numbers into useful insights for business, research, healthcare, and public policy.

Turn questions into data

Turn questions into data

Start with cases, variables, units, tables, and the difference between data, information, and claims. You will practice turning everyday questions into data that can actually be analyzed.

Name the kind of data you have

Name the kind of data you have

Work with counts, categories, measurements, dates, and ordered ratings. This chapter shows how variable type controls the summaries, graphs, and models that make sense.

Keep data organized enough to trust

Keep data organized enough to trust

Use spreadsheets, data frames, code notebooks, and tidy data rules to keep datasets readable and safe to change. You will clean names, spot entry errors, and document what each column means.

See patterns with clear graphs

See patterns with clear graphs

Use dotplots, histograms, bar charts, boxplots, and scatterplots to see shape, spread, outliers, and relationships. The goal is to notice patterns before calculating too much.

Summarize data without hiding the story

Summarize data without hiding the story

Calculate means, medians, proportions, percentiles, variance, standard deviation, and z-scores. You will connect each number to the real-world question it helps answer.

Think clearly about variation

Think clearly about variation

Separate natural variation, measurement error, bias, and random noise. This chapter builds the habit of asking what could have produced the data besides the first explanation that comes to mind.

How statistics became a way to reason with uncertainty

How statistics became a way to reason with uncertainty

Follow how statistics grew from censuses, gambling problems, astronomy, agriculture, public health, industry, computing, and data science. This history explains why today’s methods care so much about uncertainty, design, evidence, and reproducibility.

Measure uncertainty with probability

Measure uncertainty with probability

Use events, sample spaces, probability rules, independence, and conditional probability. You will build probability statements from plain-language situations instead of memorizing formulas blindly.

Use distributions to model chance

Use distributions to model chance

Work with random variables, expected value, variance, and common probability models. Binomial, geometric, Poisson, uniform, normal, exponential, and related distributions become tools for matching data to real processes.

Describe how variables move together

Describe how variables move together

Use joint, marginal, and conditional distributions to describe several variables at once. This chapter covers covariance, correlation, dependence, and why association alone does not prove cause.

See what repeated sampling would do

See what repeated sampling would do

See why random samples behave predictably even when individual observations do not. You will use sampling distributions, standard error, and the central limit theorem as the bridge from data to inference.

Estimate a population from a sample

Estimate a population from a sample

Estimate unknown quantities with point estimates, margins of error, and confidence intervals. You will practice saying what an interval does and does not guarantee.

Test claims with evidence and restraint

Test claims with evidence and restraint

Use null hypotheses, test statistics, p-values, significance levels, and errors to judge whether data are surprising under a claim. This chapter also covers practical significance so results do not become empty rituals.

Compare groups without fooling yourself

Compare groups without fooling yourself

Compare means, proportions, rates, and paired measurements. You will choose between common one-sample, two-sample, paired, and proportion tests based on the structure of the data.

Plan studies with power and precision

Plan studies with power and precision

Plan sample sizes before collecting data and interpret power after a study is done. This chapter shows how effect size, noise, cost, and ethical limits shape a realistic study.

Design experiments that can answer cause-and-effect questions

Design experiments that can answer cause-and-effect questions

Use random assignment, control groups, blocking, blinding, factorial designs, and replication. You will see how good experiments create stronger evidence than analysis alone can rescue.

Run surveys that represent real people

Run surveys that represent real people

Build questionnaires, sampling frames, stratified samples, cluster samples, weights, and nonresponse checks. This chapter covers the practical choices behind polls, official statistics, and field surveys.

Use resampling when formulas are not enough

Use resampling when formulas are not enough

Use bootstrap intervals, permutation tests, and simulation when formulas are hard or assumptions are shaky. You will resample real datasets to measure uncertainty directly.

Draw a line that explains one relationship

Draw a line that explains one relationship

Fit and interpret simple linear regression with slopes, intercepts, residuals, fitted values, and uncertainty. You will connect a line on a graph to a claim about prediction or association.

Build regression models with several predictors

Build regression models with several predictors

Add several predictors, categorical variables, interactions, and transformations to regression models. This chapter teaches adjustment, confounding control, and the danger of over-reading coefficients.

Check whether a model deserves trust

Check whether a model deserves trust

Check residual plots, leverage, outliers, multicollinearity, nonlinearity, unequal variance, and influential cases. You will revise models based on evidence instead of treating software output as final truth.

Model outcomes that are not simple averages

Model outcomes that are not simple averages

Model yes-or-no outcomes, counts, rates, and other non-normal data with logistic, Poisson, and related models. This chapter turns odds ratios, rate ratios, and predicted probabilities into plain language.

Compare many groups at once

Compare many groups at once

Compare several groups with ANOVA, planned contrasts, and multiple-comparison controls. You will connect these tools to experiments, product tests, classrooms, farms, and clinical studies.

Work with counts, rates, and categories

Work with counts, rates, and categories

Analyze contingency tables, chi-square tests, Fisher’s exact test, risk differences, risk ratios, and odds ratios. This chapter supports work with surveys, medical studies, quality checks, and social data.

Handle data that breaks neat assumptions

Handle data that breaks neat assumptions

Use rank-based tests and robust summaries when outliers, skew, or small samples make standard methods fragile. You will know when a simpler assumption-light method is the better choice.

Find structure in many variables

Find structure in many variables

Work with clustering, principal components, factor analysis, and multidimensional scaling. This chapter shows how to reduce many variables into patterns people can inspect and discuss.

Forecast data that changes over time

Forecast data that changes over time

Analyze repeated observations over time with trend, seasonality, autocorrelation, smoothing, ARIMA ideas, and forecast checks. You will learn why time order changes the rules of ordinary regression.

Model how long things take

Model how long things take

Analyze time-to-event outcomes with censoring, Kaplan-Meier curves, hazard rates, and Cox regression. This chapter applies to medicine, reliability, churn, recidivism, and any setting where timing matters.

Use models for grouped data

Use models for grouped data

Handle nested and repeated data with random effects, partial pooling, and hierarchical models. You will model classrooms, clinics, stores, regions, users, and experiments where observations come in groups.

Update beliefs with Bayesian statistics

Update beliefs with Bayesian statistics

Use prior information, likelihoods, posterior distributions, credible intervals, and Bayes factors. This chapter builds Bayesian reasoning from probability rules and shows how it differs from frequentist inference.

Fit Bayesian models with modern computation

Fit Bayesian models with modern computation

Fit Bayesian models with MCMC, diagnostics, posterior predictive checks, and sensitivity analysis. You will see how tools like Stan, PyMC, and brms made complex Bayesian modeling practical.

Estimate causes, not just correlations

Estimate causes, not just correlations

Use DAGs, confounders, colliders, mediators, randomized trials, natural experiments, matching, weighting, difference-in-differences, instrumental variables, and regression discontinuity. This chapter gives a practical language for cause-and-effect claims.

Design target trials from messy real-world data

Design target trials from messy real-world data

Frame observational studies as if they were trials by defining eligibility, treatment, timing, outcomes, and follow-up. This modern workflow helps avoid hidden biases in health, policy, business, and platform data.

Deal with missing data honestly

Deal with missing data honestly

Detect missingness patterns and use complete-case analysis, weighting, single imputation, and multiple imputation. You will judge when missing data are a nuisance, a threat, or the main story.

Model data with more predictors than comfort allows

Model data with more predictors than comfort allows

Fit models with many predictors using cross-validation, ridge, lasso, elastic net, and feature selection. This chapter covers the high-dimensional problems common in genomics, text, sensors, marketing, and finance.

Judge prediction models fairly

Judge prediction models fairly

Use train-test splits, loss functions, calibration, ROC curves, precision-recall curves, and model comparison. You will connect statistical judgment to predictive modeling without treating prediction as magic.

Use machine learning as a statistician

Use machine learning as a statistician

Compare decision trees, random forests, gradient boosting, nearest neighbors, and support vector machines with statistical modeling habits. This chapter shows where machine learning extends statistics and where it creates new risks.

Run A/B tests at real-world scale

Run A/B tests at real-world scale

Analyze randomized online experiments with metrics, assignment units, guardrails, peeking risks, sequential tests, and heterogeneous effects. You will see how A/B testing works in products, marketing, policy pilots, and service delivery.

Make statistical work reproducible

Make statistical work reproducible

Use reproducible code, version control, notebooks, scripts, environments, data validation, and literate reports. This chapter turns one-off analysis into work another person can rerun and audit.

Protect privacy and fairness in statistical work

Protect privacy and fairness in statistical work

Protect people and organizations with de-identification limits, consent, secure data handling, fairness checks, and differential privacy. You will handle sensitive data with methods that match the risk.

Tell the statistical story without overselling it

Tell the statistical story without overselling it

Present uncertainty with interval plots, prediction displays, clear tables, plain-language caveats, and decision-focused summaries. This chapter helps you communicate results to people who must act on them.

Take a statistical project from question to decision

Take a statistical project from question to decision

Move from a real question to study design, data collection, cleaning, analysis, validation, reporting, and follow-up decisions. This end-to-end chapter ties together the habits of a working statistician.

Audit an analysis before trusting the answer

Audit an analysis before trusting the answer

Find mistakes before they spread by checking data lineage, assumptions, code, peer review, preregistration, replication, and sensitivity analyses. You will practice professional skepticism without becoming paralyzed.

Build your path as a statistician

Build your path as a statistician

Map the field’s paths in biostatistics, official statistics, social science, sports, finance, industry, data science, and research. This chapter covers portfolio projects, graduate routes, common tools, professional groups, certifications, and habits for staying current.