Coffee and Research
  • Home
  • cifmodeling
  • A Conversation on Causality at Our Table (EN)
    • Index
    • Study design
  • A Conversation on Causality at Our Table (JP)
    • Index
    • Study design

On this page

  • Glossary − Statistical Terms in Plain Language
    • Her father explains words in statistics and research

Statistical Terms in Plain Language

Statistical terms can quietly become barriers to understanding. A small glossary written by a father for his daughter, in plain and careful language. A place to pause and look up words during your coffee break.
Published

December 14, 2025

Glossary − Statistical Terms in Plain Language

Keywords: causal model, language & writing, probability model, survival & competing risks


Her father explains words in statistics and research

Me: “Dad, there’s a paper we’re doing a journal club on in my department right now, and I’m struggling with the nuance of some of the terminology. When you talk, a lot of statistical terms come up. To be honest, some of them still don’t feel very real to me. Would you mind rephrasing statistical terms in more everyday language?”

Dad: “I’ve got some time today, so sure. Let’s try a few. I’ll go through them in alphabetical order.”

Aalen-Johansen method

A method to estimate a cumulative incidence curve in competing risks analysis

Alpha error

An error where we conclude that a treatment is effective, or that an association exists, when in fact it does not. This is also called a false positive. In medical research, it is customary to control this error at 5% or less using p-values or similar criteria.

Attributable fraction

A measure obtained by transforming risks or incidence rates to express the proportion of disease occurrence in a specific population that can be attributed to exposure to a particular risk factor. Multiple definitions exist.

Beta error

A key quantity in sample size planning. It is the probability of concluding that a truly effective treatment has no effect. When large amounts of data can be collected and beta error is low, the study is said to have high power.

Bias

In general usage, bias refers to a distorted perspective or behavior. In statistics, it refers to the tendency of an estimator to deviate from the true value (the estimand), or the magnitude of that deviation. Once bias is detected after data collection, it is often difficult to correct.

Biostatistician

A statistical specialist working in medicine or life sciences. Biostatisticians belong to professional societies and communities distinct from those of bioinformaticians, data scientists, or epidemiologists. In a narrow sense, the term refers to professionals specializing in clinical trials, for which formal certification systems exist—though the “bio” prefix often makes the role seem less clear.

Censoring

In survival analysis, a situation in which observation is interrupted at a certain point in time, so that we only know that the event of interest would have occurred after that point.

Competing risk

An event that, once it occurs, prevents observation of the event of interest—for example, death from an accident when studying death from cancer.

Confounding

A type of bias that must be considered when comparing groups.

Cox regression

A statistical method developed by Professor Cox in 1972 that became widely used. It is a type of regression model, but not a generalized linear model. It treats survival time as the outcome and is used to summarize differences in survival curves via hazard ratios or to examine associations between survival time and various factors.

Design

Traditionally, “design” in statistics referred to experimental design, but the term has expanded to include study planning and the elements decided at that stage. Randomized trials, surveys, and cohort studies are all types of study design.

Directed acyclic graph (DAG) In this work, a model refers to an abstract way of viewing data. A DAG is one a structural causal model that represents causal relationships across variables using arrows.

Effect size

The magnitude of a treatment effect or the difference between treatments. In sample size planning, an effect size must be specified, which is paradoxical because the study is often conducted precisely to learn that value. This dilemma is sometimes mocked as “the smallest difference that can be calculated backward from the sample size.”

Estimand

In clinical trials, adverse events or treatment discontinuation after treatment initiation can make it unclear what treatment effect is actually being evaluated. An estimand is a statistical term that clarifies exactly what quantity the study aims to estimate. The ICH E9 guideline requires that estimands be specified at the planning stage of a clinical trial. They are the true values in statistical inference.

Fine-Gray model

A statistical method developed by Professor Fine in 1999 as an extension of Cox regression. It is a regression model for survival data with competing risks.

Generalized linear model

A class of “regression models” that includes regression analysis, analysis of variance, logistic regression, and related methods. In statistical software, it is often abbreviated as “GLM.”

Hazard ratio

A measure derived from Cox regression, used to compare survival curves.

Information bias

A type of bias that must be considered when collecting information.

Kaplan-Meier method

A method to estimate a survival curve in survival analysis

Logistic regression

A statistical method used to examine the association between the probability of an event and other variables. It is a type of regression model used when the outcome is binary. Taking the exponential of the regression coefficient yields an odds ratio.

One-sided p-value

A p-value used when interest lies only in one direction of an association—either positive or negative. In most situations, however, a two-sided p-value is preferred.

Probability model In this work, a model refers to an abstract way of viewing data. A probabilistic model is one that assumes what kind of probability distribution data are generated from.

Proportion

A measure representing the fraction of a whole. It is commonly used to summarize binary or categorical data. Because it is calculated by dividing a count by a count, the units cancel out and it is dimensionless.

Proportional hazards assumption

The assumption that the speed at which events such as death or disease progression occur differs between groups by a constant factor, and that this relationship remains unchanged over time.

p-value

A number that many readers look at first when examining study results. It is used to assess statistical significance—that is, whether an observed association exceeds what could be explained by random variation.

Randomization

The process of randomly assigning participants to interventions—such as a new treatment versus standard care—when comparing intervention effects. This is one way to reduce bias.

Rate

A measure of the speed at which events occur over time. In epidemiology, rates are often calculated using person-time (number of events divided by person-years), giving units of 1/year (or more generally, 1/time).

Ratio

A measure obtained by dividing one quantity by another. Proportions and rates are types of ratios.

Regression model

A statistical method used to examine how one variable is related to others. Although often used interchangeably with “regression analysis,” the term “model” typically emphasizes the underlying mathematical or probabilistic formulation rather than the analytical procedure itself.

Risk

The probability that a disease will occur. Because risk depends on the population being studied, using risk values without context should be avoided.

Rubin causal model

In this work, a model refers to an abstract way of viewing data. The Rubin causal model defines causal effects using potential outcome variables, which represent what the outcome would have been for the same unit had the cause taken a different value. The term is sometimes used to denote a broader framework that includes assignment mechanisms and probability distributions; however, in this work, these components are not made explicit, as the focus is on the definition of causal effects.

Sample size calculation

The process of determining the scale of a study, such as the number of participants. By specifying quantities such as the effect size, alpha error, and beta error (or power), the required sample size can be calculated.

Selection bias

A type of bias that must be considered when choosing the study population.

Surrogate endpoint

When a clinically meaningful outcome cannot be obtained, a substitute endpoint may be used. For example, in cancer trials, response rate has often been used as a surrogate for overall survival, though it has been criticized for not necessarily reflecting true survival benefit.

Survival curve

A graph that displays time on the horizontal axis and the proportion of individuals surviving at each time point on the vertical axis. In many disease areas, such as oncology and cardiovascular medicine, study results are often presented as survival curves.

Two-sided p-value

A p-value used when interest lies in both positive and negative associations. In most cases, two-sided p-values are preferred over one-sided ones.

Me: “Seeing them all laid out like this, the curse words I saw in the paper are starting to look a bit more like human language.”

Dad: “Exactly. Once you understand the terminology, you can focus on the clinical questions and the data themselves.”

Differences among statistical terms

A survival curve is a formal technical term in statistics, but many other expressions with the same or similar meanings are used, such as survival function, survival rate, overall survival curve, and Kaplan-Meier curve. In fact, each has a slightly different meaning.

  • Almost synonymous with survival curve Survival function

  • Similar meaning, but referring more to a value at a specific time point than to a curve Survival rate, survival probability, survival proportion

  • Specifying the statistical method used to estimate the survival curve Kaplan-Meier curve, Kaplan-Meier estimator, Kaplan-Meier estimate, Kaplan-Meier method

  • Specifying the type of endpoint Overall survival curve, disease-free survival curve, 5-year OS, 3-year DFS

In statistics, the method used for estimation is called an estimator, the resulting value is an estimate, and the target quantity is the estimand. This distinction is important when writing papers: in the Methods section, Kaplan-Meier estimator is natural, whereas in the Results section, Kaplan-Meier estimate is often more appropriate. Instead of “estimator,” the expression “method” (e.g., Kaplan-Meier method) can also be used in a broader sense.

Finally, special attention should be paid to cumulative incidence curves and cumulative incidence functions. They are often mistaken for survival curves, but they are terms used specifically when competing risks are present and are carefully distinguished by statisticians. In technical terms, it is also referred to as the Aalen-Johansen curve, named after the statisticians. Simply flipping a survival curve upside down and calling it “cumulative” is a conventional expression, not a formal statistical term, and can lead to misunderstanding if not clearly distinguished from cumulative incidence curves.

Quiz related to this episode

The following four terms are not formal statistical terms, but they are often seen in clinical trial literature. Which of them cannot be considered incorrect?

  1. Sample number
  2. COX regression
  3. OS curve
  4. T-test
Answer
  • The correct answer is 3 Some papers use “sample number” to mean sample size, but in clinical trials “sample number” would correspond to the number of groups, making this usage incorrect. Cox is a proper name, so only the first letter should be capitalized. “OS curve” can reasonably be interpreted as an overall survival curve, so it is not necessarily incorrect. “T-test” is sometimes used, but since it refers to a different statistical method from the t-test and is not used in clinical trials, it should be considered a typographical error.