How to use the sprtt package?

The sprtt package is a toolbox for sequential probability ratio tests (SPRTs). This vignette introduces the core functions and demonstrates a typical analysis workflow. For more comprehensive guides on specific topics, see the other package vignettes. If you are unfamiliar with SPRTs, please read first the vignette vignette("sprt").

Workflow

1. Understand the theoretical background of SPRTs

The foundational literature (Wald, 1945, 1947) established the mathematical framework for SPRTs. While these original papers provide theoretical depth, they require strong mathematical statistics background and focus primarily on abstract theory rather than practical application.

For practical implementation, we strongly recommend starting with the following simulation studies, which demonstrate robustness to assumption violations, explain common pitfalls, and provide actionable guidance:

Essential reading:

Sequential t-tests: Schnuerch & Erdfelder (2020) — Demonstrates performance under various assumption violations and provides practical guidance
Sequential one-way ANOVA: Steinhilber et al. (2024) — Extends SPRTs to the one-way ANOVA design with comprehensive simulation evidence
Understanding questionable research practices in sequential designs: Steinhilber et al. (2025) — Critical for avoiding misuse and ensuring appropriate application

For comparisons of different sequential designs (including SPRTs), see:

Schnuerch & Erdfelder (2020) – Group sequential designs vs Sequential Bayes Factor Test vs SPRT
Stefan et al. (2022) – Sequential Bayes Factor Test vs SPRT

2. When to use SPRTs

Whether a statistical tool is appropriate depends strongly on the research context and intended use. SPRTs are recommended when:

Resources are limited and efficiency is a priority
The research question translates to a clear hypothesis test: Concluding in favor of either \(H_0\) (no effect) or \(H_1\) (effect) provides useful information for your research question.
Data can be collected sequentially
Data can be analyzed sequentially (ideally after each new observation)
Long-term error rates need to be controlled (both \(\alpha\) and \(\beta\) error rates)

SPRTs are not recommended when:

Data are collected all at once or only in very large batches, see here fixed designs or Group Sequential Designs
Multiple hypotheses must be tested on the same dataset (theoretically possible but not yet implemented)
Groups cannot be sampled somewhat equally over time (e.g., collecting all participants from one group before starting the other)
Sufficient literature already establishes the existence of the effect and the main goal is precise effect size estimation (see Steinhilber et al. (2024))

3. Plan your resources

The plan_sample_size() function helps establish realistic expectations for data requirements and resource planning.

Unlike traditional designs, SPRTs do not require classical a priori power analysis. Power is controlled through the stopping boundaries, allowing you to start data collection immediately and stop once the test reaches a decision.

However, resource planning remains essential. While the boundaries control \(\alpha\) and \(\beta\) error rates in the long run, they cannot guarantee you will collect enough data to reach a decision within your available resources.

Two sample sizes are relevant for planning:

Expected sample size (\(N_{\text{median}}\)): The median sample size required to reach a decision. Most studies will finish near this value.
Maximum sample size (\(N_{\text{max}}\)): The maximum affordable sample size. Choose settings where \(N_{\text{max}}\) yields a decision rate of at least 80% (i.e., the test reaches a decision in at least 80% of cases where the maximum sample is reached).

The plan_sample_size() function provides tables and plots to help you find appropriate design parameters that balance efficiency with feasibility.

For detailed examples and guidance, see the vignette vignette("plan_sample_size").

4. Plan the data collection and register your test specifications

A thoughtful data collection plan is highly recommended to keep groups somewhat balanced and to track and balance (or randomize) potential confounders.

Outliers

The question of how one should deal with outliers in sequential testing is still an ongoing research topic. Note, that implementing a naive sequential outlier analysis can lead to a inflation of the \(\alpha\) error rate, see Steinhilber et al. (2025).

As SPRTs are best suited for confirmatory research, preregistering the data collection plan, hypothesis, and test specifications (e.g., effect size of interest, \(\alpha\) and \(\beta\) levels) is strongly recommended.

Preparing an analysis script in advance enables a smooth process for continuously analyzing incoming data. This ensures data collection stops immediately once the stopping criterion is reached, avoiding unnecessary additional data collection.

To test the analysis pipeline, use either:

The example datasets included in the sprtt package
The data generating functions, which allow specifying the true effect size for simulation testing

5. Collect the data and apply the SPRTs

The following test are currently implemented:

sequential t-tests, see vignette("t_test")
sequential one-way ANOVA with independent groups, see vignette("one_way_anova")

5. Reporting of the results

Guidelines for the reporting of SPRTs can be found in the paper of Schubert et al. (2025) that also explicitly covers sequential testing and specifically SPRTs .

References

Schnuerch, M., & Erdfelder, E. (2020). Controlling decision errors with minimal costs: The sequential probability ratio t test. Psychological Methods, 25(2), 206–226. https://doi.org/10.1037/met0000234

Schubert, A.-L., Steinhilber, M., Kang, H., & Quintana, D. (2025). Improving statistical reporting in psychology. OSF. https://doi. org/10.31234/osf. io/5gr9n_v.

Stefan, A. M., Schönbrodt, F. D., Evans, N. J., & Wagenmakers, E.-J. (2022). Efficiency in sequential testing: Comparing the sequential probability ratio test and the sequential Bayes factor test. Behavior Research Methods, 54(6), 3100–3117. https://doi.org/10.3758/s13428-021-01754-8

Steinhilber, M., Schnuerch, M., & Schubert, A.-L. (2024). Sequential analysis of variance: Increasing efficiency of hypothesis testing. Psychological Methods. https://doi.org/10.1037/met0000677

Steinhilber, M., Schnuerch, M., & Schubert, A.-L. (2025). The Dark Side of Sequential Testing: A Simulation Study on Questionable Research Practices. PsyArXiv. https://doi.org/10.31234/osf.io/vkbu3_v1

Wald, A. (1945). Sequential tests of statistical hypotheses. The Annals of Mathematical Statistics, 16(2), 117–186.

Wald, A. (1947). Sequential analysis. Wiley.

Meike Snijder-Steinhilber

2026-02-24