How to Set Delta in the Two-One-Sided T-tests Procedure (TOST)
Issue: Vol 5 No. 1-2 (2018)
Journal: Journal of Research Design and Statistics in Linguistics and Communication Science
Subject Areas: Linguistics
DOI: 10.1558/jrds.39002
Abstract:
The Two-One-Sided T-test procedure (TOST) is used to show that two samples are equivalent or similar, in contrast to classical statistical tests which check for dissimilarity. The TOST relies on a parameter called delta, which has to be set by the researcher using their intuition. Doing so can be difficult, because of complex interactions of relevant parameters. In this article we present a method to set delta, which is established and validated through extensive simulations based on real data sets from linguistics and other sciences. The presented method is shown to be sound and reliable, but we cannot exclude deviant early model behaviour (N≤10) and deviant late model behaviour (N>100,000).
Author: Tom S. Juzek, Johannes Kizach
References :
Aguilar-Sánchez, J. (2014). Replicability of (Socio)Linguistic Studies. Journal of Research Design and Statistics in Linguistics and Communication Science 1 (1): 1-21. | |
| |
Aguilar-Sánchez, J. (2018). Copula+ Adjective: An a-posteriori power analysis for the generalizability of results. Journal of Research Design and Statistics in Linguistics and Communication Science 4 (2): 91-123. | |
| |
Altman, D. G. and Bland, J. M. (1995). Statistics notes: Absence of evidence is not evidence of absence. BMJ 311 (7003): 485. | |
| |
Lin, M., Lucas Jr, H. C., and Galit, S. (2013). Research commentary - too big to fail: Large samples and the p-value problem. Information Systems Research 24 (4): 906-917. | |
| |
Liu, X. S. (2013). Statistical Power Analysis for the Social and Behavioral Sciences: Basic and Advanced Techniques. New York: Routledge. | |
| |
Richter, S. J. and Richter, C. (2002). A method for determining equivalence in industrial applications. Quality Engineering 14 (3): 375-380. doi:10.1081/QEN-120001876. | |
| |
Royall, R. M. (1986). The effect of sample size on the meaning of significance tests. The American Statistician 40 (4): 313-315. | |
| |
Schuirmann, D. L. (1981). On hypothesis-testing to determine if the mean of a normal-distribution is contained in a known interval. Biometrics 37: 617-617. | |
| |
Stegner, B. L., Bostrom, A. G., and Greenfield, T. K. (1996). Equivalence testing for use in psychosocial and services research: An introduction with examples. Evaluation and Program Planning 19 (3): 193-198. | |
| |
Vasishth, S. and Broe, M. (2011). The Foundations of Statistics: A Simulation-based Approach. Berlin: Springer. http://www.springer.com/mathematics/applications/book/978-3-642-16312-8 (20 May, 2014). | |
| |
Westlake, W. J. (1976). Symmetrical confidence intervals for bioequivalence trials. Biometrics 32 (4): 741-744. | |
|