<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.trialtree.ca/index.php?action=history&amp;feed=atom&amp;title=Multiple_testing</id>
	<title>Multiple testing - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.trialtree.ca/index.php?action=history&amp;feed=atom&amp;title=Multiple_testing"/>
	<link rel="alternate" type="text/html" href="https://wiki.trialtree.ca/index.php?title=Multiple_testing&amp;action=history"/>
	<updated>2026-05-15T01:33:33Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.45.1</generator>
	<entry>
		<id>https://wiki.trialtree.ca/index.php?title=Multiple_testing&amp;diff=254&amp;oldid=prev</id>
		<title>Lawrence: /* Conclusion */</title>
		<link rel="alternate" type="text/html" href="https://wiki.trialtree.ca/index.php?title=Multiple_testing&amp;diff=254&amp;oldid=prev"/>
		<updated>2025-06-04T11:17:53Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;Conclusion&lt;/span&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 11:17, 4 June 2025&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l71&quot;&gt;Line 71:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 71:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Pre-specify Outcomes ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Pre-specify Outcomes ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Clearly define one primary outcome in the trial protocol. This helps maintain analytical focus and ensures proper control of the family-wise error rate.   &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Clearly define one primary outcome in the trial protocol. This helps maintain analytical focus and ensures proper control of the family-wise error rate.   &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Limit the number of secondary outcomes to those that are clinically meaningful and hypothesis-driven. Avoid exploratory outcomes that introduce unnecessary multiplicity.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Limit the number of secondary outcomes to those that are clinically meaningful and &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/ins&gt;hypothesis&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]]&lt;/ins&gt;-driven. Avoid exploratory outcomes that introduce unnecessary multiplicity.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Use Hierarchical Testing ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Use Hierarchical Testing ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l91&quot;&gt;Line 91:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 91:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Ensure Transparent Reporting ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Ensure Transparent Reporting ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Clearly describe how multiple testing was handled in the protocol, analysis, and publication.   &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Clearly describe how multiple testing was handled in the protocol, &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/ins&gt;analysis&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]]&lt;/ins&gt;, and publication.   &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Follow CONSORT guidelines to report:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Follow &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/ins&gt;CONSORT&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]] &lt;/ins&gt;guidelines to report:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* the number of hypotheses tested,&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* the number of hypotheses tested,&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* any interim or subgroup analyses performed,&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* any interim or subgroup analyses performed,&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l117&quot;&gt;Line 117:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 117:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Using appropriate statistical adjustments—such as Bonferroni, Holm’s, or FDR control—combined with pre-specification of outcomes, controlled interim analyses, and transparent reporting, ensures valid and reproducible conclusions.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Using appropriate statistical adjustments—such as Bonferroni, Holm’s, or FDR control—combined with pre-specification of outcomes, controlled interim analyses, and transparent reporting, ensures valid and reproducible conclusions.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;----&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;=== Bibliography ===&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;# Bender R, Lange S. Adjusting for multiple testing—when and how? &#039;&#039;Journal of Clinical Epidemiology&#039;&#039;. 2001;54(4):343–349.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;# Rothman KJ. No adjustments are needed for multiple comparisons. &#039;&#039;Epidemiology&#039;&#039;. 1990;1(1):43–46.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;# Cook RJ, Farewell VT. Multiplicity considerations in the design and analysis of clinical trials. &#039;&#039;Journal of the Royal Statistical Society: Series A&#039;&#039;. 1996;159(1):93–110.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;# Dmitrienko A, Tamhane AC, Bretz F. Multiple Testing Problems in Pharmaceutical Statistics. CRC Press; 2009.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;# EMEA. Points to Consider on Multiplicity Issues in Clinical Trials. European Medicines Agency; 2002. CPMP/EWP/908/99.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;----&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&#039;&#039;Adapted for educational use. Please cite relevant trial methodology sources when using this material in research or teaching.&#039;&#039;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Lawrence</name></author>
	</entry>
	<entry>
		<id>https://wiki.trialtree.ca/index.php?title=Multiple_testing&amp;diff=132&amp;oldid=prev</id>
		<title>Lawrence: Created page with &quot;= Multiple testing =  &#039;&#039;&#039;Multiple testing&#039;&#039;&#039;, also known as &#039;&#039;&#039;multiplicity&#039;&#039;&#039;, occurs in randomized controlled trials (RCTs) when multiple statistical comparisons are made. This increases the risk of a &#039;&#039;&#039;Type I error&#039;&#039;&#039; (false positives), potentially leading to incorrect conclusions about an intervention’s effectiveness or safety.  == Why Is Multiple Testing a Concern? ==  Each statistical test carries a risk of a false positive. As the number of tests increases, so...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.trialtree.ca/index.php?title=Multiple_testing&amp;diff=132&amp;oldid=prev"/>
		<updated>2025-03-28T02:51:16Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;= Multiple testing =  &amp;#039;&amp;#039;&amp;#039;Multiple testing&amp;#039;&amp;#039;&amp;#039;, also known as &amp;#039;&amp;#039;&amp;#039;multiplicity&amp;#039;&amp;#039;&amp;#039;, occurs in randomized controlled trials (RCTs) when multiple statistical comparisons are made. This increases the risk of a &amp;#039;&amp;#039;&amp;#039;Type I error&amp;#039;&amp;#039;&amp;#039; (false positives), potentially leading to incorrect conclusions about an intervention’s effectiveness or safety.  == Why Is Multiple Testing a Concern? ==  Each statistical test carries a risk of a false positive. As the number of tests increases, so...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= Multiple testing =&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Multiple testing&amp;#039;&amp;#039;&amp;#039;, also known as &amp;#039;&amp;#039;&amp;#039;multiplicity&amp;#039;&amp;#039;&amp;#039;, occurs in randomized controlled trials (RCTs) when multiple statistical comparisons are made. This increases the risk of a &amp;#039;&amp;#039;&amp;#039;Type I error&amp;#039;&amp;#039;&amp;#039; (false positives), potentially leading to incorrect conclusions about an intervention’s effectiveness or safety.&lt;br /&gt;
&lt;br /&gt;
== Why Is Multiple Testing a Concern? ==&lt;br /&gt;
&lt;br /&gt;
Each statistical test carries a risk of a false positive. As the number of tests increases, so does the overall chance of observing at least one false positive result.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Example:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* Significance level (α): 0.05&lt;br /&gt;
* Number of independent tests (n): 10&lt;br /&gt;
* Probability of at least one false positive:&lt;br /&gt;
&lt;br /&gt;
  P = 1 − (1 − α)^n  &lt;br /&gt;
  P = 1 − (0.95)^10 = 0.40&lt;br /&gt;
&lt;br /&gt;
This means a 40% chance of at least one false positive, compared to the intended 5%.&lt;br /&gt;
&lt;br /&gt;
== Common Sources of Multiple Testing in RCTs ==&lt;br /&gt;
&lt;br /&gt;
=== Multiple Primary Outcomes ===&lt;br /&gt;
Trials may include more than one primary endpoint (e.g., survival and quality of life).  &lt;br /&gt;
&amp;#039;&amp;#039;Example:&amp;#039;&amp;#039; A cardiovascular trial measuring both heart attacks and strokes.&lt;br /&gt;
&lt;br /&gt;
=== Multiple Secondary Outcomes ===&lt;br /&gt;
Additional secondary outcomes (e.g., blood pressure, cholesterol) increase the number of statistical tests and the potential for false positives.&lt;br /&gt;
&lt;br /&gt;
=== Interim Analyses ===&lt;br /&gt;
Planned interim analyses, such as those conducted for early stopping due to efficacy or futility, can inflate the Type I error rate if not properly adjusted.&lt;br /&gt;
&lt;br /&gt;
=== Subgroup Analyses ===&lt;br /&gt;
Exploring treatment effects across different subgroups (e.g., by age, sex, or comorbidity status) increases the number of comparisons and the likelihood of spurious findings.&lt;br /&gt;
&lt;br /&gt;
=== Multiple Treatment Arms ===&lt;br /&gt;
Trials with multiple intervention groups (e.g., placebo vs. low-dose vs. high-dose) involve several pairwise comparisons, each of which requires control for Type I error.&lt;br /&gt;
&lt;br /&gt;
== Methods to Control for Multiple Testing ==&lt;br /&gt;
&lt;br /&gt;
=== Bonferroni Correction ===&lt;br /&gt;
This method adjusts the significance level by dividing α by the number of comparisons.  &lt;br /&gt;
&amp;#039;&amp;#039;Example:&amp;#039;&amp;#039; With 5 tests, the new significance threshold becomes 0.05 / 5 = 0.01.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Pros:&amp;#039;&amp;#039;&amp;#039; Simple and widely used.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Cons:&amp;#039;&amp;#039;&amp;#039; Very conservative; may increase the risk of Type II error (false negatives).&lt;br /&gt;
&lt;br /&gt;
=== Holm’s Step-Down Method ===&lt;br /&gt;
A sequential version of the Bonferroni method that offers more power while still controlling the family-wise error rate.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Pros:&amp;#039;&amp;#039;&amp;#039; Less conservative than Bonferroni.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Cons:&amp;#039;&amp;#039;&amp;#039; Still relatively strict.&lt;br /&gt;
&lt;br /&gt;
=== Hochberg’s Step-Up Method ===&lt;br /&gt;
More powerful than Holm’s method, particularly under conditions of independence or positive correlation among tests.&lt;br /&gt;
&lt;br /&gt;
=== False Discovery Rate (FDR) Control ===&lt;br /&gt;
Controls the expected proportion of false discoveries among the rejected hypotheses.  &lt;br /&gt;
Often implemented using the Benjamini-Hochberg procedure.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Pros:&amp;#039;&amp;#039;&amp;#039; More flexible for exploratory analyses.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Cons:&amp;#039;&amp;#039;&amp;#039; Does not strictly control the family-wise error rate.&lt;br /&gt;
&lt;br /&gt;
=== Gatekeeping Procedures ===&lt;br /&gt;
Establish a predefined testing hierarchy. Secondary outcomes are only tested if the primary outcome is significant.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Pros:&amp;#039;&amp;#039;&amp;#039; Maintains error control while allowing multiple testing.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Cons:&amp;#039;&amp;#039;&amp;#039; Requires strict pre-specification in the trial protocol.&lt;br /&gt;
&lt;br /&gt;
=== Group Sequential Methods ===&lt;br /&gt;
Used in interim analyses, these methods (e.g., O’Brien-Fleming or Pocock boundaries) adjust the significance threshold at each look at the data.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Pros:&amp;#039;&amp;#039;&amp;#039; Controls for inflated Type I error in interim analyses.  &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Cons:&amp;#039;&amp;#039;&amp;#039; Requires careful planning and statistical expertise.&lt;br /&gt;
&lt;br /&gt;
== Best Practices for Managing Multiple Testing in RCTs ==&lt;br /&gt;
&lt;br /&gt;
=== Pre-specify Outcomes ===&lt;br /&gt;
Clearly define one primary outcome in the trial protocol. This helps maintain analytical focus and ensures proper control of the family-wise error rate.  &lt;br /&gt;
Limit the number of secondary outcomes to those that are clinically meaningful and hypothesis-driven. Avoid exploratory outcomes that introduce unnecessary multiplicity.&lt;br /&gt;
&lt;br /&gt;
=== Use Hierarchical Testing ===&lt;br /&gt;
Establish a predefined order for testing outcomes (e.g., primary → key secondary → exploratory).  &lt;br /&gt;
Each subsequent outcome is only tested if the preceding one is statistically significant. This approach preserves the overall Type I error rate.&lt;br /&gt;
&lt;br /&gt;
=== Plan Interim Analyses with Proper Adjustments ===&lt;br /&gt;
If interim analyses are planned, specify their timing and statistical boundaries in advance.  &lt;br /&gt;
Use group sequential designs to control the overall error rate across multiple analyses (e.g., with O’Brien-Fleming or Pocock boundaries).&lt;br /&gt;
&lt;br /&gt;
=== Limit Subgroup Analyses ===&lt;br /&gt;
Restrict subgroup analyses to those that are biologically plausible and pre-specified in the protocol.  &lt;br /&gt;
Avoid post hoc analyses that may produce misleading or non-reproducible results.&lt;br /&gt;
&lt;br /&gt;
=== Adjust for Multiple Comparisons ===&lt;br /&gt;
Apply correction methods appropriate to the number and nature of the hypotheses being tested.  &lt;br /&gt;
Use Bonferroni or Holm’s methods for confirmatory testing and FDR control for exploratory work.  &lt;br /&gt;
Gatekeeping procedures can help prioritize testing while maintaining validity.&lt;br /&gt;
&lt;br /&gt;
=== Ensure Transparent Reporting ===&lt;br /&gt;
Clearly describe how multiple testing was handled in the protocol, analysis, and publication.  &lt;br /&gt;
Follow CONSORT guidelines to report:&lt;br /&gt;
* the number of hypotheses tested,&lt;br /&gt;
* any interim or subgroup analyses performed,&lt;br /&gt;
* and the statistical methods used to control for multiplicity.&lt;br /&gt;
&lt;br /&gt;
== Example Application ==&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Study Design:&amp;#039;&amp;#039;&amp;#039;  &lt;br /&gt;
A diabetes RCT compares a new drug vs. placebo across three primary outcomes: HbA1c, weight loss, and cholesterol.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Problem:&amp;#039;&amp;#039;&amp;#039;  &lt;br /&gt;
Testing all three outcomes at α = 0.05 inflates the probability of a false positive.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Solutions:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* Apply Bonferroni correction (adjusted α = 0.05 / 3 = 0.017).&lt;br /&gt;
* Use hierarchical testing: test weight loss and cholesterol only if the HbA1c result is significant.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Outcome:&amp;#039;&amp;#039;&amp;#039;  &lt;br /&gt;
These approaches help ensure that trial findings are robust and not driven by chance.&lt;br /&gt;
&lt;br /&gt;
== Conclusion ==&lt;br /&gt;
&lt;br /&gt;
Multiple testing is a common and important consideration in RCT design and analysis. Without proper correction, it can lead to misleading results and overestimation of treatment effects.&lt;br /&gt;
&lt;br /&gt;
Using appropriate statistical adjustments—such as Bonferroni, Holm’s, or FDR control—combined with pre-specification of outcomes, controlled interim analyses, and transparent reporting, ensures valid and reproducible conclusions.&lt;/div&gt;</summary>
		<author><name>Lawrence</name></author>
	</entry>
</feed>