The Kruskal-Wallis test is one of the most misunderstood statistical tools in data analysis. While many analysts reach for it as a simple alternative to ANOVA, they often make critical mistakes that invalidate their results. This comprehensive guide reveals the common pitfalls to avoid and compares different approaches to ensure you apply this non-parametric test correctly for reliable, data-driven decisions.
What is the Kruskal-Wallis Test?
The Kruskal-Wallis test, also known as the Kruskal-Wallis H test or one-way ANOVA on ranks, is a non-parametric statistical method used to determine whether three or more independent groups differ significantly in their distribution. Named after William Kruskal and W. Allen Wallis who developed it in 1952, this test serves as the non-parametric alternative to the one-way ANOVA.
Unlike parametric tests that work with raw data values and assume normal distributions, the Kruskal-Wallis test converts all observations to ranks before analysis. This rank-based approach makes it robust against outliers and applicable to ordinal data or continuous data that violates normality assumptions.
The test works by ranking all observations from lowest to highest across all groups, then comparing the sum of ranks between groups. If groups truly differ, their rank sums will be substantially different. The test statistic (H) follows approximately a chi-square distribution, allowing us to calculate a p-value to determine statistical significance.
Key Concept: Rank-Based Analysis
The Kruskal-Wallis test doesn't compare means or medians directly. Instead, it compares the distributions of ranks across groups. This subtle but important distinction means you're testing whether groups tend to have systematically higher or lower values, not whether their central tendencies differ.
When to Use the Kruskal-Wallis Test
Choosing the right statistical test is crucial for valid conclusions. The Kruskal-Wallis test is appropriate in specific scenarios where parametric alternatives like ANOVA fall short.
Ideal Use Cases
You should use the Kruskal-Wallis test when you have:
- Three or more independent groups to compare (for two groups, use the Mann-Whitney U test instead)
- Ordinal data such as Likert scale responses, satisfaction ratings, or ranked preferences
- Non-normal distributions that cannot be transformed to normality
- Severe outliers that would distort parametric test results
- Small sample sizes where normality cannot be reliably assessed
- Unequal variances across groups (heteroscedasticity)
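Several of these conditions can be screened programmatically before committing to a test. A minimal sketch, assuming illustrative data and the conventional 0.05 threshold (Shapiro-Wilk for within-group normality, Levene's test for equal variances):

```python
# Sketch: quick assumption checks to decide between ANOVA and Kruskal-Wallis.
# Group data and the 0.05 threshold are illustrative assumptions.
from scipy.stats import shapiro, levene

group_a = [23, 25, 28, 29, 31, 24, 27]
group_b = [18, 20, 22, 24, 26, 19, 21]
group_c = [30, 33, 35, 37, 40, 90, 95]  # contains extreme values
groups = [group_a, group_b, group_c]

# Shapiro-Wilk tests normality within each group;
# Levene's test checks homogeneity of variances across groups.
norm_ps = [shapiro(g).pvalue for g in groups]
levene_p = levene(*groups).pvalue

normal = all(p > 0.05 for p in norm_ps)
equal_var = levene_p > 0.05

suggestion = "one-way ANOVA" if (normal and equal_var) else "Kruskal-Wallis"
print(f"Normality plausible: {normal}, equal variances: {equal_var}")
print(f"Suggested test: {suggestion}")
```

These checks are screening aids, not substitutes for visualizing the data; with small groups, both tests have limited power to detect violations.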
Comparing Approaches: Kruskal-Wallis vs. ANOVA
Understanding when to choose Kruskal-Wallis over ANOVA requires comparing their fundamental differences:
| Aspect | Kruskal-Wallis Test | One-Way ANOVA |
|---|---|---|
| Data type | Ordinal or continuous | Continuous only |
| Distribution assumption | None required | Normal distribution |
| What it compares | Distribution of ranks | Group means |
| Outlier sensitivity | Robust to outliers | Sensitive to outliers |
| Statistical power | Slightly lower (asymptotically ~95% of ANOVA's under normality) | Higher when assumptions met |
| Variance assumption | Similar shapes preferred | Equal variances required |
The choice between these approaches isn't always clear-cut. If your data meets ANOVA assumptions, ANOVA provides more statistical power. However, when assumptions are violated, the Kruskal-Wallis test produces more reliable results despite slightly lower power.
Real-World Scenarios
The Kruskal-Wallis test excels in business and research contexts such as:
- Comparing customer satisfaction scores across different product categories
- Analyzing employee performance ratings between departments
- Evaluating website conversion rates across multiple marketing channels
- Comparing response times for customer support tickets across regions
- Assessing product quality rankings from different suppliers
Key Assumptions of the Kruskal-Wallis Test
While the Kruskal-Wallis test requires fewer assumptions than parametric alternatives, it still has important requirements that must be met for valid results.
Independence of Observations
Each observation must be independent of all others. This is the most critical assumption. Violations occur when:
- The same subjects are measured multiple times (use Friedman test instead)
- Observations are clustered (e.g., multiple measurements from the same location)
- There are temporal dependencies in time-series data
Independence violations cannot be fixed with the Kruskal-Wallis test. You must use alternative methods designed for dependent data.
Appropriate Measurement Scale
Data must be at least ordinal (ranked or ordered). The test works with:
- Ordinal scales (rankings, ratings, levels)
- Interval scales (temperature, dates)
- Ratio scales (weight, height, revenue)
Nominal categorical data (colors, categories without order) cannot be used with this test.
Similar Distribution Shapes
For the most meaningful interpretation, groups should have similar distribution shapes. When this holds, the test effectively compares medians. When shapes differ substantially, the test compares entire distributions rather than location parameters.
This assumption is often overlooked but affects interpretation. You can assess it by creating histograms or density plots for each group and visually comparing their shapes.
Sufficient Sample Size
While no formal minimum exists, practical guidelines suggest:
- At least 5 observations per group (absolute minimum)
- Ideally 20+ observations per group for robust results
- Larger samples needed when distributions differ substantially
With very small samples, the chi-square approximation may be inaccurate. Some statistical packages use exact distributions for small samples.
Common Mistakes to Avoid When Applying the Test
Even experienced analysts make critical errors when implementing the Kruskal-Wallis test. Understanding these pitfalls helps ensure your analysis produces valid, actionable insights.
Mistake #1: Using It for Only Two Groups
The Kruskal-Wallis test is designed for three or more groups. With exactly two groups it will technically run, and it yields essentially the same two-sided p-value as the Mann-Whitney U test, but Mann-Whitney is the conventional choice for two-group comparisons: it supports one-sided alternatives and produces statistics (such as the U value) that are easier to interpret and report.
Always check your number of groups before selecting your test.
Mistake #2: Stopping After a Significant Result
A significant Kruskal-Wallis test tells you that at least one group differs from the others, but it doesn't identify which specific groups differ. Many analysts stop here, drawing unwarranted conclusions about specific group differences.
After a significant result, you must conduct post-hoc tests (such as Dunn's test or pairwise Mann-Whitney tests with Bonferroni correction) to identify which specific pairs of groups differ significantly. Skipping this step is like knowing someone in a room is lying but not investigating who.
Mistake #3: Misinterpreting What the Test Measures
A common misconception is that the Kruskal-Wallis test compares medians. While it can indicate median differences when distribution shapes are similar, the test actually compares the entire distribution of ranks across groups.
This distinction matters for interpretation. A significant result means groups have systematically different distributions, which could reflect differences in central tendency, spread, skewness, or any combination of distributional features.
Mistake #4: Ignoring Excessive Tied Ranks
Tied values receive averaged ranks in the Kruskal-Wallis test. While the test handles some ties, excessive ties (more than 25% of values being identical) can reduce statistical power and invalidate results.
This commonly occurs with:
- Rounded or truncated data (e.g., ages in whole years)
- Ordinal scales with few categories (e.g., 1-5 ratings)
- Data with many zero values or detection limits
Most statistical software applies tie corrections automatically, but you should still check the proportion of ties in your data and consider whether an alternative approach might be more appropriate.
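Checking the tie proportion takes only a few lines. A sketch, using the operationalization that an observation counts as "tied" if its value occurs more than once, with hypothetical 1-5 rating data:

```python
# Sketch: measure how heavily tied a sample is before trusting the
# Kruskal-Wallis result. The 25% guideline follows the text above.
from collections import Counter

values = [1, 2, 2, 3, 3, 3, 4, 4, 5, 5, 5, 5]  # hypothetical 1-5 ratings

counts = Counter(values)
# An observation is "tied" if its value occurs more than once.
n_tied = sum(c for c in counts.values() if c > 1)
tie_proportion = n_tied / len(values)

print(f"Proportion of tied observations: {tie_proportion:.0%}")
if tie_proportion > 0.25:
    print("Warning: heavy ties; verify tie corrections or consider another test.")
```

For coarse ordinal scales like this one, the proportion is typically well above 25%, which is exactly the situation the warning above describes.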
Mistake #5: Violating Independence Assumptions
Using the Kruskal-Wallis test with dependent data is a fundamental error that completely invalidates results. This happens when analysts fail to recognize:
- Repeated measures from the same subjects (use Friedman test)
- Matched or paired data (use Friedman test for multiple groups)
- Hierarchical or nested data structures (use mixed models)
- Time-series data with autocorrelation
Always carefully consider your study design and data collection method to ensure independence holds.
Mistake #6: Overlooking Effect Size
Statistical significance doesn't equal practical importance. A large sample can produce significant p-values for trivial differences, while important differences might not reach significance with small samples.
Always report effect sizes alongside p-values. For Kruskal-Wallis, use epsilon-squared (ε²) or eta-squared (η²) to quantify how much variance the group membership explains. Effect sizes provide context that p-values alone cannot.
Critical Mistakes Summary
The most damaging errors in Kruskal-Wallis testing are: (1) using it with dependent data, (2) stopping after the omnibus test without post-hoc comparisons, and (3) misinterpreting results as median comparisons when distributions differ. Avoiding these three mistakes will dramatically improve the validity of your conclusions.
How to Interpret Kruskal-Wallis Test Results
Proper interpretation transforms statistical output into actionable business insights. Here's how to read and communicate Kruskal-Wallis test results effectively.
Understanding the Test Statistic
The Kruskal-Wallis test produces an H statistic (sometimes called χ² or K) that measures how much the rank sums differ across groups. The formula accounts for:
- The number of groups being compared
- The total sample size
- The sum of ranks in each group
- Corrections for tied values
Larger H values indicate greater differences between groups. The H statistic follows approximately a chi-square distribution with k-1 degrees of freedom (where k is the number of groups).
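As a sketch of that last step, an H statistic can be converted to a p-value with the chi-square survival function. Here H = 18.47 and k = 3 are the example values used in the reporting section later in this guide:

```python
# Sketch: converting an H statistic into a p-value using the chi-square
# survival function, as described above. H = 18.47 and k = 3 are the
# example values from the reporting section of this guide.
from scipy.stats import chi2

H = 18.47   # Kruskal-Wallis test statistic
k = 3       # number of groups
df = k - 1  # degrees of freedom

p_value = chi2.sf(H, df)  # upper-tail (right-tail) probability
print(f"p = {p_value:.6f}")
```

The result is below 0.001, matching the "p < 0.001" reported for this example.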
Evaluating the P-Value
The p-value tells you the probability of observing rank differences as extreme as yours if all groups actually came from the same distribution (the null hypothesis).
Standard interpretation:
- p < 0.05: Significant evidence that at least one group differs (reject null hypothesis)
- p ≥ 0.05: Insufficient evidence of group differences (fail to reject null hypothesis)
Remember that 0.05 is a convention, not a law. Consider your field's standards and the consequences of errors when setting significance thresholds. High-stakes decisions might require p < 0.01 or even p < 0.001.
Calculating and Interpreting Effect Size
Effect size quantifies the magnitude of group differences. For Kruskal-Wallis, epsilon-squared (ε²) is calculated as:
ε² = H / [(n² − 1) / (n + 1)], which simplifies to ε² = H / (n − 1)
Where H is the test statistic and n is the total sample size. Interpretation guidelines:
- ε² ≈ 0.01: Small effect (groups differ slightly)
- ε² ≈ 0.06: Medium effect (moderate group differences)
- ε² ≈ 0.14+: Large effect (substantial group differences)
Always report effect sizes in addition to p-values to give stakeholders a complete picture of both statistical significance and practical importance.
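The formula and guidelines above can be wrapped in a small helper. A sketch, where H = 18.47 comes from this guide's reporting example but n = 90 is an assumed total sample size, and the cutoffs are conventional guidelines rather than hard rules:

```python
# Sketch: an epsilon-squared helper with the interpretation bands listed
# above. The thresholds are conventional guidelines, not hard rules.
def epsilon_squared(H, n):
    """Effect size for Kruskal-Wallis: H / [(n^2 - 1)/(n + 1)],
    which simplifies to H / (n - 1)."""
    return H / (n - 1)

def interpret(eps2):
    if eps2 >= 0.14:
        return "large"
    if eps2 >= 0.06:
        return "medium"
    if eps2 >= 0.01:
        return "small"
    return "negligible"

eps2 = epsilon_squared(H=18.47, n=90)  # n = 90 is an assumed sample size
print(f"epsilon^2 = {eps2:.3f} ({interpret(eps2)})")
```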
Post-Hoc Analysis for Pairwise Comparisons
After a significant Kruskal-Wallis result, post-hoc tests identify which specific groups differ. Common approaches include:
- Dunn's test: The most appropriate post-hoc test, specifically designed for Kruskal-Wallis
- Pairwise Mann-Whitney tests: With Bonferroni or other multiple comparison corrections
- Conover-Iman test: More powerful but less conservative alternative
Dunn's test is generally recommended because it accounts for the overall rank structure from the original Kruskal-Wallis test. Multiple testing corrections (Bonferroni, Holm, Benjamini-Hochberg) control the false positive rate when making multiple comparisons.
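The pairwise Mann-Whitney option above can be sketched with SciPy alone, applying the Bonferroni correction by hand (multiply each raw p-value by the number of comparisons, capped at 1). The three groups are illustrative data, not from this guide's case study:

```python
# Sketch: pairwise Mann-Whitney U tests with a manual Bonferroni
# correction, one of the post-hoc options listed above.
from itertools import combinations
from scipy.stats import mannwhitneyu

groups = {
    "A": [23, 25, 28, 29, 31],
    "B": [18, 20, 22, 24, 26],
    "C": [30, 33, 35, 37, 40],
}

pairs = list(combinations(groups, 2))
m = len(pairs)  # number of comparisons for the Bonferroni correction

results = {}
for g1, g2 in pairs:
    u, p = mannwhitneyu(groups[g1], groups[g2], alternative="two-sided")
    results[(g1, g2)] = min(p * m, 1.0)  # Bonferroni-adjusted p-value
    print(f"{g1} vs {g2}: U = {u:.1f}, adjusted p = {results[(g1, g2)]:.4f}")
```

For a less conservative correction such as Holm or Benjamini-Hochberg, a library implementation (e.g. `statsmodels.stats.multitest.multipletests`) is preferable to hand-rolling the procedure.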
Reporting Results
A complete results statement reads like this:

> A Kruskal-Wallis test revealed a statistically significant difference in customer satisfaction scores across the three service tiers, H(2) = 18.47, p < 0.001, ε² = 0.21. Post-hoc Dunn's tests with Bonferroni correction showed that Premium tier customers (mean rank = 65.3) reported significantly higher satisfaction than both Standard (mean rank = 42.1, p < 0.001) and Basic tier customers (mean rank = 38.7, p < 0.001). No significant difference was found between Standard and Basic tiers (p = 0.42).
This format provides the test statistic, degrees of freedom, p-value, effect size, mean ranks for context, and specific pairwise comparisons—everything readers need to understand both the statistical evidence and practical implications.
Comparing Approaches: Manual vs. Software Implementation
You can calculate the Kruskal-Wallis test manually or use statistical software. Understanding the trade-offs helps you choose the right approach for your situation.
Manual Calculation Approach
Computing the test by hand involves these steps:
- Combine all observations and rank them from lowest to highest
- Assign average ranks to tied values
- Sum the ranks for each group separately
- Calculate the H statistic using the formula
- Apply tie corrections if necessary
- Compare H to the chi-square distribution to find the p-value
Manual calculation offers deep understanding of the mechanics but is error-prone, time-consuming, and impractical for large datasets. It's valuable for learning but rarely appropriate for actual analysis.
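The manual steps above can be sketched in a few lines and cross-checked against SciPy. With no ties in this illustrative data, the tie correction is 1 and the hand-computed H matches the library exactly:

```python
# Sketch of the manual steps above: rank everything, sum ranks per group,
# compute H, then cross-check against scipy.stats.kruskal.
import numpy as np
from scipy.stats import kruskal, rankdata

groups = [[23, 25, 28, 29, 31], [18, 20, 22, 24, 26], [30, 33, 35, 37, 40]]

pooled = np.concatenate(groups)
ranks = rankdata(pooled)  # assigns average ranks to any ties
n = len(pooled)

# Split the pooled ranks back into their groups and sum each group.
sizes = [len(g) for g in groups]
splits = np.split(ranks, np.cumsum(sizes)[:-1])
rank_sums = [s.sum() for s in splits]

# H = 12/(n(n+1)) * sum(R_i^2 / n_i) - 3(n+1), before tie correction.
H = 12 / (n * (n + 1)) * sum(R**2 / ni for R, ni in zip(rank_sums, sizes)) - 3 * (n + 1)

H_scipy, p = kruskal(*groups)
print(f"Manual H = {H:.4f}, SciPy H = {H_scipy:.4f}")
```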
Software Implementation Approach
Modern statistical software handles Kruskal-Wallis testing efficiently:
Python (SciPy):

```python
from scipy.stats import kruskal

# Example data: three groups
group1 = [23, 25, 28, 29, 31]
group2 = [18, 20, 22, 24, 26]
group3 = [30, 33, 35, 37, 40]

# Perform test
statistic, p_value = kruskal(group1, group2, group3)
print(f"H-statistic: {statistic:.2f}")
print(f"P-value: {p_value:.4f}")
```
R:

```r
# Using base R
group1 <- c(23, 25, 28, 29, 31)
group2 <- c(18, 20, 22, 24, 26)
group3 <- c(30, 33, 35, 37, 40)

# Combine into a data frame
data <- data.frame(
  value = c(group1, group2, group3),
  group = factor(rep(c("A", "B", "C"), each = 5))
)

# Perform test
result <- kruskal.test(value ~ group, data = data)
print(result)
```
Software advantages include automatic tie corrections, exact p-values for small samples, built-in post-hoc tests, and integration with data visualization. The main disadvantage is treating the test as a "black box" without understanding the underlying mechanics.
Best Practice: Hybrid Approach
The optimal strategy combines conceptual understanding with software efficiency:
- Learn the manual calculation process to understand what the test does
- Use software for actual analyses to ensure accuracy and efficiency
- Always verify software results make sense given your data
- Visualize your data before and after testing to catch potential issues
Real-World Example: E-Commerce Platform Analysis
Let's walk through a complete Kruskal-Wallis analysis using a realistic business scenario.
The Business Question
An e-commerce company wants to determine if customer purchase amounts differ across three marketing channels: Email, Social Media, and Paid Search. They have purchase data from 75 customers (25 from each channel) over the past month.
Step 1: Examine the Data
Before running any test, create visualizations:
```python
# Python visualization (assumes a DataFrame `df` with
# 'channel' and 'purchase_amount' columns)
import matplotlib.pyplot as plt
import seaborn as sns

# Create box plots
sns.boxplot(x='channel', y='purchase_amount', data=df)
plt.title('Purchase Amount Distribution by Channel')
plt.ylabel('Purchase Amount ($)')
plt.xlabel('Marketing Channel')
plt.show()
```
The box plots reveal that Social Media has several high-value outliers and the distributions are skewed, making ANOVA inappropriate. The Kruskal-Wallis test is the right choice.
Step 2: Check Assumptions
- Independence: Each customer made one purchase, and customers are unrelated ✓
- Measurement scale: Purchase amount is ratio data ✓
- Sample size: 25 observations per group is adequate ✓
- Distribution shapes: All three channels show right-skewed distributions with similar shapes ✓
Step 3: Conduct the Test
```python
from scipy.stats import kruskal

email = df[df['channel'] == 'Email']['purchase_amount']
social = df[df['channel'] == 'Social Media']['purchase_amount']
search = df[df['channel'] == 'Paid Search']['purchase_amount']

H, p = kruskal(email, social, search)
print(f"Kruskal-Wallis H-statistic: {H:.3f}")
print(f"P-value: {p:.4f}")
```
Results: H(2) = 12.84, p = 0.0016
Step 4: Calculate Effect Size
```python
n = len(df)
epsilon_squared = H / ((n**2 - 1) / (n + 1))  # simplifies to H / (n - 1)
print(f"Effect size (ε²): {epsilon_squared:.3f}")
```
Effect size: ε² = 0.17 (large effect)
Step 5: Post-Hoc Testing
```python
from scikit_posthocs import posthoc_dunn

# Perform Dunn's test with Bonferroni correction
dunn_results = posthoc_dunn(df, val_col='purchase_amount',
                            group_col='channel', p_adjust='bonferroni')
print(dunn_results)
```
Post-hoc results reveal:
- Social Media vs. Email: p = 0.002 (significant)
- Social Media vs. Paid Search: p = 0.018 (significant)
- Email vs. Paid Search: p = 0.524 (not significant)
Step 6: Business Interpretation
The analysis reveals that Social Media drives significantly higher purchase amounts (median: $156) compared to both Email (median: $98) and Paid Search (median: $102). Email and Paid Search perform similarly.
Business recommendation: Increase investment in Social Media marketing campaigns, as they attract higher-value customers. The large effect size (ε² = 0.17) indicates this is a substantial, practically meaningful difference worth acting upon.
Analysis Checklist
For every Kruskal-Wallis analysis: (1) visualize distributions first, (2) verify all assumptions, (3) calculate effect sizes, (4) conduct appropriate post-hoc tests, and (5) translate statistical findings into clear business recommendations. This five-step process ensures rigorous, actionable insights.
Best Practices for Kruskal-Wallis Testing
Following these evidence-based practices will improve the reliability and impact of your analyses.
Always Visualize First
Create box plots, violin plots, or histograms before testing. Visualization helps you:
- Identify outliers that might influence results
- Assess whether distribution shapes are similar across groups
- Spot data entry errors or impossible values
- Develop intuition about likely test outcomes
- Communicate findings more effectively to stakeholders
Never run statistical tests blindly on data you haven't visually examined.
Document Your Decision Process
Record why you chose Kruskal-Wallis over alternatives. Document:
- Which ANOVA assumptions were violated and how you assessed them
- Sample sizes and their adequacy for the test
- Whether you checked for independence violations
- What transformations you tried (if any) before choosing a non-parametric approach
This documentation justifies your analytical choices and helps reviewers understand your reasoning.
Report Complete Results
Include all relevant statistics in your reports:
- Test statistic and degrees of freedom: H(df)
- Exact p-value (not just "p < 0.05")
- Effect size measure (ε² or η²)
- Mean ranks for each group for context
- Sample sizes per group
- Results of post-hoc tests with corrections applied
Complete reporting enables others to evaluate your analysis and compare results across studies.
Consider Power and Sample Size
Before collecting data, conduct a power analysis to determine adequate sample sizes. When data are near-normal, the Kruskal-Wallis test typically needs only about 5% more observations than ANOVA to achieve equivalent power (its asymptotic relative efficiency is roughly 95%), and for heavy-tailed distributions it can even be more powerful.
Post-analysis, consider whether non-significant results might reflect insufficient power rather than truly absent effects. This is especially important with small samples where important differences might not reach statistical significance.
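Closed-form power formulas for Kruskal-Wallis are not widely available, so simulation is a practical planning tool. A minimal Monte Carlo sketch, assuming right-skewed (lognormal) data and an illustrative location shift of 0.5 in one group; both are assumptions to replace with your own scenario:

```python
# Sketch: Monte Carlo power estimate for a Kruskal-Wallis design.
# The lognormal model and the shift sizes are planning assumptions.
import numpy as np
from scipy.stats import kruskal

rng = np.random.default_rng(42)

def simulated_power(n_per_group, shifts, n_sims=2000, alpha=0.05):
    """Fraction of simulated experiments where the KW test rejects."""
    hits = 0
    for _ in range(n_sims):
        # Right-skewed data (lognormal) with a location shift per group.
        groups = [rng.lognormal(mean=0.0, sigma=0.5, size=n_per_group) + s
                  for s in shifts]
        _, p = kruskal(*groups)
        hits += p < alpha
    return hits / n_sims

power = simulated_power(n_per_group=25, shifts=[0.0, 0.0, 0.5])
print(f"Estimated power: {power:.2f}")
```

Re-running with all shifts at zero should return a rejection rate near alpha, which doubles as a sanity check on the simulation.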
Validate Findings When Possible
Whenever feasible, validate your results through:
- Replication with new data samples
- Sensitivity analyses using different statistical approaches
- Cross-validation with domain expert knowledge
- Comparison with historical patterns or benchmarks
A single statistical test is evidence, not proof. Converging evidence from multiple sources builds confidence in conclusions.
Pair Statistics with Subject Matter Expertise
Statistical significance means nothing without practical context. Always ask:
- Is the observed difference large enough to matter to the business?
- Do the results align with domain knowledge and theoretical expectations?
- What are the costs and benefits of acting on these findings?
- Could confounding variables explain the observed differences?
The best analyses combine statistical rigor with deep domain understanding.
Related Statistical Techniques
The Kruskal-Wallis test belongs to a family of non-parametric methods. Understanding related techniques helps you select the most appropriate tool for each situation.
Mann-Whitney U Test
The Mann-Whitney U test (also called Wilcoxon rank-sum test) compares two independent groups. Use it instead of Kruskal-Wallis when you have only two groups to compare. It's the non-parametric alternative to the independent samples t-test and provides more detailed output for two-group comparisons.
Friedman Test
The Friedman test handles repeated measures or matched data with three or more conditions. Use it when the same subjects are measured multiple times or when observations are matched across groups. It's the non-parametric alternative to repeated measures ANOVA.
One-Way ANOVA
When your data meets parametric assumptions (normality, equal variances, continuous data), one-way ANOVA offers more statistical power than Kruskal-Wallis. Always check assumptions before defaulting to non-parametric tests—parametric tests are preferable when assumptions hold.
Mood's Median Test
Mood's median test is another non-parametric alternative for comparing groups, but it's less powerful than Kruskal-Wallis. It's simpler to calculate manually but provides less information and has lower statistical power. Use Kruskal-Wallis unless you specifically need to test medians with very small samples.
Permutation Tests
Permutation tests provide flexible alternatives that work with various test statistics and make minimal assumptions. They're computationally intensive but increasingly practical with modern computing power. Consider permutation tests when your data has unusual features that violate standard test assumptions.
Choosing the Right Test
Follow this decision tree:
- Number of groups? Two groups → Mann-Whitney; Three+ groups → continue
- Independent or repeated measures? Independent → continue; Repeated → Friedman test
- Data meet ANOVA assumptions? Yes → One-way ANOVA; No → Kruskal-Wallis
- After Kruskal-Wallis? Significant result → Dunn's post-hoc test
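The decision tree above can be expressed as a tiny helper function. A sketch; the inputs are judgments you must make from your study design, not values software can determine for you:

```python
# Sketch: the decision tree above as a helper function. The paired
# two-group branch (Wilcoxon signed-rank) is an addition beyond the tree.
def choose_test(n_groups, independent, anova_assumptions_met):
    if n_groups < 2:
        raise ValueError("need at least two groups")
    if not independent:
        # Dependent data: Friedman for 3+ conditions, signed-rank for 2.
        return "Friedman test" if n_groups >= 3 else "Wilcoxon signed-rank test"
    if n_groups == 2:
        return "Mann-Whitney U test"
    return "one-way ANOVA" if anova_assumptions_met else "Kruskal-Wallis test"

print(choose_test(3, independent=True, anova_assumptions_met=False))
```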
Avoiding Critical Pitfalls: A Decision Framework
This framework helps you avoid the most common and damaging mistakes when applying the Kruskal-Wallis test.
Pre-Analysis Checks
Before running the test, verify:
- Data structure: Do you have 3+ independent groups? (If not, use Mann-Whitney)
- Independence: Are observations truly independent? (If not, use Friedman or mixed models)
- Sample size: Do you have 5+ observations per group? (If not, consider exact tests)
- Measurement scale: Is data at least ordinal? (If nominal, use chi-square test)
During Analysis
While conducting the test:
- Visualize first: Create plots to understand data distributions and spot outliers
- Check ties: Calculate the proportion of tied values (concerning if >25%)
- Verify software settings: Ensure tie corrections are enabled
- Document decisions: Record why you chose this test over alternatives
Post-Analysis Requirements
After obtaining results:
- Calculate effect size: Always compute and report ε² or η²
- Conduct post-hoc tests: If p < 0.05, run Dunn's test with appropriate corrections
- Check assumptions held: Verify independence wasn't violated
- Interpret cautiously: Remember the test compares distributions, not necessarily medians
- Validate if possible: Cross-check findings with domain knowledge
Reporting Standards
Your final report must include:
- Complete test statistics: H(df) = X.XX, p = X.XXX, ε² = X.XX
- Descriptive statistics: sample sizes, medians, and mean ranks per group
- Post-hoc test results with multiple testing corrections specified
- Effect size interpretation (small/medium/large)
- Practical implications for decision-making
Conclusion
The Kruskal-Wallis test is a powerful tool for comparing three or more independent groups when parametric assumptions fail. However, its value depends entirely on correct application and interpretation. By understanding common mistakes—using it with dependent data, stopping before post-hoc tests, and misinterpreting what it measures—you can avoid the pitfalls that invalidate many analyses.
The comparison of approaches throughout this guide highlights that statistical testing isn't about blindly applying formulas. It requires understanding your data's characteristics, selecting appropriate methods, verifying assumptions, calculating effect sizes, and translating statistical findings into actionable business insights.
Whether you're analyzing customer behavior, evaluating product performance, or testing marketing strategies, the Kruskal-Wallis test provides a robust framework for data-driven decisions. Apply the best practices and decision frameworks outlined here to ensure your analyses are both statistically valid and practically valuable.
Remember: statistics is a tool for understanding reality, not a substitute for critical thinking. Combine rigorous methodology with domain expertise, always visualize your data, and focus on effect sizes alongside significance tests. These practices transform statistical analysis from a mechanical process into a source of genuine business insight.
Frequently Asked Questions
When should I use Kruskal-Wallis instead of ANOVA?
Use the Kruskal-Wallis test when your data violates ANOVA assumptions: non-normal distributions, ordinal data, severe outliers, or unequal variances across groups. It's the non-parametric alternative that ranks data instead of using raw values. However, if your data meets ANOVA assumptions, prefer ANOVA for its greater statistical power.
What is the minimum sample size for Kruskal-Wallis test?
The test can technically run with very small groups, but treat 5 observations per group as a practical minimum. For more robust conclusions, aim for 20+ observations per group when possible. Smaller samples reduce statistical power and may make the chi-square approximation inaccurate; some statistical packages offer exact p-values in that case.
How do I interpret Kruskal-Wallis test results?
The test produces a chi-square statistic (H) and p-value. If p < 0.05 (typical threshold), at least one group differs significantly from others. The test doesn't tell you which groups differ—use post-hoc tests like Dunn's test for pairwise comparisons. Always report effect sizes (ε²) alongside p-values to indicate practical significance.
Can Kruskal-Wallis handle tied ranks?
Yes, the test handles ties by assigning average ranks to tied values. However, excessive ties (more than 25% of values) can reduce statistical power. Most statistical software automatically adjusts for ties in the calculation. If your data has many ties, verify that tie corrections are enabled in your software.
What are common mistakes when using Kruskal-Wallis test?
Common mistakes include: using it with only two groups (use Mann-Whitney instead), ignoring post-hoc tests after significant results, assuming it tests medians (it tests distributions), violating independence assumptions, and misinterpreting effect sizes. Always visualize your data first, verify assumptions, and conduct appropriate post-hoc tests.