Hypothesis testing is a procedure in statistics used to determine whether there is enough evidence in a sample of data to infer that a certain condition holds for the entire population. It involves making an initial assumption, known as the null hypothesis, and determining whether the data provide sufficient evidence to reject this assumption in favor of an alternative hypothesis.
The two primary statements in hypothesis testing are the null hypothesis (\(H_0\)) and the alternative hypothesis (\(H_1\) or \(H_a\)).
The significance level, denoted by \(\alpha\), is the probability of rejecting the null hypothesis when it is actually true. Commonly used significance levels are 0.05, 0.01, and 0.10. $$\alpha = 0.05$$ This means there is a 5% risk of concluding that a difference exists when there is no actual difference.
A test statistic is a standardized value computed from sample data during a hypothesis test. It is used to decide whether to reject the null hypothesis. The choice of test statistic depends on the type of data and the hypothesis being tested. Common test statistics include the z-score and t-score. $$z = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}}$$ Where \(\bar{x}\) is the sample mean, \(\mu\) is the hypothesized population mean, \(\sigma\) is the population standard deviation, and \(n\) is the sample size.
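As a quick sketch, the z statistic can be computed directly from these quantities; the sample mean, hypothesized mean, standard deviation, and sample size below are made-up illustrative values.

```python
import math

# Hypothetical example values: sample mean, hypothesized population mean,
# known population standard deviation, and sample size.
x_bar, mu, sigma, n = 52.3, 50.0, 8.0, 64

# z = (x̄ - μ) / (σ / √n)
z = (x_bar - mu) / (sigma / math.sqrt(n))
print(f"z = {z:.3f}")  # z = 2.300
```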
The p-value is the probability of obtaining test results at least as extreme as the observed results, assuming that the null hypothesis is true. It quantifies the evidence against the null hypothesis.
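A minimal sketch of converting a z statistic into a two-tailed p-value with SciPy; the z value carries over from the hypothetical example above.

```python
from scipy.stats import norm

z = 2.30  # test statistic from the hypothetical z-test above

# Two-tailed p-value: probability of a result at least this extreme
# in either direction, assuming H0 is true.
p_value = 2 * norm.sf(abs(z))
print(f"p = {p_value:.4f}")  # p ≈ 0.0214
```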
In hypothesis testing, two types of errors can occur: a Type I error, rejecting \(H_0\) when it is actually true (with probability \(\alpha\)), and a Type II error, failing to reject \(H_0\) when it is actually false (with probability \(\beta\)).
The power of a test is the probability that it correctly rejects a false null hypothesis. It is calculated as \(1 - \beta\), where \(\beta\) is the probability of a Type II error. A higher power indicates a more effective test. $$\text{Power} = 1 - \beta$$
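For a one-sided z-test, power can be computed directly from the standard normal distribution; the sketch below uses illustrative values for the null and assumed true means.

```python
import math
from scipy.stats import norm

# Illustrative assumptions for a one-sided z-test
mu0, mu1 = 50.0, 53.0   # null mean and assumed true mean
sigma, n, alpha = 8.0, 64, 0.05

se = sigma / math.sqrt(n)
z_crit = norm.ppf(1 - alpha)  # rejection cutoff on the z scale

# Power = P(reject H0 | true mean = mu1) = 1 - beta
power = norm.sf(z_crit - (mu1 - mu0) / se)
print(f"power = {power:.3f}")  # ≈ 0.912
```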
Hypothesis tests can be one-tailed or two-tailed, depending on the direction of the alternative hypothesis. A one-tailed test looks for an effect in a single direction (e.g., \(H_1: \mu > \mu_0\)), while a two-tailed test detects a difference in either direction (\(H_1: \mu \neq \mu_0\)).
A confidence interval is a range of values derived from the sample data that is likely to contain the population parameter. It provides an estimated range of values which is likely to include an unknown population parameter. $$\bar{x} \pm z \left(\frac{\sigma}{\sqrt{n}}\right)$$ Where \(\bar{x}\) is the sample mean, \(z\) is the critical value for the desired confidence level, \(\sigma\) is the population standard deviation, and \(n\) is the sample size.
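A short sketch of computing this interval; the sample summary values are illustrative and assume a known population \(\sigma\).

```python
import math
from scipy.stats import norm

# Hypothetical sample summary with known population sigma
x_bar, sigma, n, conf = 52.3, 8.0, 64, 0.95

z = norm.ppf(1 - (1 - conf) / 2)  # z ≈ 1.96 for 95% confidence
margin = z * sigma / math.sqrt(n)

print(f"{conf:.0%} CI: ({x_bar - margin:.2f}, {x_bar + margin:.2f})")
```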
Certain assumptions must be met for hypothesis testing to be valid: the sample should be drawn randomly, observations should be independent, and many parametric tests additionally require approximate normality and, in some cases, equal variances.
The hypothesis testing process involves several systematic steps: state the null and alternative hypotheses, choose the significance level \(\alpha\), select the appropriate test, compute the test statistic, interpret the results, and decide whether to reject \(H_0\).
Hypothesis tests vary based on the type of data and the parameter being tested: for example, z-tests and t-tests for means, chi-square tests for categorical data, and ANOVA for comparing several group means.
Effect size measures the magnitude of the difference or relationship. Unlike p-values, which indicate significance, effect size quantifies the strength of the effect. $$d = \frac{\bar{x}_1 - \bar{x}_2}{s}$$ Where \(\bar{x}_1\) and \(\bar{x}_2\) are the two sample means and \(s\) is the pooled standard deviation.
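A small sketch of Cohen's d with a pooled standard deviation; the two groups below are made-up data.

```python
import math

# Two hypothetical samples
group1 = [5.1, 6.2, 5.8, 6.5, 5.9]
group2 = [4.2, 4.8, 5.0, 4.5, 4.9]

def mean(xs):
    return sum(xs) / len(xs)

def sample_var(xs):
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

# Pooled standard deviation across the two groups
n1, n2 = len(group1), len(group2)
s_pooled = math.sqrt(((n1 - 1) * sample_var(group1) +
                      (n2 - 1) * sample_var(group2)) / (n1 + n2 - 2))

d = (mean(group1) - mean(group2)) / s_pooled
print(f"Cohen's d = {d:.2f}")
```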
A sampling distribution is the probability distribution of a given statistic based on a random sample. Understanding sampling distributions is essential for determining how a statistic behaves under repeated sampling.
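A small simulation sketch illustrating how the sample mean behaves under repeated sampling; the population and sample size are made up.

```python
import random

# Simulate the sampling distribution of the mean from a skewed population.
random.seed(0)
population = [random.expovariate(1.0) for _ in range(100_000)]
n = 30

sample_means = []
for _ in range(5_000):
    sample = random.sample(population, n)
    sample_means.append(sum(sample) / n)

# By the central limit theorem, these means cluster near the population
# mean and look approximately normal even though the population is skewed.
print(f"mean of sample means ≈ {sum(sample_means) / len(sample_means):.3f}")
```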
Power analysis is used to determine the sample size required to detect an effect of a given size with a certain degree of confidence. It helps in designing experiments that are adequately powered to find meaningful results. $$n = \left(\frac{(z_{\alpha/2} + z_{\beta}) \cdot \sigma}{\Delta}\right)^2$$ Where \(z_{\alpha/2}\) and \(z_{\beta}\) are the critical values for the chosen significance level and power, \(\sigma\) is the population standard deviation, and \(\Delta\) is the minimum effect size to be detected.
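A minimal sketch of this sample-size formula; the design inputs (significance level, target power, \(\sigma\), and \(\Delta\)) are illustrative assumptions.

```python
import math
from scipy.stats import norm

# Illustrative design assumptions for a two-sided z-test
alpha, power = 0.05, 0.80
sigma, delta = 8.0, 3.0  # population SD and smallest effect worth detecting

z_alpha = norm.ppf(1 - alpha / 2)  # ≈ 1.96
z_beta = norm.ppf(power)           # ≈ 0.84

n = ((z_alpha + z_beta) * sigma / delta) ** 2
print(f"required n ≈ {math.ceil(n)}")  # ≈ 56
```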
Bayesian hypothesis testing differs from the traditional frequentist approach by incorporating prior beliefs or evidence before observing the data. It calculates the posterior probability of hypotheses, updating beliefs based on new data. $$P(H_0 | \text{data}) = \frac{P(\text{data} | H_0) P(H_0)}{P(\text{data})}$$ Where \(P(H_0)\) is the prior probability of the null hypothesis, \(P(\text{data} | H_0)\) is the likelihood of the data under \(H_0\), and \(P(\text{data})\) is the total probability of the data across all hypotheses.
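A toy sketch of the posterior update for two competing hypotheses; the prior and likelihood values are made up for illustration.

```python
# Toy Bayesian update over two competing hypotheses.
prior_h0, prior_h1 = 0.5, 0.5  # prior beliefs
like_h0, like_h1 = 0.02, 0.10  # P(data | H0), P(data | H1)

# Total probability of the data under both hypotheses
p_data = like_h0 * prior_h0 + like_h1 * prior_h1

posterior_h0 = like_h0 * prior_h0 / p_data
print(f"P(H0 | data) = {posterior_h0:.3f}")  # ≈ 0.167
```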
When conducting multiple hypothesis tests simultaneously, the probability of making Type I errors increases. Techniques like the Bonferroni correction adjust the significance level to account for multiple comparisons. $$\alpha_{\text{adjusted}} = \frac{\alpha}{m}$$ Where \(m\) is the number of tests being performed.
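A short sketch applying the Bonferroni threshold to a list of illustrative p-values.

```python
# Bonferroni correction for m simultaneous tests; p-values are illustrative.
alpha = 0.05
p_values = [0.001, 0.012, 0.049, 0.200]
m = len(p_values)

alpha_adjusted = alpha / m  # 0.0125
for p in p_values:
    verdict = "reject H0" if p < alpha_adjusted else "fail to reject"
    print(f"p = {p:.3f}: {verdict} (threshold {alpha_adjusted:.4f})")
```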
Non-parametric tests do not assume a specific distribution for the data and are useful when data do not meet the assumptions of parametric tests. Examples include the Mann-Whitney U test and the Kruskal-Wallis test.
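As an illustration, SciPy's `mannwhitneyu` implements the Mann-Whitney U test; the two samples below are made-up values.

```python
from scipy.stats import mannwhitneyu

# Two small illustrative samples that need not be normally distributed
group_a = [1.2, 3.4, 2.2, 5.6, 2.9, 4.1]
group_b = [2.0, 6.5, 5.9, 7.2, 6.8, 5.5]

stat, p = mannwhitneyu(group_a, group_b, alternative="two-sided")
print(f"U = {stat}, p = {p:.4f}")
```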
Effect modification occurs when the effect of the primary exposure on an outcome differs depending on the level of another variable. Understanding interactions is crucial for accurate interpretation of results.
Sequential testing involves evaluating data as it is collected, allowing for early termination of a study if results are conclusive. This approach can lead to more efficient experiments but requires careful control of error rates.
A confidence distribution represents a distribution of confidence intervals for a parameter and provides a complete inference about the parameter, integrating both frequentist and Bayesian perspectives.
Adjusted p-values account for multiple testing and control the false discovery rate. Methods like the Benjamini-Hochberg procedure are used to adjust p-values in the presence of multiple hypotheses. $$\text{Adjusted } p_i = \frac{p_i m}{R}$$ Where \(p_i\) is the \(i\)-th p-value, \(m\) is the total number of tests, and \(R\) is the rank of \(p_i\) among the sorted p-values.
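A minimal sketch of the Benjamini-Hochberg step-up adjustment, including the monotonicity enforcement the raw formula omits; the p-values are illustrative.

```python
# Benjamini-Hochberg adjusted p-values (step-up procedure).
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.060, 0.074, 0.205]
m = len(p_values)

order = sorted(range(m), key=lambda i: p_values[i])
adjusted = [0.0] * m
prev = 1.0
# Walk from the largest p-value down, enforcing monotonicity
for rank in range(m, 0, -1):
    i = order[rank - 1]
    prev = min(prev, p_values[i] * m / rank)
    adjusted[i] = prev

for p, q in zip(p_values, adjusted):
    print(f"p = {p:.3f} -> adjusted {q:.3f}")
```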
The likelihood ratio compares the likelihood of the data under two competing hypotheses. It is a fundamental concept in statistical inference and model comparison. $$\text{LR} = \frac{P(\text{data} | H_1)}{P(\text{data} | H_0)}$$
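A toy sketch comparing two simple hypotheses about a normal mean; the data and parameter values are illustrative assumptions.

```python
from scipy.stats import norm

# Illustrative data and two simple hypotheses about a normal mean
data = [1.9, 2.4, 2.1, 2.8, 2.2]
mu_h0, mu_h1, sigma = 2.0, 2.5, 0.5  # sigma assumed known

def likelihood(mu):
    prod = 1.0
    for x in data:
        prod *= norm.pdf(x, loc=mu, scale=sigma)
    return prod

lr = likelihood(mu_h1) / likelihood(mu_h0)
print(f"LR = {lr:.3f}")  # LR > 1 favors H1
```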
The sequential probability ratio test (SPRT) is a statistical method for testing hypotheses that allows for continuous monitoring of data and making decisions to accept or reject hypotheses as data are collected. It optimizes the testing process by minimizing the average sample number needed. $$\Lambda_n = \prod_{i=1}^{n} \frac{P(x_i | H_1)}{P(x_i | H_0)}$$ Where \(\Lambda_n\) is the likelihood ratio after \(n\) observations and \(x_i\) is the \(i\)-th observation.
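A minimal sketch of Wald's SPRT for a normal mean with known variance; the error rates, hypothesized means, and observations are all illustrative.

```python
import math
from scipy.stats import norm

# Wald's SPRT for a normal mean with known sigma; numbers are illustrative.
mu0, mu1, sigma = 0.0, 1.0, 1.0
alpha, beta = 0.05, 0.10

# Decision boundaries on the log-likelihood-ratio scale
upper = math.log((1 - beta) / alpha)  # accept H1 above this
lower = math.log(beta / (1 - alpha))  # accept H0 below this

observations = [0.8, 1.3, 0.4, 1.6, 1.1, 0.9]
log_lr = 0.0
for i, x in enumerate(observations, start=1):
    log_lr += norm.logpdf(x, mu1, sigma) - norm.logpdf(x, mu0, sigma)
    if log_lr >= upper:
        print(f"accept H1 after {i} observations")
        break
    if log_lr <= lower:
        print(f"accept H0 after {i} observations")
        break
else:
    print("no decision yet; keep sampling")
```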
Robustness refers to the ability of a hypothesis test to remain valid under violations of its assumptions. Robust tests maintain their validity even when certain assumptions, such as normality or equal variances, are not strictly met.
Composite hypotheses specify a range of possible values for the parameter, as opposed to simple hypotheses which specify exact values. Testing composite hypotheses often requires more complex statistical methods.
The power function maps the true parameter values to the probability of correctly rejecting the null hypothesis. It provides a detailed view of a test's ability to detect different effect sizes. $$\beta(\theta) = P(\text{Reject } H_0 | \theta)$$ Where \(\theta\) is the true value of the parameter (here \(\beta(\theta)\) denotes the power function, not the Type II error rate).
Meta-analysis combines results from multiple studies to improve estimates of the effect size and increase the power of hypothesis testing. It provides a more comprehensive understanding of a research question by aggregating diverse data sources.
Multivariate hypothesis testing involves multiple dependent variables and assesses the relationships between them. Techniques like MANOVA (Multivariate Analysis of Variance) extend traditional ANOVA to handle multiple outcomes simultaneously.
Resampling methods, including bootstrapping and permutation tests, involve repeatedly drawing samples from the data to assess the variability of a statistic. These methods are particularly useful when theoretical distributions are complex or unknown.
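A short sketch of a bootstrap estimate of the standard error of the mean, using only the standard library; the data are made up.

```python
import random

# Bootstrap estimate of the standard error of the mean.
random.seed(42)
data = [4.3, 5.1, 3.8, 6.0, 4.9, 5.5, 4.1, 5.8, 4.6, 5.2]

def mean(xs):
    return sum(xs) / len(xs)

boot_means = []
for _ in range(10_000):
    resample = [random.choice(data) for _ in data]  # sample with replacement
    boot_means.append(mean(resample))

m = mean(boot_means)
se = (sum((b - m) ** 2 for b in boot_means) / (len(boot_means) - 1)) ** 0.5
print(f"bootstrap SE of the mean ≈ {se:.3f}")
```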
| Aspect | Z-Test | T-Test |
| --- | --- | --- |
| Population Variance | Known | Unknown |
| Sample Size | Large (\(n \geq 30\)) | Small (\(n < 30\)) |
| Test Statistic | Z-Score | T-Score |
| Distribution | Normal Distribution | T-Distribution |
| Use Case | When \(\sigma\) is known and sample size is large | When \(\sigma\) is unknown and sample size is small |
- **Remember the Alpha Level:** Always set your significance level (\(\alpha\)) before conducting the test to avoid bias.
- **Memorize the Steps:** State the hypotheses, choose \(\alpha\), select the test, compute the statistic, interpret the results, and decide whether to reject \(H_0\).
- **Practice with Real Data:** Enhance your understanding by applying hypothesis tests to real-world datasets, which can improve retention and comprehension for exams.
1. Modern hypothesis testing grew out of Ronald A. Fisher's work on significance testing in the early 20th century and was later formalized by Jerzy Neyman and Egon Pearson, revolutionizing the way scientists validate theories.
2. Hypothesis testing is widely used in various fields, including medicine for clinical trials, economics for market research, and even in sports analytics to assess player performance.
3. The p-value concept has been so influential that it often sparks debates among statisticians about its interpretation and misuse in scientific research.
1. **Misinterpreting the P-Value:** Students often think a p-value indicates the probability that the null hypothesis is true. Instead, it represents the probability of observing results at least as extreme as those obtained, assuming the null hypothesis is true.
2. **Ignoring Assumptions:** Applying hypothesis tests without verifying assumptions like normality or independence can lead to incorrect conclusions. For example, using a t-test on non-normally distributed data without transformation.
3. **Multiple Comparisons:** Conducting multiple tests without adjustment increases the risk of Type I errors. For instance, running several t-tests on the same dataset without using corrections like Bonferroni.