Hypothesis testing is a statistical method used to determine whether there is enough evidence in a sample to infer that a certain condition holds true for the entire population. It involves formulating two competing hypotheses: the null hypothesis ($H_0$) and the alternative hypothesis ($H_a$).
The process involves selecting a significance level ($\alpha$), calculating a test statistic, and determining whether to reject the null hypothesis based on the p-value.
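This workflow can be sketched in a few lines of code. The following is a minimal illustration, assuming Python with SciPy and a small made-up sample; it runs a two-sided one-sample t-test and applies the decision rule:

```python
# Minimal sketch of the hypothesis-testing workflow described above.
# The sample data and mu_0 are illustrative assumptions, not from the text.
from scipy import stats

alpha = 0.05                       # significance level
sample = [78, 82, 69, 75, 80, 73]  # hypothetical sample data
mu_0 = 75                          # hypothesized population mean under H0

# One-sample t-test of H0: mu = mu_0 against Ha: mu != mu_0
t_stat, p_value = stats.ttest_1samp(sample, popmean=mu_0)

if p_value < alpha:
    print(f"Reject H0 (t = {t_stat:.2f}, p = {p_value:.3f})")
else:
    print(f"Fail to reject H0 (t = {t_stat:.2f}, p = {p_value:.3f})")
```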
The binomial distribution models the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success.
Definition: A random variable $X$ follows a binomial distribution with parameters $n$ (number of trials) and $p$ (probability of success) if:
$$ P(X = k) = \binom{n}{k} p^k (1-p)^{n-k} \quad \text{for } k = 0, 1, 2, \dots, n $$

The mean ($\mu$) of a binomial distribution is given by:

$$ \mu = n \cdot p $$

The Poisson distribution models the number of events occurring in a fixed interval of time or space, assuming these events occur with a known constant mean rate and independently of the time since the last event.
Definition: A random variable $X$ follows a Poisson distribution with parameter $\lambda$ (the average rate) if:
$$ P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!} \quad \text{for } k = 0, 1, 2, \dots $$

The mean ($\mu$) of a Poisson distribution is:

$$ \mu = \lambda $$

The normal distribution is a continuous probability distribution characterized by its symmetric bell-shaped curve, defined by its mean ($\mu$) and standard deviation ($\sigma$).
Definition: A random variable $X$ follows a normal distribution with mean $\mu$ and variance $\sigma^2$ if its probability density function is:
$$ f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{ -\frac{(x - \mu)^2}{2\sigma^2} } $$

The mean of a normal distribution is:
$$ \mu = E(X) = \int_{-\infty}^{\infty} x f(x) \, dx $$

When conducting hypothesis tests for a binomial mean, the following steps are typically followed: state $H_0$ and $H_a$ in terms of the success probability $p$; choose a significance level $\alpha$; compute the test statistic (exactly from the binomial distribution, or via the normal approximation when $np_0$ and $n(1 - p_0)$ are both sufficiently large); and compare the resulting p-value to $\alpha$ to decide whether to reject $H_0$.
Hypothesis testing for a Poisson mean follows a similar structure but is specifically tailored to the properties of the Poisson distribution.
For normally distributed data, hypothesis testing for the mean involves Z-tests or t-tests, depending on whether the population standard deviation is known.
Each hypothesis test relies on specific assumptions to ensure the validity of the results: the binomial test assumes a fixed number of independent trials with a constant success probability; the Poisson test assumes events occur independently at a constant average rate; and the normal-theory tests assume independent observations from an (at least approximately) normal population.
Example 1: Binomial Mean Test
A factory claims that 5% of its products are defective. A quality inspector selects 200 products and finds that 12 are defective. Test the factory's claim at the 5% significance level.
Solution:
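A worked solution, using the normal-approximation Z statistic derived later in this section (the sample is large enough that $np_0 = 10$ and $n(1 - p_0) = 190$ justify the approximation):

$$ H_0: p = 0.05 \quad \text{vs.} \quad H_a: p \neq 0.05 $$

$$ Z = \frac{k - n p_0}{\sqrt{n p_0 (1 - p_0)}} = \frac{12 - 200(0.05)}{\sqrt{200(0.05)(0.95)}} = \frac{2}{\sqrt{9.5}} \approx 0.65 $$

Since $|Z| \approx 0.65 < 1.96$, the two-tailed critical value at $\alpha = 0.05$, we fail to reject $H_0$: the observed 12 defectives are consistent with the factory's 5% claim.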
Example 2: Poisson Mean Test
On average, a call center receives 3 calls per minute. During one minute, it receives 7 calls. Test whether this is unusual at the 1% significance level.
Solution:
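A worked solution, using the Poisson Z statistic derived later in this section:

$$ H_0: \lambda = 3 \quad \text{vs.} \quad H_a: \lambda > 3 $$

$$ Z = \frac{X - \lambda_0}{\sqrt{\lambda_0}} = \frac{7 - 3}{\sqrt{3}} \approx 2.31 $$

The one-tailed p-value is $P(Z \geq 2.31) \approx 0.0104 > 0.01$, so we (narrowly) fail to reject $H_0$ at the 1% level; the exact Poisson tail probability $P(X \geq 7 \mid \lambda = 3) \approx 0.034$ leads to the same conclusion.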
Example 3: Normal Mean Test
A sample of 30 students has an average test score of 78 with a standard deviation of 10. The school's average score is 75. Test at the 5% significance level whether the sample provides evidence that students perform better.
Solution:
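A worked solution, using the t statistic since the population standard deviation is unknown:

$$ H_0: \mu = 75 \quad \text{vs.} \quad H_a: \mu > 75 $$

$$ t = \frac{\overline{X} - \mu_0}{s / \sqrt{n}} = \frac{78 - 75}{10 / \sqrt{30}} \approx 1.64 $$

With $n - 1 = 29$ degrees of freedom, the one-tailed critical value is $t_{0.05,\,29} \approx 1.70$. Since $1.64 < 1.70$, we fail to reject $H_0$: the sample does not provide significant evidence at the 5% level that students perform better.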
Delving deeper into the mathematics behind hypothesis testing enhances comprehension and allows for customization of tests based on specific scenarios.
Binomial Test Statistic Derivation:
For a binomial distribution with parameters $n$ and $p$, the standard deviation of the number of successes (which serves as the standard error of the test) is:

$$ SE = \sqrt{n p (1 - p)} $$

The Z-test statistic is then derived as:

$$ Z = \frac{k - n p_0}{\sqrt{n p_0 (1 - p_0)}} $$

where $k$ is the observed number of successes and $p_0$ is the hypothesized probability of success under $H_0$.
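A minimal sketch of this statistic in code, assuming Python with SciPy and reusing the Example 1 numbers; `scipy.stats.binomtest` (SciPy 1.7+) provides the exact test for comparison:

```python
import math
from scipy import stats

n, k, p0 = 200, 12, 0.05   # Example 1: trials, observed defectives, H0 proportion

# Normal-approximation Z statistic: Z = (k - n*p0) / sqrt(n*p0*(1 - p0))
z = (k - n * p0) / math.sqrt(n * p0 * (1 - p0))
p_approx = 2 * stats.norm.sf(abs(z))   # two-tailed p-value

# Exact binomial test for comparison
p_exact = stats.binomtest(k, n, p0).pvalue

print(f"Z = {z:.3f}, approximate p = {p_approx:.3f}, exact p = {p_exact:.3f}")
```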
Poisson Test Statistic Derivation:
For a Poisson distribution with parameter $\lambda$, the standard deviation of the count (its standard error) is:

$$ SE = \sqrt{\lambda} $$

The Z-test statistic is:

$$ Z = \frac{X - \lambda_0}{\sqrt{\lambda_0}} $$

where $X$ is the observed count and $\lambda_0$ is the hypothesized rate.
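The same calculation for the Poisson case, assuming SciPy and the Example 2 numbers; the exact tail probability $P(X \geq x)$ is included for comparison:

```python
import math
from scipy import stats

x, lam0 = 7, 3.0   # Example 2: observed count, H0 rate

# Normal-approximation Z statistic: Z = (X - lambda0) / sqrt(lambda0)
z = (x - lam0) / math.sqrt(lam0)
p_approx = stats.norm.sf(z)            # one-tailed p-value for Ha: lambda > lambda0

# Exact Poisson tail probability: P(X >= x | lambda = lambda0)
p_exact = stats.poisson.sf(x - 1, lam0)

print(f"Z = {z:.3f}, approximate p = {p_approx:.4f}, exact p = {p_exact:.4f}")
```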
Normal Test Statistic Derivation:
For normally distributed data, when the population standard deviation $\sigma$ is known, the Z-test statistic is:
$$ Z = \frac{\overline{X} - \mu_0}{\sigma / \sqrt{n}} $$

When $\sigma$ is unknown and the sample standard deviation $s$ is used instead, the t-test statistic is:

$$ t = \frac{\overline{X} - \mu_0}{s / \sqrt{n}} $$

Confidence intervals provide a range of plausible values for a population parameter and are closely related to hypothesis tests. Specifically, a $(1 - \alpha) \times 100\%$ confidence interval for a parameter $\theta$ will not contain the hypothesized value $\theta_0$ if and only if the corresponding two-sided hypothesis test at the $\alpha$ significance level rejects $H_0$.
Example: If a 95% confidence interval for $\mu$ is (74, 82), then testing $H_0: \mu = 75$ versus $H_a: \mu \neq 75$ at $\alpha = 0.05$ would fail to reject $H_0$ because 75 is within the interval.
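This duality can be checked numerically. A sketch assuming SciPy and the Example 3 summary statistics (note the duality pairs the two-sided interval with the two-tailed test, whereas the worked Example 3 used a one-tailed alternative):

```python
import math
from scipy import stats

n, xbar, s, mu0 = 30, 78.0, 10.0, 75.0   # Example 3 summary statistics
alpha = 0.05

# t statistic: t = (xbar - mu0) / (s / sqrt(n))
se = s / math.sqrt(n)
t = (xbar - mu0) / se
p_two_tailed = 2 * stats.t.sf(abs(t), df=n - 1)

# Two-sided 95% confidence interval for mu: xbar +/- t_crit * se
t_crit = stats.t.ppf(1 - alpha / 2, df=n - 1)
lo, hi = xbar - t_crit * se, xbar + t_crit * se

print(f"t = {t:.3f}, two-tailed p = {p_two_tailed:.3f}")
print(f"95% CI = ({lo:.1f}, {hi:.1f}); contains mu0 = {mu0}: {lo < mu0 < hi}")
```

The interval works out to roughly (74.3, 81.7), which contains 75, matching the two-tailed test's failure to reject at $\alpha = 0.05$.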
The sample size ($n$) plays a critical role in hypothesis testing: larger samples shrink the standard error, increase the power to detect real effects, and make the normal approximations used for binomial and Poisson tests more accurate, while small samples can leave even substantial effects undetected.
Understanding the types of errors in hypothesis testing is essential for proper interpretation of results: a Type I error is rejecting a true null hypothesis, and its probability equals the significance level $\alpha$; a Type II error is failing to reject a false null hypothesis, and its probability is denoted $\beta$.
The balance between Type I and Type II errors is often managed by adjusting the significance level and considering the power of the test.
Power analysis determines the probability that a test will correctly reject a false null hypothesis (i.e., avoid a Type II error). It is influenced by the significance level ($\alpha$), sample size ($n$), effect size, and variability within the data.
Formula for Power:
$$ \text{Power} = 1 - \beta $$

Conducting a power analysis before data collection can inform decisions about appropriate sample sizes to achieve desired power levels.
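A minimal power calculation, assuming a one-sided one-sample Z-test with known $\sigma$ and hypothetical design values:

```python
import math
from scipy import stats

alpha, sigma, n = 0.05, 10.0, 30   # hypothetical design values
effect = 3.0                       # true difference mu - mu0 to be detected

# Under Ha, the Z statistic is centered at effect * sqrt(n) / sigma, so
# Power = P(Z >= z_crit - effect * sqrt(n) / sigma)
z_crit = stats.norm.ppf(1 - alpha)
power = stats.norm.sf(z_crit - effect * math.sqrt(n) / sigma)

print(f"Power = {power:.3f}")      # probability of rejecting a false H0
```

With these numbers the power is only about 0.5, which is exactly the kind of finding that motivates collecting a larger sample.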
When data do not meet the assumptions required for parametric tests (e.g., normality), non-parametric tests provide alternative methods: for a one-sample location question, for instance, the sign test or the Wilcoxon signed-rank test can stand in for the t-test.
These alternatives are less powerful but more robust to violations of assumptions.
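As an illustration, a one-sample location test without the normality assumption can be run with the Wilcoxon signed-rank test; the sketch below assumes SciPy and made-up data:

```python
from scipy import stats

sample = [79, 83, 69, 74, 80, 72, 91, 66, 77, 85]  # hypothetical scores
mu0 = 75

# Wilcoxon signed-rank test on the differences; H0: the differences are
# symmetrically distributed about zero (i.e., the typical score is mu0)
stat, p_value = stats.wilcoxon([x - mu0 for x in sample])
print(f"W = {stat}, p = {p_value:.3f}")
```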
Unlike frequentist hypothesis testing, Bayesian methods incorporate prior beliefs or information about parameters and update these beliefs based on observed data.
Bayes Factor: A ratio that compares the likelihood of the data under two competing hypotheses, providing evidence in favor of one hypothesis over the other.
Bayesian methods offer a probabilistic interpretation of hypotheses but require the specification of prior distributions.
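As a toy illustration (this specific setup is an assumption of the sketch, not prescribed by the text): a Bayes factor for the Example 1 binomial data, comparing $H_0: p = 0.05$ against an alternative that places a uniform Beta(1, 1) prior on $p$. Under that prior, the marginal likelihood of $k$ successes in $n$ trials integrates to $1/(n+1)$:

```python
from scipy import stats

n, k, p0 = 200, 12, 0.05

# Likelihood of the data under H0: p = p0
like_h0 = stats.binom.pmf(k, n, p0)

# Marginal likelihood under Ha with a uniform Beta(1, 1) prior on p:
# integral of C(n,k) * p^k * (1-p)^(n-k) dp over [0, 1] equals 1 / (n + 1)
like_h1 = 1.0 / (n + 1)

bf01 = like_h0 / like_h1      # Bayes factor in favor of H0 over Ha
print(f"BF01 = {bf01:.1f}")   # values > 1 favor H0
```

Here the Bayes factor favors $H_0$, consistent with the frequentist test's failure to reject in Example 1.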
Conducting multiple hypothesis tests increases the risk of Type I errors. To address this, adjustments such as the Bonferroni correction are applied:
Bonferroni Correction: Adjust the significance level by dividing it by the number of tests ($\alpha' = \alpha / m$), where $m$ is the number of comparisons.
This method controls the family-wise error rate but can be overly conservative, reducing the power of individual tests.
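A minimal sketch of the correction with hypothetical p-values:

```python
p_values = [0.010, 0.040, 0.030, 0.200, 0.002]  # hypothetical results of m = 5 tests
alpha = 0.05
alpha_adj = alpha / len(p_values)                # Bonferroni-adjusted threshold

for i, p in enumerate(p_values, start=1):
    verdict = "reject H0" if p < alpha_adj else "fail to reject H0"
    print(f"Test {i}: p = {p:.3f} -> {verdict} (threshold = {alpha_adj:.3f})")
```

Note that results significant at the unadjusted $\alpha = 0.05$ (e.g., $p = 0.03$) no longer pass the corrected threshold of $0.01$, which is the conservatism described above.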
Effect size quantifies the magnitude of a phenomenon, providing context beyond p-values: common measures include Cohen's $d$ for mean differences, Pearson's $r$ for associations, and odds ratios for binary outcomes. In Example 3, for instance, Cohen's $d = (78 - 75)/10 = 0.3$, a small-to-medium effect.
Including effect sizes enhances the interpretability of hypothesis testing results.
Statistical significance does not equate to practical significance. It is imperative to interpret results within the context of the research question, considering the real-world implications of the findings.
Example: A drug may show a statistically significant effect in lowering blood pressure, but if the effect size is minimal, its practical benefits may be limited.
| Aspect | Binomial Mean Test | Poisson Mean Test | Normal Mean Test |
|---|---|---|---|
| Distribution Type | Discrete | Discrete | Continuous |
| Parameters | Number of trials ($n$), probability of success ($p$) | Rate parameter ($\lambda$) | Mean ($\mu$), standard deviation ($\sigma$) |
| Assumptions | Fixed number of trials, independent trials, constant $p$ | Independent events, constant rate, rare events | Normality, independent observations |
| Test Statistic | Z-test based on proportion | Z-test based on count | Z-test or t-test based on sample mean |
| Applications | Quality control, success rates | Event counts over intervals, rare-event analysis | Measurement data, heights, test scores |
| Pros | Simplicity, applicable to binary data | Models event rates effectively | Widely applicable, powerful with large samples |
| Cons | Limited to binary outcomes | Assumes independence and constant rate | Sensitive to outliers, assumes normality |
Enhance your understanding and performance with these tips: write out $H_0$ and $H_a$ before touching any formula; decide from the wording of the question whether the alternative is one-tailed or two-tailed; check the test's assumptions before trusting its conclusion; and state every conclusion in the context of the problem rather than as a bare "reject" or "fail to reject."
Did you know that the Poisson distribution is extensively used in telecommunications to model the number of phone calls received by a call center per minute? Additionally, the normal distribution, often dubbed the "bell curve," arises naturally in countless real-world scenarios due to the Central Limit Theorem, which states that the sum of many independent random variables tends toward a normal distribution. Furthermore, binomial tests play a critical role in quality control within manufacturing industries, helping to determine the proportion of defective products in a production line.
Many students stumble when applying hypothesis tests due to common errors: swapping the null and alternative hypotheses; using a two-tailed critical value for a one-tailed test (or vice versa); reading the p-value as the probability that $H_0$ is true; and overlooking the assumptions (independence, constant rate, normality) that justify the chosen test.