Math Problem Statement

One common distribution that appears in branching process theory is a DRV with pmf
$$f_X(x; \mu) = e^{-\mu x} \frac{(\mu x)^{x-1}}{x!},$$
where $x \in \{1, 2, \ldots\}$ and $\mu \in (0, 1)$.

(a) Find the MLE for $\mu$ given iid $X_1, \ldots, X_n$. Then, find the MLE for the particular data $x_1 = 2$, $x_2 = 1$, $x_3 = 6$.

(b) Using Desmos, draw a graph of the likelihood function (not log-likelihood) for the data $x_1 = 2$, $x_2 = 1$, $x_3 = 6$. It should be maximal at the $\mu$ value you found in part (a). Include a sketch of the graph from Desmos (or a screenshot if you're tech-fancy). (Note: In Desmos, if you click on the wrench icon in the upper-right, you can change the range of values on the x and y axes.)

Solution

Let's address the problem systematically:


(a) Find the MLE for $\mu$

The given pmf is:
$$f_X(x; \mu) = e^{-\mu x} \frac{(\mu x)^{x-1}}{x!}, \quad x \in \{1, 2, \ldots\}, \ \mu \in (0, 1).$$
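For later sanity checks, here is a minimal Python sketch of this pmf (the helper name `pmf` is ours, not from the problem); working in log space avoids overflow for large $x$:

```python
import math

def pmf(x: int, mu: float) -> float:
    """f_X(x; mu) = exp(-mu*x) * (mu*x)^(x-1) / x!, computed in log space."""
    log_p = -mu * x + (x - 1) * math.log(mu * x) - math.lgamma(x + 1)
    return math.exp(log_p)

# Sanity check: for mu in (0, 1) the probabilities should sum to ~1.
print(sum(pmf(x, 0.5) for x in range(1, 500)))  # ~1.0
```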

For a set of iid observations $X_1, X_2, \ldots, X_n$, the likelihood function is:
$$L(\mu; X_1, \ldots, X_n) = \prod_{i=1}^n f_X(X_i; \mu).$$

Substituting $f_X(x; \mu)$:
$$L(\mu; X_1, \ldots, X_n) = \prod_{i=1}^n e^{-\mu X_i} \frac{(\mu X_i)^{X_i - 1}}{X_i!}.$$

Simplifying:
$$L(\mu; X_1, \ldots, X_n) = e^{-\mu \sum_{i=1}^n X_i} \prod_{i=1}^n \frac{(\mu X_i)^{X_i - 1}}{X_i!}.$$

Taking the log-likelihood:
$$\ell(\mu) = \log L(\mu; X_1, \ldots, X_n) = -\mu \sum_{i=1}^n X_i + \sum_{i=1}^n (X_i - 1) \log(\mu X_i) - \sum_{i=1}^n \log(X_i!).$$

Since $\log(\mu X_i) = \log \mu + \log X_i$, only two terms involve $\mu$:
$$\ell(\mu) = -\mu \sum_{i=1}^n X_i + \left(\sum_{i=1}^n (X_i - 1)\right) \log \mu + \text{(terms independent of } \mu).$$

Differentiating with respect to $\mu$ and setting the derivative to zero:
$$\frac{\partial \ell}{\partial \mu} = -\sum_{i=1}^n X_i + \sum_{i=1}^n \frac{X_i - 1}{\mu} = 0.$$

Multiplying through by $\mu$:
$$\mu \sum_{i=1}^n X_i = \sum_{i=1}^n (X_i - 1).$$

Rearranging gives the MLE:
$$\hat{\mu} = \frac{\sum_{i=1}^n (X_i - 1)}{\sum_{i=1}^n X_i} = 1 - \frac{n}{\sum_{i=1}^n X_i}.$$
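As a sanity check on the algebra, the stationary point can be reproduced symbolically. A minimal sympy sketch, treating $S = \sum_i X_i$ and $n$ as positive symbols (the variable names are ours):

```python
import sympy as sp

# S = sum of the observations, n = sample size; both positive.
mu, S, n = sp.symbols('mu S n', positive=True)

# Terms of the log-likelihood that depend on mu.
ell = -mu * S + (S - n) * sp.log(mu)

# Solve d(ell)/d(mu) = 0 for mu.
print(sp.solve(sp.diff(ell, mu), mu))  # [(S - n)/S] = sum(X_i - 1)/sum(X_i)
```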


For $x_1 = 2$, $x_2 = 1$, $x_3 = 6$:

$$\sum_{i=1}^3 (X_i - 1) = (2 - 1) + (1 - 1) + (6 - 1) = 1 + 0 + 5 = 6, \qquad \sum_{i=1}^3 X_i = 2 + 1 + 6 = 9.$$

Thus:
$$\hat{\mu} = \frac{6}{9} = \frac{2}{3}.$$
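Evaluating the closed-form estimator in Python takes one line:

```python
x = [2, 1, 6]                        # observed data
mu_hat = (sum(x) - len(x)) / sum(x)  # sum(x_i - 1) / sum(x_i)
print(mu_hat)                        # 0.666... = 2/3
```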


(b) Graph the Likelihood Function

The likelihood function for the data $x_1 = 2$, $x_2 = 1$, $x_3 = 6$ is:
$$L(\mu) = e^{-9\mu} \cdot \frac{(2\mu)^{1} \cdot (1\mu)^{0} \cdot (6\mu)^{5}}{2! \cdot 1! \cdot 6!}.$$

Collecting the constants into $C$:
$$L(\mu) = C \, e^{-9\mu} \mu^{1 + 0 + 5},$$
where $C$ is independent of $\mu$.

Explicitly:
$$L(\mu) \propto \mu^6 e^{-9\mu}.$$
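As a numerical cross-check, minimizing the negative log of this kernel over $(0, 1)$ should recover $\hat{\mu} = 2/3$. A sketch using scipy's bounded scalar minimizer:

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Negative log of mu^6 * exp(-9*mu); minimizing it maximizes the likelihood.
neg_log_L = lambda mu: -(6 * np.log(mu) - 9 * mu)

res = minimize_scalar(neg_log_L, bounds=(1e-6, 1 - 1e-6), method='bounded')
print(res.x)  # ~0.6667, matching 2/3
```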


Plot on Desmos

  1. Define $L(\mu) = \mu^6 e^{-9\mu}$ (in Desmos, enter $y = x^6 e^{-9x}$, with $x$ standing in for $\mu$).
  2. Choose a range for $\mu$ (e.g., $\mu \in (0, 1)$); the wrench icon adjusts the axis ranges.
  3. Observe the maximum near $\mu = \frac{2}{3}$; a matplotlib alternative is sketched below.
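If Desmos is unavailable, the same curve can be drawn with matplotlib; a minimal sketch (the constant factor $C$ is dropped since it only rescales the vertical axis):

```python
import numpy as np
import matplotlib.pyplot as plt

mu = np.linspace(0.001, 1, 500)
L = mu**6 * np.exp(-9 * mu)   # likelihood up to a constant factor

plt.plot(mu, L)
plt.axvline(2/3, linestyle='--', label=r'$\hat{\mu} = 2/3$')
plt.xlabel(r'$\mu$')
plt.ylabel(r'$L(\mu)$ (unnormalized)')
plt.legend()
plt.show()
```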



Follow-Up Questions:

  1. Why is MLE an important method for parameter estimation in statistics?
  2. What are the practical interpretations of $\mu$ in this context?
  3. How does the likelihood function differ from the log-likelihood function?
  4. What are the benefits of using Desmos for visualization in statistical analysis?
  5. What conditions ensure the uniqueness of the MLE in this type of problem?

Tip: Always validate the MLE by checking the second derivative to confirm it is a maximum, not a minimum or inflection point. Here $\ell''(\mu) = -\sum_{i=1}^n (X_i - 1)/\mu^2$, which is negative for all $\mu \in (0, 1)$ whenever some $X_i > 1$, so the stationary point is indeed a maximum.
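For the data above, a short sympy sketch confirms the second derivative is negative at $\hat{\mu}$:

```python
import sympy as sp

mu = sp.symbols('mu', positive=True)
x = [2, 1, 6]

# Log-likelihood up to an additive constant.
ell = -mu * sum(x) + (sum(x) - len(x)) * sp.log(mu)

d2 = sp.diff(ell, mu, 2)
print(d2)                              # -6/mu**2 < 0 for all mu > 0
print(d2.subs(mu, sp.Rational(2, 3)))  # -27/2, so mu_hat is a maximum
```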


Math Problem Analysis

Mathematical Concepts

Probability Theory
Maximum Likelihood Estimation (MLE)
Branching Processes

Formulas

Likelihood function: L(μ; X1, ..., Xn) = Π f_X(Xi; μ)
Log-likelihood: ℓ(μ) = -μ Σ Xi + Σ (Xi - 1) log(μ) + constant
MLE for μ: μ̂ = (Σ (Xi - 1)) / (Σ Xi)

Theorems

Properties of Maximum Likelihood Estimation

Suitable Grade Level

Undergraduate Level (Statistics/Probability)