標本比率の漸近分散 - 問題演習問題17

標本比率の漸近分散レベル1

ある製品の不良率$p$を調査するため、$n = 400$個の製品を検査したところ、80個の不良品が見つかった。標本比率$\hat{p} = \frac{80}{400} = 0.2$について、$\hat{p}$の漸近分散を求めよ。ただし、真の不良率を$p = 0.2$として計算し、答えは小数第4位まで求めよ。

解説

解答と解説を表示

<h4>標本比率の漸近正規性</h4><p>標本比率は二項分布から導かれる統計量で、中心極限定理により漸近正規分布に従います。</p><h4>二項分布と標本比率の基本理論</h4><p class='step'><strong>Step 1: 二項分布の設定</strong></p><p>$n$個の独立試行で成功回数を$X$とすると：</p><div class='formula'>$X \sim \text{Binomial}(n, p)

lt;/div><p>標本比率は：</p><div class='formula'>$\hat{p} = \frac{X}{n}

lt;/div><p>ここで$p$は真の成功確率（不良率）です。</p><p class='step'><strong>Step 2: 二項分布の性質</strong></p><p>二項分布$\text{Binomial}(n, p)$の期待値と分散：</p><div class='formula'>$E[X] = np, \quad \text{Var}(X) = np(1-p)

lt;/div><p>したがって、標本比率$\hat{p}$について：</p><div class='formula'>$E[\hat{p}] = E\left[\frac{X}{n}\right] = \frac{E[X]}{n} = \frac{np}{n} = p

lt;/div><div class='formula'>$\text{Var}(\hat{p}) = \text{Var}\left(\frac{X}{n}\right) = \frac{\text{Var}(X)}{n^2} = \frac{np(1-p)}{n^2} = \frac{p(1-p)}{n}

lt;/div><h4>中心極限定理の適用</h4><p class='step'><strong>Step 3: 漸近正規性の理論</strong></p><p>独立なベルヌーイ試行$Y_1, Y_2, \ldots, Y_n$（$Y_i \sim \text{Bernoulli}(p)$）において、$X = \sum_{i=1}^n Y_i$とすると：</p><div class='formula'>$\hat{p} = \frac{X}{n} = \frac{1}{n}\sum_{i=1}^n Y_i

lt;/div><p>中心極限定理により、$n$が十分大きいとき：</p><div class='formula'>$\sqrt{n}(\hat{p} - p) \xrightarrow{d} N(0, p(1-p))

lt;/div><p>これは以下を意味します：</p><div class='formula'>$\hat{p} \xrightarrow{d} N\left(p, \frac{p(1-p)}{n}\right)

lt;/div><p class='step'><strong>Step 4: 漸近分散の定義</strong></p><p>標本比率$\hat{p}$の漸近分散は：</p><div class='formula'>$\text{avar}(\hat{p}) = \frac{p(1-p)}{n}

lt;/div><p>これは真の母数$p$に依存する理論的分散です。</p><h4>具体的な計算</h4><p class='step'><strong>Step 5: 与えられた条件の整理</strong></p><p>問題の設定：</p><ul><li>標本サイズ：$n = 400

lt;/li><li>観測された不良品数：80個</li><li>標本比率：$\hat{p} = \frac{80}{400} = 0.2

lt;/li><li>真の不良率：$p = 0.2$（仮定）</li></ul><p class='step'><strong>Step 6: 漸近分散の計算</strong></p><p>標本比率の漸近分散：</p><div class='formula'>$\text{avar}(\hat{p}) = \frac{p(1-p)}{n} = \frac{0.2 \times (1-0.2)}{400} = \frac{0.2 \times 0.8}{400}

lt;/div><div class='formula'>$= \frac{0.16}{400} = 0.0004 = 4.0 \times 10^{-4}

lt;/div><h4>漸近正規性の応用</h4><p class='step'><strong>Step 7: 標準化統計量</strong></p><p>標準化された標本比率：</p><div class='formula'>$Z = \frac{\hat{p} - p}{\sqrt{\frac{p(1-p)}{n}}} = \frac{\hat{p} - p}{\sqrt{\text{avar}(\hat{p})}}

lt;/div><p>$n$が大きいとき、$Z \xrightarrow{d} N(0, 1)$です。</p><p>今回の例では：</p><div class='formula'>$Z = \frac{0.2 - 0.2}{\sqrt{0.0004}} = \frac{0}{0.02} = 0

lt;/div><p class='step'><strong>Step 8: 信頼区間への応用</strong></p><p>$p$の漸近95%信頼区間（ワルド信頼区間）：</p><div class='formula'>$\hat{p} \pm z_{\alpha/2} \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}

lt;/div><p>ここで$\hat{p}$を$p$の推定値として使用します：</p><div class='formula'>$0.2 \pm 1.96 \times \sqrt{\frac{0.2 \times 0.8}{400}} = 0.2 \pm 1.96 \times 0.02 = 0.2 \pm 0.0392

lt;/div><p>したがって95%信頼区間は$[0.1608, 0.2392]$です。</p><h4>漸近正規性の妥当性条件</h4><div class='key-point'><div class='key-point-title'>正規近似の妥当性</div><p><strong>経験則</strong>：</p><ul><li><strong>$np \geq 5$かつ$n(1-p) \geq 5

lt;/strong>：基本的な目安</li><li><strong>$np \geq 10$かつ$n(1-p) \geq 10

lt;/strong>：より安全な基準</li></ul><p><strong>今回の検証</strong>：</p><ul><li>$np = 400 \times 0.2 = 80 \geq 10$ ✓</li><li>$n(1-p) = 400 \times 0.8 = 320 \geq 10$ ✓</li></ul><p>両条件を満たすので、正規近似は十分に妥当です。</p></div><h4>実際の分散推定</h4><p class='step'><strong>Step 9: 推定分散の計算</strong></p><p>実際には真の$p$は未知なので、推定分散を用います：</p><div class='formula'>$\widehat{\text{Var}}(\hat{p}) = \frac{\hat{p}(1-\hat{p})}{n} = \frac{0.2 \times 0.8}{400} = 0.0004

lt;/div><p>この場合、偶然にも真の値と一致しています。</p><p class='step'><strong>Step 10: 他の信頼区間法との比較</strong></p><table class='table table-bordered'><tr><th>手法</th><th>95%信頼区間</th><th>特徴</th></tr><tr><td>ワルド法</td><td>$[0.1608, 0.2392]

lt;/td><td>最も単純、極端な$p$で問題</td></tr><tr><td>ウィルソン法</td><td>$[0.1625, 0.2423]

lt;/td><td>より安定、推奨される</td></tr><tr><td>クロッパー・ピアソン法</td><td>$[0.1614, 0.2437]

lt;/td><td>正確、保守的</td></tr></table><h4>デルタ法との関連</h4><p class='step'><strong>Step 11: ロジット変換</strong></p><p>$p$が0や1に近い場合、ロジット変換が有用です：</p><div class='formula'>$g(p) = \log\left(\frac{p}{1-p}\right) = \text{logit}(p)

lt;/div><p>デルタ法により：</p><div class='formula'>$g'(p) = \frac{1}{p(1-p)}

lt;/div><div class='formula'>$\text{avar}(g(\hat{p})) = [g'(p)]^2 \cdot \frac{p(1-p)}{n} = \frac{1}{[p(1-p)]^2} \cdot \frac{p(1-p)}{n} = \frac{1}{np(1-p)}

lt;/div><p>今回の例では：</p><div class='formula'>$\text{avar}(\text{logit}(\hat{p})) = \frac{1}{400 \times 0.2 \times 0.8} = \frac{1}{64} = 0.015625

lt;/div><h4>漸近有効性</h4><p class='step'><strong>Step 12: フィッシャー情報量との関係</strong></p><p>ベルヌーイ分布のフィッシャー情報量：</p><div class='formula'>$I(p) = \frac{1}{p(1-p)}

lt;/div><p>$n$個の観測に対して：</p><div class='formula'>$I_n(p) = \frac{n}{p(1-p)}

lt;/div><p>クラメール・ラオ下界：</p><div class='formula'>$\text{CRLB}(p) = \frac{1}{I_n(p)} = \frac{p(1-p)}{n}

lt;/div><p>標本比率の分散$\frac{p(1-p)}{n}$は下界に一致するため、$\hat{p}$は漸近有効推定量です。</p><p>したがって、標本比率$\hat{p}$の漸近分散は$0.0004$です。</p>

統計的推測（推定）