2pKf/wFk6/aswU1lBoBz9Lza2j7dXosmgCO7Jp9jti8FXejqsmetDActW52WLADvJGTdlj+msSr0tBquEtXUyjSncrbh0+KE3CplRJo0UnCce+Hl9sCGn6YPkyr0KB7FLDDg3nVwx6V7LuTwwTdSGvr6doD1vaC9JD3RRTlVnJQ2ZwCWE8tVV2k4wVd03tr/rL0i1uy1WINPHaTn0MLeA9wIjDPm8sJwOhtRN8dC7Zgx+nPKptWvqUa3TZ57UbYnfi4qq1jxTU5SAi7eJYJqHHeuS/71CbwBQy2nnRSZVTpfBKJS7x9yCz0IxZzemjaSbZN+iWW1QUZIAQlVAm9ovrg2hFTyNq4JtwfmJbrOH+C8i3m5VUq9B9J5Ap7zad1VymoCkuTdzvSc/6teDjzoLW5QM4jBf/3aAvgEESvP99jldohQr0gNfssBX/TeHbwkx7TYRhaOOeZgfMW9Se8GojHO5EdP2ThXHGB0VfRwKa/Uh+AMxVMoSWj5LpT5FsaCF55pbdbT4ZFAQ9HKBv1Ksz4UaPOJlTE4UlsdjiHhPhXWZAXF/E/DwV/Pj8TJUzhL5BZG/S7GypK3WE6/7iClqwrkK2LdjA+v8H1xe9i/3Ahv/uGu+3c70QaR2RF6PzD0WKnFZlcF/PbHanoLsRsvAjs4/wTz9EkeJ3JIw1vRucRacPWNV0CcYzVEFo6YkRXG+mPQxT18A3jXHToc7DWO94zSBgtqZU9ocvi4WLro1IZ1l2D1yxkVA4r4FQ7iadEY+Nooyz5dZK8EorLbNz/YuLGN+3m1l/yENyJ2Jbx/0W3jWPXhbnk89z8GwPUUFGQ4pvAqCQzdI6gFCchtBqNa4o3l7WZc9

Probability & Distributions

Model uncertainty with probability and understand how random variables behave.

Probability Basics

P(A) = favorable outcomes / total outcomes
0 ≤ P(A) ≤ 1
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
P(A') = 1 − P(A)

Probability quantifies uncertainty. These rules come from set theory — unions and intersections. Counting techniques like combinatorics (permutations and combinations) are essential for computing probabilities in finite sample spaces.

Conditional Probability & Bayes' Theorem

P(A|B) = P(A ∩ B)/P(B)
Independent events: P(A ∩ B) = P(A)·P(B)
Bayes: P(A|B) = P(B|A)·P(A)/P(B)

Bayes' Theorem is the backbone of Bayesian inference and machine learning. It lets us update beliefs with new evidence. Total probability connects this to partition: P(B) = Σ P(B|Aᵢ)·P(Aᵢ).

Example: Disease Testing

Disease prevalence: 1%. Test sensitivity: 99%. False positive rate: 5%.

P(Disease | Positive) = (0.99×0.01)/(0.99×0.01 + 0.05×0.99) ≈ 0.0099/0.0594 ≈ 16.7%

Even with a good test, a positive result is only 16.7% likely to be a true positive when the disease is rare!

Discrete Distributions

  • Bernoulli: Single trial, p = success. E[X] = p, Var = p(1−p)
  • Binomial(n, p): P(X = k) = C(n,k)·pᵏ(1−p)ⁿ⁻ᵏ — uses polynomial expansion
  • Poisson(λ): P(X = k) = e⁻λ·λᵏ/k! — models rare events per interval
  • GbtRAakcNpcEAOa2ulMtWzFpPloiZbndzTYMc2Zy+A4SkWaDiE69LLMWv6RlgoF7/y1kBQ09La1szCgAzfUckpMV4CkkEYla0GykzOt0ODI2uhlQejB1gUqs2yZHZ1aPhbUz2cNQPzy8jfguc/DWa2HtyRsgIJS5MrzGYQ/usrObOc4K7oF1Rvu9ot0zNFdy7s3yKW55AGR6wwkYlWAxgz+sooMlosR2CCLV1ec3j/tiJlICMDOusPovhqoHzatyEF4dKgEpQ5KiRz9a986cAjrzqb+le0OP0J2FGtXn4PtLa0N82+8ZxewxkzCpJgaD7YIh49pg91WUtGMxC42axWx4vg15YB1uLpY5kyBqHJPSYDbISDoBEa+0AWE1XJE1+mqoQ6dsjORaflCOaphrpjJyn3aYWgWHUrMYNwCXNgAJmy5rY4v4hW1tX8UnRz/FOiT7ymKtJ5MdMP6rsd3ElL7K7IrXAnX5XpS17qX4Ot7GSHWxVo/W9aPsw3tfly3KBnJUfDFSxdIv05vGtNBr7NRyrkWed/fznGqv5rD6AoPiNCD3P5LHHl5GBeGzFJhO/xczSN9yK+mV8eEGZIpXtgSD/kShU0X+2FL47p808CwP6D8UGTLcnXFw7PD07uPEQgQDcbdA0lwa+4n63A/gIR4NhVeJuDOj2zQu+pU1C2mBh/VSSA3ezg216HhAO29L3YQ/zne8KfpwC4hHwKbK48PM5xDSLv24RxWslpYvwdFfYkKlOs7+KJ+0E7unXyKh5AgLAzOEQ3Prn0577MHtwBbEV/Vxm9Mmrmpv77cYtkZz9qya7lPFrfwW6iN3LGqP+Q8eGUl59aZQ9EU7LQ6IbGC8gXzlIt
  • Geometric(p): P(X = k) = (1−p)ᵏ⁻¹·p — trials until first success. Connects to exponential functions

Continuous Distributions

Normal: f(x) = (1/σ√(2π))·e^(−(x−μ)²/(2σ²))
Z-score: z = (x − μ)/σ
Standard normal: μ = 0, σ = 1

For continuous distributions, probabilities are areas under the curve — you need integration. The normal (Gaussian) distribution is the most important, governing everything from measurement error to stock prices.

  • Uniform(a, b): f(x) = 1/(b−a) — constant density
  • Exponential(λ): f(x) = λe⁻λˣ — time between events
  • t-distribution: Used in hypothesis testing with small samples
  • Chi-squared: Sum of squared standard normals — used in goodness-of-fit tests

Central Limit Theorem

Central Limit Theorem: Regardless of the population distribution, the sample mean X̄ approaches a normal distribution N(μ, σ²/n) as n → ∞. This is why the normal distribution is so important, and why statistical inference works. The convergence concept mirrors limits in calculus.
NRk6frRvKhf8l1YhTttTbEo7h1gIS1Tmi19tRcR8cJmol8vq6JLrimOh3SAPnmeB66rVQdFuNfBSwPMyeKaZfHWO9rRY6dF2k+Oo6gVGsBtxkIg+8eQinIgwnCedwwz7rAYNGj85dZCceNgLZOYQkolPhIqwHMdQqqG8voPkvpJSSzb1dgLdm8Kkv5xGVjgMWfkNTQlUVfkKayGFni+5qsoewSockotwsjU5M7JjgQUhkW27ehR+30XsDWdsA5K6CqEuRXyDB/Et2NaOO/cQQE9jiKKYM8SunbM4X5oIQfu+mia1MOof8WnWqRUdp5VRs1/csAhpbdkLpLb8Y27NZ6ciJyLDmFnvU6aZa30ODbpY6FCpLstXZgg0hjTC2eOAs29LcuNpv0hFpu07JNT+ZDlTyrkL9c+KHVQGf2Ebn/WmhRY0luknNNcGJue1HjKlgu9TMBMPodZC2u5NqtL8BFuQg3icTN47ewameNQmbl53WDok6VaFxXbkQoG7ApeClrRZ/tGxAkY10+OIzTDPrb5k924Jfe0C/DSX82cDHXk1bsWIHzt5DrJOQziXl4Ov4hEJHyNEZ54NnGSgAyS/GfHfdFHvlcyudEAGp33my7250RaneDIel6rYMnLCkv2PU5BDxPZF2ALrOTaHodMVq9U984xYt1TsMrcRUw/fzbffuf3+rX+3erMA5lY4Tm2JclD3SMkOYzmV8FRXWQWdkOqBD1cnIjCnLMs+iFbp3x/ohSGsiZDVPYpvEwmf2KCDr6/wx+AAAIbxPf2Hk9VKQMDu0uI/I558JP/xW3/bNAFSvnaoYPdPDCXUI1+DR2tU02cPmIUX5XUrVExrMOiywSRp0rlaW