Probability & Distributions

bqeF4CXWcYVJ6LiJVh6ZnkFfNPrRt2Sjhl6y9bzZOzc0LVokXfSj61aqnDVNrkOA5lG4qePqBJ1vFX/4sBJWdINHDAzTC4+GmgJXORCzPTX4mVKd1pLPif11BgO3gulYgCdmnCAG3uib+0I0xuBRA+XaWueIOnoxsioHk+/E6mV90+okpSl8fPEcvjIxxfChNGrX1jheUElXa2PrlkJdg4n7dwvFFEXuTL7IEDrnYrO0ozRaSd9TboqVVz3VDFz7SufNoGvfmL8TUimBKbMcLNSeWxnkLn62n3dp80zaVs4ZT8vV0zYyII3MD+BzK7RvnCnrKuns3NAE1LMzM3FUvwaqgINjOcPXpie+vbp5uPxdN0WdPI9ioAGGDA5eqx9xxfmZngYbRUGB4BLP8PdA8nQia1ZLs0EP1+f4IsH8or3jFRLST9c4p6OWFgrZQgvVl+ZWblqfob7P5FQC7qfj3j4So+yLDGijDfhENN1H+pffakHZprrR06Jf7EfKWnlN087iLhEUJzWyVc+MNZeS2aidpxT1pqby9bjw4WNpvidPTj0Ne73tmObvfhSEScuCoMLF+zr5i9y+v+LdGdK5f/MyZVYBedEGuEMHzrkjEj6F5ZiJnmb1dgSVP0f/0E5hUBz7ELmMAVPtbDYUp3xYgajp2GB1ESYBFJtmnQ68thLbqUoM+GnskvbGs5EJsg/2all+dRAjqJaqGUbGTcwNzP0j1rKNcdxoAUseNfItUEv/Vfq871nNV+p5QA7AZt5oF2jXfd9OZFQqziODq3JMS3D/ssWpjceeAa/Y5ts990N9lev4bC0ETVyHQj+r0W/bFgBMvT9GTWEAgwdbYRBsKBVJ9Qq

Model uncertainty with probability and understand how random variables behave.

Probability Basics

P(A) = favorable outcomes / total outcomes
0 ≤ P(A) ≤ 1
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
P(A') = 1 − P(A)
DOAYISpvF6R0s9GTHoB/nwWGzsD+b2IAjM0KLnTo3zxYKCQUTQe6SsrE8Qg2+RAx7RBdycBP935hHcWNO/AxBuudX7QnJYjJaTlMWtCqPbRD/26yWaq/zsGIEJDKV1dG69hpYCl8bS/787IjWUfA0bsnoR/b5KP7oZPOoRt+3tjYQPkHo6UIp51DN4x3evk4E4ned+5LJjh3NNwju3CEXU4DxpC934clJ01Bgf02FXYEtiXp1vhLRlCKKY4zBwaBEzkGngXk0g2tqTRUkmbAcF1kcUUBTzhAwjKPLxDLg3K/h6psbFSHI7zS7knULzeJ07ekKZkWJ+ZY3XbouUTPeh35L2Ssu76O8+IHhtZ0yjF+HQdD+zTcMC+RjfSgsWNnYQt1epggIrWwgU7TXEuIbTBignGkeIqgq9Ey+VY02yPm/dmfpF8pbOOtSZZxHF02WcKXWFKFsrdOaervr3Uv3VN/FXKDHpDjgE0xq7z5XVqh44DP74PzTBbSTrgQuDqDJ5aAUmdUqmSyvMDP6sJHI4qeAglsM8Y24m8W4aSJi6AThV5cZFhiU0Nvp9EdOOUCj+HkytUrWCy845e9S5/Gz0uejUWewYxzXYqOWLJciIBd8qcmyKWZR7JzGF0bNT+n5XNdEXomVLvqb8/ryTHEqtAiUzrws1+0ZlpRDhU6+OnbulHHi8Gy84LNSv0YUhlTQ9ji1Q2vpdjvBd7joi0hi6HnYjIs/bQQYdd7GxspjpFqFWC62Uv/4RFawWcsGYW8XQB2UKOiuWyyUj8gf2k5O8pL5BlES9v5ogtfvFmw+LgNpNWnmDvmhUjJEDRN0a2X2+KX8dtB+Kbg59QTLNY0bngOjVDl

Probability quantifies uncertainty. These rules come from set theory — unions and intersections. Counting techniques like combinatorics (permutations and combinations) are essential for computing probabilities in finite sample spaces.

Conditional Probability & Bayes' Theorem

P(A|B) = P(A ∩ B)/P(B)
Independent events: P(A ∩ B) = P(A)·P(B)
Bayes: P(A|B) = P(B|A)·P(A)/P(B)

Bayes' Theorem is the backbone of Bayesian inference and machine learning. It lets us update beliefs with new evidence. Total probability connects this to partition: P(B) = Σ P(B|Aᵢ)·P(Aᵢ).

Example: Disease Testing

Disease prevalence: 1%. Test sensitivity: 99%. False positive rate: 5%.

P(Disease | Positive) = (0.99×0.01)/(0.99×0.01 + 0.05×0.99) ≈ 0.0099/0.0594 ≈ 16.7%

Even with a good test, a positive result is only 16.7% likely to be a true positive when the disease is rare!

TIrTS1wRZZ03Q4uduwyZLkBE/EcQpMoiVU4FYX7iKEidiTROXw7jJaexSxgVzFC0tuAcoj4kHYpFkaf2Vs4+tDFIWnQCYlCColadCujW/ThN9y3mGu3UEBVPQtvBUQs3GaNOO+MOi9MY+wGJilpNZnIIY3NnyicyTvNTG+AZrwR4Y50ZPVe4wFQqtJ53pz7JfATjEO1h08Lqnxb8Shkde7IN9WoYQgMr6vpOaaap9hkROnzbvdleeEOecgXAMQbz+oiYoynvYwxNYGP0G88E39/amF+V8MvRmG6zolQv8tbzr5y4OZVL+8TDvkZx4GT5BU04dJr273BR1JGcBeglvZW2RGepzThDs71RDQMn+W6q5xiZxeHcZu6VE23uarBnzR4EnY1wVKj/uT+9ThNBHVCWy448nyzayEHAxkHY6CVmo73j6amp6HDFZQHnl5Vq0VZA9vEZJU6UQNWECWXYYuZwn3evrs8jkvWKhIXUnjmG4xjmMCa9sx3EF+nQy6rKIvHILNRESI3VlfqJB/iTfrfkxpgJ3guU366mnlkMKHV9eCW+XTS5VzFp3BjlgJEadhCzveKb0Uf30T/QlyZxKZvEB0aXzhXgBoibY19Km9UzOHZcmolLxn/pIMuj4woTtdJ5DNYDc6cmIXs7+QGVQDoYGvBcc36hay/oRFSTkMlsQepm6uNEXEoRMGQ5IWCSCHk0NnuoRYs8H46qT+VhpEOht8sr8dyMwObV4q6Wxk7FuiX+H3RsOAAmvdXdPFBxThsAl07tHQgUvej9BOwwkjc8BzfRy7GgOsjXrXCTR9d55DeHQt/u2IQOf1gzeXjs9nsh5LLq0yVvzkJJLqg94L/YHU=

Discrete Distributions

  • Bernoulli: Single trial, p = success. E[X] = p, Var = p(1−p)
  • Binomial(n, p): P(X = k) = C(n,k)·pᵏ(1−p)ⁿ⁻ᵏ — uses polynomial expansion
  • Poisson(λ): P(X = k) = e⁻λ·λᵏ/k! — models rare events per interval
  • Geometric(p): P(X = k) = (1−p)ᵏ⁻¹·p — trials until first success. Connects to exponential functions

Continuous Distributions

Normal: f(x) = (1/σ√(2π))·e^(−(x−μ)²/(2σ²))
Z-score: z = (x − μ)/σ
Standard normal: μ = 0, σ = 1

For continuous distributions, probabilities are areas under the curve — you need integration. The normal (Gaussian) distribution is the most important, governing everything from measurement error to stock prices.

  • Uniform(a, b): f(x) = 1/(b−a) — constant density
  • PW0o7hp+C9AZBVqfhbxU1CMWDucPf92OslVBEOX6+YfQ+jCaNrfS2V/RqvidlnfcxwEDyehmQnrOph5X8ZnQYCyMP4I0gmIZLnWHtJg4IVPwHpW7Ul9HNfex8eEHlL486NC9pxShc0XgDuuk6Rn8cciKbBnxWS3l7HXEII+OAB/F0EahCrds01JcxnoD9q3W3xAbmKtuJ29H84KN/miXlmtffqnzESKj3RqDwa1EfLzXY4r2Om+BxaPR+in/urr8xrYY2qveRvqVrSuWxWN886MQG5hrdEIfdyZw5PD8bnSxr2+qgv9k+6EZ+CbE1xP8mRyJ9jrbEghxJ6LQ92aNDEonI1+/222QryINZgzWIKGrIIewlgBS93XxXKLn/A7vMtmJLUwNBdGMgjTogKAvL7p59wuUJ4UyFqydLjlzqKEC6dEsVQrVAbvcwV9Z4NkmKKMdXBOf1dDC/KgTNvvJTAsmjRXw94c7drTDxITDvwg2xL1C7VdHrVD46u4lOPi+gmvGx3hq8mfXTaI+CE+/y7Nyiu2f49kbau8O0WYPJJp32zcn7XqrJXcfVtVa2zp9vi9AO2Q/gV2TYPPlyUXyWy96BLDk7WeSiSpUGeVTGE/S1xTJkA6Gtdpah6gJQCOSkF0FiuN7mq75/v6n6agmyTHlw77BF5oWO+TPzZMgw9NCfER2DXD2T6Aos6OS1icAsBOS5jS1IzYMTWwECUDY0oF5OQwmJRqcsD6cV4B4Nkejq+LBvof2GEl3Hz3Ck+xPrhVxs7H35832liez16uBCxaJjAuiiV4LXaHwOM/8zOolX+WcW1/bttqUrFiYX9xilH0cNIeaeOmWegJOVagAt5VVolEx
  • Exponential(λ): f(x) = λe⁻λˣ — time between events
  • t-distribution: Used in hypothesis testing with small samples
  • Chi-squared: Sum of squared standard normals — used in goodness-of-fit tests

Central Limit Theorem

Central Limit Theorem: Regardless of the population distribution, the sample mean X̄ approaches a normal distribution N(μ, σ²/n) as n → ∞. This is why the normal distribution is so important, and why statistical inference works. The convergence concept mirrors limits in calculus.
C5nfzgHNNzqmaAok87FPg3Uu/04W0DlSTB3dcwYpsnlUrC8UPIgyqC1IilO1xP/04tRpITn2DiWpfNZZiDMAeFJDwayAl2qjMBvWL7pwuZU7utUx3c1IwnvQPpF1t332DjDAV+9iEWRmVSTl3XAoi1nfKZq3nRLxOmBynsU219VlnmfVRLPMY5uW6IyNce2eJWbHK1Q+bEYlMwxZ+ZRdVhWaoxEkDR+d7OVyRFsZbxct33lemwGK//LP80MeC64q/kTcnFfi5iOZKk2i4s6T6Dc45q1n+lSAfbwcuvBz4DV4JUP83+QmM5yrbcM6pg+fLx9pWxoh6yeaU4RbOYRYVyX7e3u4egf6axIPnuTeuNVaaJMQWUFtiGJEvgR3+GS8kfTxIMtkF2kZlgRibfC9ht2SUHjRMU9tRr6ZvV9jQUZjeqC/ho2mvGkN7d+99lhh9Pit93nlw9PNwLcOdm9SBgJpRb/IYNW543u4BcjiB/cD5vksEQX2WEqtgb3j7obnUqt7X6TOv+G+ee3+Zg0oFPErZ0BoAl0MxzSDPgRnnvUma8rBX81BVQ+bEz53bCkk4J99k+UNq0g8g8xFsZpj49UqJOMcpx4++RdP9TXzPBEvqQlL/ir/0g4dbvxXXanIIWmRzbl4jg2K2qG9V1EQN6gWltxYpnq7p1PLWEGKdmizr7qXzBsNRN+3rQxaPNgYqvFbpvtdtVgClxS3+DIelsx6pSyn3+YFgctuk65XqsuQSwO4Uj4VL2RlsfCMy6BpRrqbz2oGjsHh6OfCYgEQEBZ4pplbE3ASe2K22bY472KQdReD3EoefNor8XwCwpFkKBG/WuSCLWg+y/xGveJRMoagIIl xBZqdZD2quD3xeN27aSbnHANOMK5Ats7qdW360GRcfoxN4i/fDs2Dhxj44aIgv6nwqqCRA++OruPz0rSaz+k0vSvLcWvwMj/kJFuVGfV3IytfbblNZ0dxoPRhT67xH5iq2X/9pMWWgWacfHocemFUanUy+tOBS8mFNYWNUM7QTOffI+uC7n0MwtbsnDbsgfkDj7Y4aI9PTDV4aNjK34mKkohGCzY2wbY5gJ1oik78eq3iwxdDmDamfyjksNPvtXXtlg33PVfhQ2uWubnNEcrsa0/zu45XSxT4klzu1zhmlaf7zeryHtmzxh1n3IjxKNu/HLNqIeg3ZWCppABOeZMIf3X/o4x2bcusNs0ih2uahFeLdpWZRRUzNwAXDhPFlLGRsvHlDVfcFafG1aekWAVTZ/GaDWHiqrufUrNy5GXi+04uYKhfXylvLuznPfcNCOZKY4IO8nonymlcyNH7xp7ofqxbPFbiu0Y9whjYF1fqKYzZh8XQjwgV7FzIZx+HUG5Y8zXwz2iQNgOnK2hQD+BAXu8JElr4EQBF7fDdIxMMCfVIww4scufS5gtI/K+FXxXtLyX+4eM7JEXw67t8XjWuHTD/IKUiKtBYWpgK3rPCrJ03xmERhMxECK7NJ4nJDd+tPIgD0Ky/80X1JqNMjdMBviKoeJuKOMtYu3SS4YB7Ss52ZyrFK5gWOJ1QOkPGe0wC6RbfzOxOsZMFCTy6rYys9G9el84o93yfCt02ajUux66QStY97Xvg8LsACA6+t5b/IztJxdNTEaY1pzmoxskyyC3/ArJo1nPmSlAMPa7+0FeCupRZGV6A3s/V1SlZALzw3jP3DcdxM6r/Yzswpg01WYppScl