Descriptive Statistics

Summarize, visualize, and understand data before diving into inference.

mu9jADGz/GGs8eCd0G+f03FFvtaKHzSWFYMsQEVu1oSFM8Yk1COtnN2RyBxmzq8+sD9zklikYOOrYEQ62dwkgTdurEvjKPP2VQY4hr2mp6jvBieVAKdk5gJxOxE/h3Uhhh/IO+FnmNtbqVF+hgKz7qQAF4QwLSZRx9fRwcjW1ptX00abaG6Tai9MR4EmfQWeMZQ1d2yi/q+zg7X7/odg/XtmSZcgcjxFJ8rLX9y+YAIQW8WSW1M/8r4L7M97BU3J5mkZ1dbjTigPjNWaKKVH5E8HHzf/aDJD2iZUOkfNPxdhMvuMZsQCTZK9eF8vXUXUxHojGlQZ98btHJwf6MFL/6Gg/GRGzlt2DB6EVusq0M2avBzdgL15x8Z5zqOl45mLVhO4eQgrLz3mtzb1oYLiy6Kw1XFef86APOj+G8wGnraGokVim9WR+FU9oRxVP2OSO9lazdpBDkj5/lcvYW5pQY2HuUSIgdR8ILapcD6blDHB8gDXOxjD3r5ndITY87y/qRBvN5Z5pPuvZ05kZdoICZlOq7Iuv4iCVjTMtfGzmYdcFj/Izj4rZVDnPtMW0MrC/OnxXgERBu0eEDd3K4q1rmkfuTFlNW0N6lhFCC1EUMJ56g7OOt2S7zqd+POI44JkdM3YC4lhrgbG9ZYYtxz5jZKjP6r0xc56/pJkXpZO6ChGssUnKAtd4H9/xE1YnQyOeJoYRkivnQrvhYhLYRyDMTNQ6JeSM71XTKjiInfhvbud4K3uYUnstUxaJnluCIl8Nzzjq7IGyO+stYRJWWF6Ydy9zWz6l8u6iFsKKkIBHvyUlR0416LuFFXn+yT3jD6v0B/oaCsCkK2fyrEL9NUErYfwS7fmUx1QoMAiudzIPWys4yYfgTpVOSzgz7fk/sHAGBzoLg0tvu9QNVu3BUp88kLE9
mjyZ5+NSV4D3tHQS7zEEjTN8WNpLr3VtIh/cqUxE+BT2o5F3Z1IHbf4+52Eonlu8pS6P7oOauBMn1LOdsNZIOIRNf28DxewDzfJy9AV9w8J+J5XXbhojMGm7X3nzowJVmf8aLbets+4NV8txcU+j+62hyrpl1izejSHOUfrcoUL1Ch6izqgM46+dQjwsxTOdQttetBuzgunggGhLaNQzbLIJzuck1Bpuism7V5Ga+rMLTuh9uAyRy5gTUM+tYMdmIgK9A2b1CmkGW+Vb6Xv969GKVSO9Sf/rYK/YYM6YB3hrYQjyf2TSyvptUpnJ+KtigReaVjs0ZX/x997qUZqAtI29Sv/sQGpPIKkbFgrORWL8XuHVW6x8KomN7+iS/3sqKKSEMVDl9aF51bnDIduQCeqM+YmMpV4FK3NUTKRC7d7Ooa4drruNHh3VucMO25AJd7cZ5VlqkcTeHPpwltKrg2nRggHgzlLl02G6bzQWFlgcJ9bfQAesTNBBNdQidBsdbHslpTSz5RDGVQI2DAixfCsO8/+qv26iFTixpyJCJeRYaGAg/wKiXVQ626ZAa5mNnUdQQygq+iLQCAok99xdrI0bA0c4GtNCYh+ZUAXyHyioZnxXveaPJ2p3+hk1mQd48svQc/nZEj5N9sYOc7nvXt5WPEKaj1VhUBZSGbW06Afgp0YAIp4C31jU/i2wFNNRM9yMyO3lpgYPt9zkakqp5PLJlxJOlzpZTX3aDoxVbTpm15rKIMwrr5LrLgPkboig6tGQfOYjhhGLP2pXWahFIFV3+fBN8Ss+0yKfNuMtM6o8RgK7eqsj0bD/rISUN9baKpFf+ZQry1HuBfezq3kSTNuniEB7x5niLHLVsyftji8x40rCBrjORT2Sk0l6geeS7RGxafUJqDqpN6kse2lGaXVF

Measures of Center

Mean: x̄ = (Σxᵢ)/n
Median: middle value when sorted
Mode: most frequent value

The mean uses algebraic operations; it's sensitive to outliers. The median is more robust. For symmetric distributions, mean ≈ median ≈ mode.

Measures of Spread

Variance: σ² = Σ(xᵢ − x̄)²/n
Standard deviation: σ = √(σ²)
Range: max − min
IQR: Q3 − Q1

Variance measures average squared deviation from the mean. Standard deviation has the same units as the data — it's the most widely used spread measure. These connect to the normal distribution via the 68-95-99.7 rule.

WMP0WmM3NDozAzKVM1utXOo5F9plZYmTCGWRaChfKbq7O6NqK5r/oCEDCUEA6hC55m5wcObo/PIWXuEDzPXEdFRPnOAQxN5hE+cZkg4X99+hnfcz4UiCZwGV2YEePms45mc8md1fLesftIeqqb3SlPDwayYw2ovl0IAZYbDRBECSeVTnE5OOEwOXFkzTuU2OpEJqsiJ32i97NNSc60Se7EjfzM1xVD1MFB+495X0dKQE32MVb3vbM0pJl9K0zi+aBlzIYweMGXS68ktdGmrkCFG0BNO7YjmQbirTqRJ3lz8IZpGmF4BgXy6Bg5x92UPA7UJMMD+hvWXVqt910M50iKsH1jJ8KV3x9H7p71OmIf773VHFBuVZul/7dSoSh/aHCpA4M0GSeWDheCWrTMB4r4q6u7mT+vWDbCpNLTLfTwlzPjV8f7Wh3YZEvETSz+qL9LCb5Fq7Km/euzwjtg3jOcuCeUv7beew1E3g9ee+KBR7jESsjaNsWPlO39Y9Ccx7eA+50ettYeJoFiaeAXxL0ZqBchlQze0ItKLXNfFu9HpmZ7jRiWBPC3eBLg+rkMJqQvKB98zRh1nepJRrWa8M0tGSpxMkIsEvaht5FMjxSmrXLlyF3rYtputWd5qphzinCbNK8KcJZmVs1bjRybCdA3nsL6ikyeHOOFZKR15rnM/6Nz4K0QKhXGHQ1NLlNDj7du7DLdM/pibgZVOoM+TGATBL5muDBJ+ZPMIYJksxcFtWed4L6HA66FgOjFADYWlTaGve0lusfCqGHeeH9VJh2VZbhrMgK8G8LDD4UKKB1F0TLOgrug+1iuiCZHmk7n8sb7EV71ONudOF/eNudtCXv0ulyPC+RACZr3Tsjkmz0zszSVS0wByqSC3HlOxmavrsMx1yamDEjX8hq6qjHKQIlA==

Example: Data: 4, 7, 8, 10, 11

Mean = 40/5 = 8

Deviations: −4, −1, 0, 2, 3. Squared: 16, 1, 0, 4, 9

Variance = 30/5 = 6. SD = √6 ≈ 2.45

Data Visualization

  • Histograms: Show distribution shape and frequency (area under the curve connects to integration)
  • Box plots: Display median, quartiles, and outliers
  • Scatter plots: Show relationships between two variables → regression
  • Bar/pie charts: Compare categorical data

Distribution Shape

Skewness describes asymmetry: right-skewed (mean > median, long right tail), left-skewed (mean < median). Kurtosis describes tail heaviness. The normal distribution has skewness 0 and kurtosis 3 (by convention, "excess kurtosis" = 0).

Descriptive statistics is the first step of any data analysis. Before fitting regression models or running hypothesis tests, always visualize and summarize your data. As the saying goes: "Plot your data."
xBZqdZD2quD3xeOW7USjnHBLzPetWWfs41HUHcq3I/1yc8xbKyE0DpwY+E2BM7Ckr7v3+EKf6LMO2OX8a8r5j1BRcZkfe3KnbNjWXN7WUY5Dv9krSVydl7rkhUBKBKPjnEJje1A1PV8PEhF0/QUnWxJoBkeDFa3CNHQyxUGLgO947ZQDZD95FboGtdw8LXvmSbUJWcy17MoADlD8P5MmdxI29RZJ/oEucbEhkG2tqgDOW7zl57xJn7yj8p0nTlalbf83QyKaBA2sUTrSc/Tf3cq6KAEBEh3Xj/eGf7gOxi6WuycrHtqHa5503LlW9MkJqiBnqN5LUezX7GoRReBOhoO7281ktfXaaUQmfwThiIBMmhbhKP2+UOEy7EcNaTaZAdM3HPQnAE5gAYlyf7uj8lVD9wWKRcP90pABSOTFm+44pWtN5S1I42LckS9ZjAR5yuHKFTLB90J7MlV4wFu50quV6XZxxHXxSw9/Vt2AWr5nwb4R+T9Kn7zjj7M3fvnd4UsboEiggHvNNJ3FkxjjYVO2O6u+NX0Gacq+i4sCbUBKHg0ew749e6Akdkw8OEhlWj+GM4IOZixNyOPg0R3guC029eLN8BXAVlurZSaqYdbweaf4bjmwmlnHGu+9Y/5z4EOjbKXlt0m/Cy2fgOKZkET18kDw+UcxIEe2IxYgtcdLWeuJE8XIyRaR5jGP8UTztJ3rNA5vShsRa88YU/MsPsIwY7R0dEAcu+2f10g8DZSKAfZsy4XdfcLNXHcIw1V8IfBwhLiDEov7FZjCtGjOPQ0mCpmVyNJ1R9M0eBrecx1S22hagb9Cx1QIE+q1eD4BnEwBO0HAU3yyZPZYXcY1aIpKtYPAEN+T1rPk+TbYqRyrnuCBc/qH+5WSXZE91Abr0fQXoh2jA453C++JD2fCnJ5wK
AyW7qrCkobP8RLkGGGxYZxEEJ/lcphegDuM8KSuSIloLQpsvloCMnjDF47htP7OWPknqv9HdQKwzRJBYN3uq6u3hi124glpv9W9ySu+DSXxAy2vovi/BHCOXY6W3/OEReBis88Snn8R13PcyYgb+C+rMwgDj6wJlWuA3QJW7barNbMtNG//WVuPq3l3bdynNu4zkyKGG4FNk/FZb4VO02zzsB1dc+mfV12spA5FhhJon9KTwTm91dSExVzwlndG/pA8Abib5mdryUPgq/c0AGXYbsygUfdXzZy6fbb9pqgkuHmATpclQUKQwgJ2zsH3xWoYGtz4j9kt20P4YGqVlLO3FIjYUSpqOtJkba7YZMExFY1dqnStgtKxgUf8CGFOKCgwEj2K8inuduNDK7cOP8xhUqVNMmXkE2HbkhU1/XSUEHs4uD8d9XAWPTtXWKXBRJ0JZuVnrN5OLL7+GpDOqUqvlWlm+N+HfCtaO5qoK2RAGBiJ7XimDtZKomedEi1U6RZz7Esskx1QvMb5naggvwZLN3XibZvRTDfaYAzZtoilTl8iAfiiA0GhjUWSSocbTJ1Z4jhsHdl3V1+tat2qiR/Hz/dt74tO26uUWqhy6x/GmEdekG4f4Rv+wuIBywW7xwDTfsJvcYf16bjVJMgruf4v58ZybPShsMG6S4d3jRID8qxADCzEfvfeJ9pngky8SVPeoHJdW5Y3CZtDMDzCEOW0NRsaen9lqmsOraaqKTzHL+Vok5KFllKXh8Rh2KY39hRIBxUk1HjuVCXjzTpROKRsZXgcyKRAB0aX3+qFkcgG0LzlieBZZoEnha+C9yj6cnITzLa0C7GFxmTgIYJTEoHFDfbyP5LyZwlMLITpxEIU4XwE3mCqwdTFhRqaJlxU4OX73TECcYD22nXZVZBUQhPbZ