<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Stats on Arshad Siddiqui</title><link>https://arshadhs.github.io/tags/stats/</link><description>Recent content in Stats on Arshad Siddiqui</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://arshadhs.github.io/tags/stats/index.xml" rel="self" type="application/rss+xml"/><item><title>Hypothesis Testing</title><link>https://arshadhs.github.io/docs/ai/statistics/040-hypothesis-testing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://arshadhs.github.io/docs/ai/statistics/040-hypothesis-testing/</guid><description>&lt;h1 id="hypothesis-testing">
 Hypothesis Testing
 
 &lt;a class="anchor" href="#hypothesis-testing">#&lt;/a>
 
&lt;/h1>
&lt;p>Hypothesis testing is a statistical decision-making method used to decide whether sample evidence is strong enough to reject an initial assumption about a population.&lt;/p>
&lt;p>It connects probability, sampling distributions, confidence intervals, significance levels, and decision rules.&lt;/p>
&lt;blockquote class="book-hint info">
&lt;p>&lt;strong>Key takeaway:&lt;/strong>&lt;br>
Hypothesis testing is not about proving something with certainty.&lt;/p>
&lt;p>It is about asking:&lt;/p>

&lt;blockquote class='book-hint '>
 &lt;p>If the null hypothesis were true, how surprising would this sample result be?&lt;/p></description></item><item><title>Prediction &amp; Forecasting</title><link>https://arshadhs.github.io/docs/ai/statistics/050-prediction-forecasting/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://arshadhs.github.io/docs/ai/statistics/050-prediction-forecasting/</guid><description>&lt;h1 id="prediction--forecasting">
 Prediction &amp;amp; Forecasting
 
 &lt;a class="anchor" href="#prediction--forecasting">#&lt;/a>
 
&lt;/h1>
&lt;p>Prediction and forecasting use statistical models to estimate unknown or future values.&lt;/p>
&lt;p>In this module, the focus is on correlation, regression, and time series forecasting.&lt;/p>
&lt;blockquote class="book-hint info">
&lt;p>&lt;strong>Key takeaway:&lt;/strong>&lt;br>
Prediction estimates a value using a model.&lt;/p>
&lt;p>Forecasting is prediction where the order of time matters.&lt;/p>
&lt;/blockquote>
&lt;ul>
&lt;li>Correlation&lt;/li>
&lt;li>Regression&lt;/li>
&lt;li>Time series analysis&lt;/li>
&lt;li>Components of time series data&lt;/li>
&lt;li>Moving average and weighted moving average&lt;/li>
&lt;li>AR model&lt;/li>
&lt;li>ARMA model&lt;/li>
&lt;li>ARIMA model&lt;/li>
&lt;li>SARIMA and SARIMAX&lt;/li>
&lt;li>VAR and VARMAX&lt;/li>
&lt;li>Simple exponential smoothing&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="prediction-vs-forecasting-">
 Prediction vs Forecasting ☆
 
 &lt;a class="anchor" href="#prediction-vs-forecasting-">#&lt;/a>
 
&lt;/h2>
&lt;table>
 &lt;thead>
 &lt;tr>
 &lt;th>Concept&lt;/th>
 &lt;th>Meaning&lt;/th>
 &lt;th>Example&lt;/th>
 &lt;/tr>
 &lt;/thead>
 &lt;tbody>
 &lt;tr>
 &lt;td>Prediction&lt;/td>
 &lt;td>Estimate an unknown output&lt;/td>
 &lt;td>Predict house price from area and rooms&lt;/td>
 &lt;/tr>
 &lt;tr>
 &lt;td>Forecasting&lt;/td>
 &lt;td>Predict future values using time order&lt;/td>
 &lt;td>Forecast sales for next month&lt;/td>
 &lt;/tr>
 &lt;/tbody>
&lt;/table>
&lt;span style="color: red;">
 All forecasting is prediction, but not all prediction is forecasting.
&lt;/span>
&lt;hr>
&lt;h2 id="overall-workflow">
 Overall Workflow
 
 &lt;a class="anchor" href="#overall-workflow">#&lt;/a>
 
&lt;/h2>


&lt;pre class="mermaid">
flowchart LR
 A[Data] --&amp;gt; B[Explore Pattern]
 B --&amp;gt; C[Choose Model]
 C --&amp;gt; D[Train or Fit]
 D --&amp;gt; E[Validate]
 E --&amp;gt; F[Predict or Forecast]
 F --&amp;gt; G[Interpret Error]

 style A fill:#E1F5FE
 style B fill:#C8E6C9
 style C fill:#FFF9C4
 style D fill:#EDE7F6
 style E fill:#C8E6C9
 style F fill:#E1F5FE
 style G fill:#FFF9C4
&lt;/pre>

&lt;hr>
&lt;h2 id="correlation-">
 Correlation ☆
 
 &lt;a class="anchor" href="#correlation-">#&lt;/a>
 
&lt;/h2>
&lt;p>Correlation measures the direction and strength of linear relationship between two variables.&lt;/p></description></item><item><title>Gaussian Mixture Model &amp; Expectation Maximization</title><link>https://arshadhs.github.io/docs/ai/statistics/060-gaussian-mixture-model-em/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://arshadhs.github.io/docs/ai/statistics/060-gaussian-mixture-model-em/</guid><description>&lt;h1 id="gaussian-mixture-model--expectation-maximization">
 Gaussian Mixture Model &amp;amp; Expectation Maximization
 
 &lt;a class="anchor" href="#gaussian-mixture-model--expectation-maximization">#&lt;/a>
 
&lt;/h1>
&lt;p>A Gaussian Mixture Model represents data as a weighted combination of multiple Gaussian distributions.&lt;/p>
&lt;p>It is commonly used for soft clustering and density estimation.&lt;/p>
&lt;blockquote class="book-hint info">
&lt;p>&lt;strong>Key takeaway:&lt;/strong>&lt;br>
K-means gives hard cluster membership.&lt;/p>
&lt;p>GMM gives probabilities of belonging to each cluster.&lt;/p>
&lt;/blockquote>
&lt;ul>
&lt;li>Gaussian Mixture Model&lt;/li>
&lt;li>soft clustering&lt;/li>
&lt;li>mixing coefficients&lt;/li>
&lt;li>latent variables&lt;/li>
&lt;li>likelihood and log-likelihood&lt;/li>
&lt;li>Expectation-Maximization algorithm&lt;/li>
&lt;li>E-step and M-step&lt;/li>
&lt;li>responsibilities&lt;/li>
&lt;li>convergence&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="motivation-">
 Motivation ☆
 
 &lt;a class="anchor" href="#motivation-">#&lt;/a>
 
&lt;/h2>
&lt;p>Many real datasets are not described well by one Gaussian distribution.&lt;/p></description></item></channel></rss>