<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Regularisation on Arshad Siddiqui</title><link>https://arshadhs.github.io/tags/regularisation/</link><description>Recent content in Regularisation on Arshad Siddiqui</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://arshadhs.github.io/tags/regularisation/index.xml" rel="self" type="application/rss+xml"/><item><title>Regularisation for Deep models</title><link>https://arshadhs.github.io/docs/ai/deep-learning/110-regularisation-deep-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://arshadhs.github.io/docs/ai/deep-learning/110-regularisation-deep-models/</guid><description>&lt;h1 id="regularisation-for-deep-models">
 Regularisation for Deep models
 
 &lt;a class="anchor" href="#regularisation-for-deep-models">#&lt;/a>
 
&lt;/h1>
&lt;p>Regularisation means adding constraints or techniques that prevent a model from becoming too complex and memorising the training data.&lt;/p>
&lt;p>The goal is not only low training error.&lt;/p>
&lt;p>The goal is good performance on unseen data.&lt;/p>
&lt;blockquote class="book-hint info">
&lt;p>&lt;strong>Key takeaway:&lt;/strong>&lt;br>
Regularisation helps the model generalise by controlling complexity, stabilising training, and reducing overfitting.&lt;/p>
&lt;/blockquote>
&lt;ul>
&lt;li>Generalization for regression&lt;/li>
&lt;li>Training Error and Generalization Error&lt;/li>
&lt;li>Underfitting or Overfitting&lt;/li>
&lt;li>Model Selection&lt;/li>
&lt;li>Weight Decay and Norms&lt;/li>
&lt;li>Generalization in Classification&lt;/li>
&lt;li>Environment and Distribution Shift&lt;/li>
&lt;li>Generalization in Deep Learning&lt;/li>
&lt;li>Dropout&lt;/li>
&lt;li>Batch Normalization&lt;/li>
&lt;li>Layer Normalization&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="underfitting-good-fit-and-overfitting-">
 Underfitting, Good Fit, and Overfitting ☆
 
 &lt;a class="anchor" href="#underfitting-good-fit-and-overfitting-">#&lt;/a>
 
&lt;/h2>
&lt;table>
 &lt;thead>
 &lt;tr>
 &lt;th>Case&lt;/th>
 &lt;th>Model behaviour&lt;/th>
 &lt;th>Training error&lt;/th>
 &lt;th>Test error&lt;/th>
 &lt;/tr>
 &lt;/thead>
 &lt;tbody>
 &lt;tr>
 &lt;td>Underfitting&lt;/td>
 &lt;td>too simple&lt;/td>
 &lt;td>high&lt;/td>
 &lt;td>high&lt;/td>
 &lt;/tr>
 &lt;tr>
 &lt;td>Good fit&lt;/td>
 &lt;td>captures useful pattern&lt;/td>
 &lt;td>low&lt;/td>
 &lt;td>low&lt;/td>
 &lt;/tr>
 &lt;tr>
 &lt;td>Overfitting&lt;/td>
 &lt;td>memorises training noise&lt;/td>
 &lt;td>very low&lt;/td>
 &lt;td>high&lt;/td>
 &lt;/tr>
 &lt;/tbody>
&lt;/table>


&lt;pre class="mermaid">flowchart LR
 A[&amp;#34;Model Complexity&amp;#34;] --&amp;gt; B[&amp;#34;Too Simple: Underfitting&amp;#34;]
 A --&amp;gt; C[&amp;#34;Just Right: Good Fit&amp;#34;]
 A --&amp;gt; D[&amp;#34;Too Complex: Overfitting&amp;#34;]

 style A fill:#E1F5FE,stroke:#4A90E2
 style B fill:#FFF9C4,stroke:#FBC02D
 style C fill:#C8E6C9,stroke:#43A047
 style D fill:#EDE7F6,stroke:#7E57C2&lt;/pre>
&lt;hr>
&lt;h2 id="training-error-and-generalisation-error-">
 Training Error and Generalisation Error ☆
 
 &lt;a class="anchor" href="#training-error-and-generalisation-error-">#&lt;/a>
 
&lt;/h2>
&lt;p>Training error measures performance on data used for learning.&lt;/p></description></item></channel></rss>