From 4e58f2d643759d6628e33e8acda6d609e512fe6b Mon Sep 17 00:00:00 2001
From: yanirs
Date: Mon, 9 Sep 2024 00:56:47 +0000
Subject: [PATCH] deploy: dd7b0d6e75ef6b21582eaa8979f3c4640706fa07

diff --git a/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/index.html b/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/index.html

[Figure: Gradient Boosting out of bag experiment (toy dataset)]

My approach (TSO) beat both 5-fold cross-validation (CV) and the GBM/scikit-learn method (SKO), as TSO obtains its minimum at the closest number of iterations to the test set’s (T) optimal value.

The next step in testing TSO’s viability was to rerun Ridgeway’s experiments from Section 3.3 of the GBM documentation (R code here). I used the same 12 UCI datasets that Ridgeway used, running 5×2 cross-validation on each one. For each dataset, the score was obtained by dividing the mean loss of the best method on the dataset by the loss of each method. Hence, all scores are between 0.0 and 1.0, with the best score being 1.0. The following figure summarises the results on the 12 datasets.
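This relative scoring can be sketched in a few lines of Python; the loss values below are made up for illustration, not taken from the experiments:

```python
# Hypothetical mean losses for each method on a single dataset
# (illustrative values only, not results from the actual runs).
mean_losses = {"CV": 0.21, "SKO": 0.25, "TSO": 0.23}

# Score each method relative to the best (lowest-loss) method on the dataset,
# so all scores fall in (0, 1] and the best method scores exactly 1.0.
best_loss = min(mean_losses.values())
scores = {method: best_loss / loss for method, loss in mean_losses.items()}
```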

Gradient Boosting out of bag experiment (UCI datasets)

The following table shows the raw data that was used to produce the figure.

Dataset          CV      SKO     TSO
creditrating     0.9962  0.9771  1
breastcancer     1       0.6675  0.4869
mushrooms        0.9588  0.9963  1
abalone          1       0.9754  0.9963
ionosphere       0.9919  1       0.8129
diabetes         1       0.9869  0.9985
autoprices       1       0.9565  0.5839
autompg          1       0.8753  0.9948
bostonhousing    1       0.8299  0.5412
haberman         1       0.9793  0.9266
cpuperformance   0.9934  0.9160  1
adult            1       0.9824  0.9991

The main finding is that CV remains the most reliable approach. Even when CV is not the best-performing method, it’s not much worse than the best method (this is in line with Ridgeway’s findings). TSO yielded the best results on 3/12 of the datasets, and beat SKO 7/12 times. However, TSO’s results are the most variable of the three methods: when it fails, it often yields very poor results.

In conclusion, stick to cross-validation for the best results. It’s more computationally intensive than SKO and TSO, but can be parallelised. I still think that there may be a way to avoid cross-validation, perhaps by extending SKO/TSO in more intelligent ways (see some interesting ideas by Eugene Dubossarsky here and here). Any comments/ideas are very welcome.


diff --git a/2015/07/06/learning-about-deep-learning-through-album-cover-classification/index.html b/2015/07/06/learning-about-deep-learning-through-album-cover-classification/index.html

    --dataset-path /path/to/dataset \
    --model-architecture AlexNet \
    --model-params lc0_num_filters=64

    There are many more command-line flags (possibly too many), which make it easy both to tinker with various settings and to run more rigorous experiments. My initial tinkering with convnets didn’t yield impressive results in terms of predictive accuracy on my dataset. It turned out that this was partly due to the lack of preprocessing – the less exciting but crucial part of any predictive modelling work.

    The importance of preprocessing

    My initial focus was on getting things to work on the dataset without worrying too much about preprocessing. I hadn’t done any image classification work before, so I had to learn about the right type of preprocessing to use. I kept it pretty simple and applied a handful of standard transformations.

    Baselines

    After building the experimental environment and a fair bit of tinkering, I decided it was time for some more serious experiments. The results of my initial games were rather disappointing – slightly better than a random baseline, which yields an accuracy score of 10%. Therefore, I ran some baselines to get an idea of what’s possible on this dataset.

    The first baseline I tried was a random forest with 1,000 trees, which yielded 15.25% accuracy. This baseline was trained directly on the pixel values without any preprocessing other than downsampling. It’s worth noting that the downsampling size didn’t make much of a difference to this baseline (I tried a few values in the range 50×50-350×350). This baseline was also not particularly sensitive to whether RGB or grayscale values were used to represent the images.
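A rough sketch of such a baseline with scikit-learn, using random synthetic "images" in place of the album covers and fewer trees to keep it quick (everything here is illustrative, not the original experiment code):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Tiny synthetic stand-in for the album-cover data:
# flattened grayscale images downsampled to 50x50 pixels, 10 genre labels.
rng = np.random.default_rng(0)
X = rng.random((200, 50 * 50))
y = rng.integers(0, 10, size=200)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The post used 1,000 trees; 100 keeps this sketch fast.
clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)
```

On this random data the accuracy hovers around the 10% chance level; the real covers carried enough signal to reach 15.25%.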

    The next experiments were with baselines that utilised pretrained Caffe models. Training a random forest with 1,000 trees on features extracted from the highest fully-connected layer (fc7) in the CaffeNet and VGGNet-19 models yielded accuracies of 16.72% and 16.40% respectively. This was pretty disappointing, as I expected these features to perform much better. The reason may be that album covers are very different from ImageNet images, and the representations in fc7 are too specific to ImageNet. Indeed, when fine-tuning the CaffeNet model (following the procedure outlined here), I got the best accuracy on the dataset: 22.60%. Using Caffe to train the same network from scratch didn’t even get close to this accuracy. However, I didn’t try to tune Caffe’s learning parameters. Instead, I went back to running experiments with my code.

    It’s worth noting that the classes identified by the CaffeNet model often have little to do with the actual content of the image. Better baseline results may be obtained by using models that were pretrained on a richer dataset than ImageNet. The following table presents three example covers together with the top-five classes identified by the CaffeNet model for each image. The tags assigned by Clarifai’s API are also presented for comparison. From this example, it looks like Clarifai’s model is more successful at identifying the correct elements than the CaffeNet model, indicating that a baseline that uses the Clarifai tags may yield competitive performance.

    Album: October by Wille P (hiphop_rap)
    CaffeNet: digital clock, spotlight, jack-o’-lantern, volcano, traffic light
    Clarifai: tree, landscape, sunset, desert, sun, sunrise, nature, evening, sky, travel

    Album: Demo by Blackrat (metal)
    CaffeNet: spider web, barn spider, chain, bubble, fountain
    Clarifai: skull, bone, nobody, death, vector, help, horror, medicine, black and white, tattoo

    Album: The Kool-Aid Album by Mr. Merge (soul)
    CaffeNet: dishrag, paper towel, honeycomb, envelope, chain mail
    Clarifai: symbol, nobody, sign, illustration, color, flag, text, stripes, business, character

    Training from scratch

    My initial experiments were with various convnet architectures, where I manually varied the filter sizes and number of layers to have a reasonable number of parameters and ensure that the model is trainable on a GPU with 4GB of memory. As mentioned, this approach yielded unimpressive results. Following the relative success of the fine-tuned CaffeNet baseline, I decided to run more rigorous experiments on variants of AlexNet (which is very similar to CaffeNet).

    Given the large number of hyperparameters that need to be set when training deep convnets, I realised that setting values manually or via grid search is unlikely to yield the best results. To address this, I used hyperopt to search for the best configuration of values. The hyperparameters that were included in the search were the learning method (Nesterov momentum versus Adam with their respective parameters), the learning rate, whether crops are mirrored or not, the number of crops to use (1 or 5), dropout probabilities, the number of hidden units in the fully-connected layers, and the number of filters in each convolutional layer.

    Each configuration suggested by hyperopt was trained for 10 epochs, and the promising setups were trained until results stopped improving. The results of the search were rather disappointing, with the best accuracy being 17.19%. However, I learned a lot by finding hyperparameters in this manner – in the past I’ve only used a combination of manual settings with grid search.

    There are many possible reasons why the results are so poor. It could be that there’s just too little data to train a good classifier, which is supported by the inability to beat the fine-tuned results. This is in line with the results obtained by Zeiler and Fergus (2013), who found that convnets pretrained on ImageNet performed much better on the Caltech-101 and Caltech-256 datasets than the same networks trained from scratch. However, it could also be that I just didn’t run enough experiments – I definitely feel like I haven’t explored everything as well as I’d like. In addition, I’m still building my intuition for what works and why. I should work more on visualising the way the network learns to uncover more hidden gotchas in addition to those I’ve already found. Finally, it could be that it’s just too hard to distinguish between covers from the genres I chose for the study.

    Ideas for future work

    There are many avenues for improving on the work I’ve done so far. The code could definitely be made more robust and better tested, optimised and parallelised. It would be worth investing more in hyperparameter and architecture search, including incorporation of ideas from non-vanilla convnets (e.g., GoogLeNet). This search should be guided by visualisation and a deeper understanding of the trained networks, which may also come from analysing class-level accuracy (certain genres seem to be easier to distinguish than others). In addition, more sophisticated preprocessing may yield improved results.

    If the goal were to get the best possible performance on my dataset, I’d invest in establishing the human performance baseline on the dataset by running some tests with Mechanical Turk. My guess is that humans would perform better than the algorithms tested so far due to access to external knowledge. Therefore, incorporating external knowledge in the form of manual features or additional data sources may yield the most substantial performance boosts. For example, text on an album cover may contain important clues about its genre, and models pretrained on style datasets may be more suitable than ImageNet models. In addition, it may be beneficial to use a model to detect multiple elements in images where the universe is not restricted to ImageNet classes. This approach was taken by Alexandre Passant, who used Clarifai’s API to tag and classify doom metal and K-pop album covers. Finally, using several different models in an ensemble is likely to help squeeze a bit more accuracy out of the dataset.

    Another direction that may be worth exploring is using image data for recommendation work. The reason I chose to work on this problem was my exposure to album covers through my work on Bandcamp Recommender – a music recommendation system. It is well-known that visual elements influence the way users interact with recommender systems. This is especially true in Bandcamp Recommender’s case, as users see the album covers before they choose to play them. This leads me to conjecture that considering features that describe the album covers when generating recommendations would increase user interaction with the system. However, it’s hard to tell whether it’d increase the overall relevance of the results. You can’t judge an album by its cover. Or can you…?

    Conclusion

    While I’ve learned a lot from working on this project, there’s still much more to discover. It was especially great to learn some generally-applicable lessons about hyperparameter optimisation and improvements to vanilla gradient descent. Despite the many potential ways of improving performance on my dataset, my next steps in the field would probably include working on problems for which obtaining a good solution is feasible and useful. For example, I have some ideas for applications to marine creature identification.

    Feedback and suggestions are always welcome. Please feel free to contact me privately or via the comments section.

    Acknowledgement: Thanks to Brian Basham and Diogo Moitinho de Almeida for useful tips and discussions.


diff --git a/2015/10/02/the-wonderful-world-of-recommender-systems/index.html b/2015/10/02/the-wonderful-world-of-recommender-systems/index.html

      [Figure: Hynt recommendation widget]

      Hynt is a recommender-system-as-a-service for e-commerce whose development I led up until the middle of last year. The general idea is that merchants simply add a few lines of JavaScript to their shop pages and Hynt does the hard work of recommending relevant items from the store, while considering the user and page context. Going live with Hynt reaffirmed many well-known UI/UX lessons. Most notably:

      • Above the fold is better than below. Engagement with Hynt widgets that were visible without scrolling was higher than those that were lower on the page.
      • More recommendations are better than a few. Hynt widgets are responsive, adapting to the size of the container they’re placed in. Engagement was more likely when more recommendations were displayed, because users were more likely to find something they liked without scrolling through the widget.
      • Fast is better than slow. If recommendations load faster, more people see them, which increases engagement. In Hynt’s case speed was especially important because the widgets load asynchronously after the host page finishes loading.

      Another important UI/UX element is explanations. Displaying a plausible explanation next to a recommendation can do wonders, without making any changes to the underlying recommendation algorithms. The impact of explanations has been studied extensively by Nava Tintarev and Judith Masthoff. They have identified seven different aims of explanations, which are summarised in the following table (reproduced from their survey of explanations in recommender systems).

      Aim              Definition
      Transparency     Explain how the system works
      Scrutability     Allow users to tell the system it is wrong
      Trust            Increase user confidence in the system
      Effectiveness    Help users make good decisions
      Persuasiveness   Convince users to try or buy
      Efficiency       Help users make decisions faster
      Satisfaction     Increase the ease of use or enjoyment

      Explanations are ubiquitous in real-world recommender systems. For example, Amazon uses explanations like “frequently bought together”, and “customers who bought this item also bought”, while Netflix presents different lists of recommendations where each list is driven by a different reason. However, as the following Netflix example shows, it is worth making sure that the explanations you provide don’t make you look stupid.


      [Figure: Amazon frequently bought together]

      Hackers beware: Bootstrap sampling may be harmful

      Bootstrap sampling techniques are very appealing, as they don’t require knowing much about statistics and opaque formulas. Instead, all one needs to do is resample the given data many times, and calculate the desired statistics. Therefore, bootstrapping has been promoted as an easy way of modelling uncertainty to hackers who don’t have much statistical knowledge. For example, the main thesis of the excellent Statistics for Hackers talk by Jake VanderPlas is: “If you can write a for-loop, you can do statistics”. Similar ground was covered by Erik Bernhardsson in The Hacker’s Guide to Uncertainty Estimates, which provides more use cases for bootstrapping (with code examples). However, I’ve learned in the past few weeks that there are quite a few pitfalls in bootstrapping. Much of what I’ve learned is summarised in a paper titled What Teachers Should Know about the Bootstrap: Resampling in the Undergraduate Statistics Curriculum by Tim Hesterberg. I doubt that many hackers would be motivated to read a paper with such a title, so my goal with this post is to make some of my discoveries more accessible to a wider audience. To learn more about the issues raised in this post, it’s worth reading Hesterberg’s paper and other linked resources.

      For quick reference, here’s a summary of the advice in this post:

      • Use an accurate method for estimating confidence intervals
      • Use enough resamples – at least 10-15K
      • Don’t compare confidence intervals visually
      • Ensure that the basic assumptions apply to your situation

      Pitfall #1: Inaccurate confidence intervals

      Confidence intervals are a common way of quantifying the uncertainty in an estimate of a population parameter. The percentile method is one of the simplest bootstrapping approaches for generating confidence intervals. For example, let’s say we have a data sample of size n and we want to estimate a 95% confidence interval for the population mean. We take r bootstrap resamples from the original data sample, where each resample is a sample with replacement of size n. We calculate the mean of each resample and store the means in a sorted array. We then return the 95% confidence interval as the values that fall at the 0.025r and 0.975r indices of the sorted array (i.e., the 2.5% and 97.5% percentiles). The following table shows what the first two resamples may look like for a data sample of size n=5.

              Original sample   Resample #1   Resample #2
      Values  10                30            20
              12                20            20
              20                12            30
              30                12            30
              45                45            30
      Mean    23.4              23.8          26
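The percentile method described above takes only a few lines of NumPy. This is a sketch for the mean of the toy sample, not production code – and, as discussed next, the method itself is inaccurate for samples this small:

```python
import numpy as np

def percentile_ci(sample, num_resamples=10_000, alpha=0.05, seed=0):
    """Percentile-method bootstrap confidence interval for the mean."""
    rng = np.random.default_rng(seed)
    n = len(sample)
    # Draw resamples with replacement and record each resample's mean.
    means = np.sort([rng.choice(sample, size=n, replace=True).mean()
                     for _ in range(num_resamples)])
    # The interval endpoints are the alpha/2 and 1 - alpha/2 percentiles
    # of the sorted resample means.
    lower = means[int(alpha / 2 * num_resamples)]
    upper = means[int((1 - alpha / 2) * num_resamples) - 1]
    return lower, upper

low, high = percentile_ci(np.array([10, 12, 20, 30, 45]))
```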

      The percentile method is nice and simple. Any programmer should be able to easily implement it in their favourite programming language, assuming they can actually program. Unfortunately, this method is just not accurate enough for small sample sizes. Quoting Hesterberg (emphasis mine):

      Hackers beware: Bootstrap sampling may be harmful

      Bootstrap sampling techniques are very appealing, as they don’t require knowing much about statistics and opaque formulas. Instead, all one needs to do is resample the given data many times, and calculate the desired statistics. Therefore, bootstrapping has been promoted as an easy way of modelling uncertainty to hackers who don’t have much statistical knowledge. For example, the main thesis of the excellent Statistics for Hackers talk by Jake VanderPlas is: “If you can write a for-loop, you can do statistics”. Similar ground was covered by Erik Bernhardsson in The Hacker’s Guide to Uncertainty Estimates, which provides more use cases for bootstrapping (with code examples). However, I’ve learned in the past few weeks that there are quite a few pitfalls in bootstrapping. Much of what I’ve learned is summarised in a paper titled What Teachers Should Know about the Bootstrap: Resampling in the Undergraduate Statistics Curriculum by Tim Hesterberg. I doubt that many hackers would be motivated to read a paper with such a title, so my goal with this post is to make some of my discoveries more accessible to a wider audience. To learn more about the issues raised in this post, it’s worth reading Hesterberg’s paper and other linked resources.

      For quick reference, here’s a summary of the advice in this post:

      • Use an accurate method for estimating confidence intervals
      • Use enough resamples – at least 10-15K
      • Don’t compare confidence intervals visually
      • Ensure that the basic assumptions apply to your situation

      Pitfall #1: Inaccurate confidence intervals

      Confidence intervals are a common way of quantifying the uncertainty in an estimate of a population parameter. The percentile method is one of the simplest bootstrapping approaches for generating confidence intervals. For example, let’s say we have a data sample of size n and we want to estimate a 95% confidence interval for the population mean. We take r bootstrap resamples from the original data sample, where each resample is a sample with replacement of size n. We calculate the mean of each resample and store the means in a sorted array. We then return the 95% confidence interval as the values that fall at the 0.025r and 0.975r indices of the sorted array (i.e., the 2.5% and 97.5% percentiles). The following table shows what the first two resamples may look like for a data sample of size n=5.

Original sample | Resample #1 | Resample #2
10         | 30         | 20
12         | 20         | 20
20         | 12         | 30
30         | 12         | 30
45         | 45         | 30
Mean: 23.4 | Mean: 23.8 | Mean: 26.0
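
As a concrete illustration, here’s a minimal plain-Python sketch of the percentile method described above (function and variable names are mine, not from Hesterberg’s paper):

```python
import random
import statistics

def percentile_bootstrap_ci(data, num_resamples=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean.

    Draw num_resamples resamples with replacement (each the same size as
    data), record each resample's mean, and read the interval off the
    sorted means at the alpha/2 and 1 - alpha/2 quantiles.
    """
    rng = random.Random(seed)
    means = sorted(
        statistics.fmean(rng.choices(data, k=len(data)))
        for _ in range(num_resamples)
    )
    lo = means[int(num_resamples * alpha / 2)]
    hi = means[int(num_resamples * (1 - alpha / 2)) - 1]
    return lo, hi

# The toy data sample of size n=5 from the table above
sample = [10, 12, 20, 30, 45]
lo, hi = percentile_bootstrap_ci(sample)
print(f"95% CI for the mean: [{lo:.1f}, {hi:.1f}]")
```

The simplicity is the point: it really is just a for-loop, a sort, and two array lookups, which is exactly why the method’s accuracy problems are so easy to overlook.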

      The percentile method is nice and simple. Any programmer should be able to easily implement it in their favourite programming language, assuming they can actually program. Unfortunately, this method is just not accurate enough for small sample sizes. Quoting Hesterberg (emphasis mine):

      The sample sizes needed for different intervals to satisfy the “reasonably accurate” (off by no more than 10% on each side) criterion are: n ≥ 101 for the bootstrap t, 220 for the skewness-adjusted t statistic, 2,235 for expanded percentile, 2,383 for percentile, 4,815 for ordinary t (which I have rounded up to 5,000 above), 5,063 for t with bootstrap standard errors and something over 8,000 for the reverse percentile method.

      In a shorter version of the paper cited above, Hesterberg concludes that:

      In practice, implementing some of the more accurate bootstrap methods is difficult (especially those not described here), and people should use a package rather than attempt this themselves.

      In short, make sure you’re using an accurate method for estimating confidence intervals when dealing with sample sizes of less than a few thousand values. Using a package is a great idea, but unfortunately I don’t know of any Python bootstrapping package that is feature-complete: ARCH and scikits-bootstrap support advanced confidence interval methods but don’t support analysis of two samples of uneven sizes, while bootstrapped works with samples of uneven sizes but only supports the percentile and the reverse percentile method (which Hesterberg found to be even less accurate). If you know of any better Python packages, please let me know! (I don’t use R, but I suspect the situation is better there). Update: ARCH now supports analysis of samples of uneven sizes following an issue I reported. It seems to be the best Python bootstrapping package, so I recommend using it.

      Pitfall #2: Not enough resamples

      Accurate bootstrap estimates require a large number of resamples. Many code snippets use 1,000 resamples, probably because it looks like a large number. However, seeming large isn’t enough. Quoting Hesterberg again:

      For both the bootstrap and permutation tests, the number of resamples needs to be 15,000 or more, for 95% probability that simulation-based one-sided levels fall within 10% of the true values, for 95% intervals and 5% tests. I recommend r = 10,000 for routine use, and more when accuracy matters.

      […]

      We want decisions to depend on the data, not random variation in the Monte Carlo implementation. We used r = 500,000 in the Verizon project.

That’s right, half a million resamples! Accuracy mattered in the Verizon case, as the results of the analysis determined whether large penalties were paid or not. In short, use at least 10,000 to 15,000 resamples to be safe. Don’t use 1,000.
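
To get a feel for the Monte Carlo noise Hesterberg describes, one can rerun the same percentile estimate with different random seeds and watch how much the interval’s upper endpoint wanders for a small versus a large r. A rough sketch (the specific r values, seed counts, and function names are my own arbitrary choices):

```python
import random
import statistics

def upper_endpoint(data, num_resamples, seed):
    """Upper end of a 95% percentile bootstrap CI for the mean of data."""
    rng = random.Random(seed)
    means = sorted(
        statistics.fmean(rng.choices(data, k=len(data)))
        for _ in range(num_resamples)
    )
    return means[int(num_resamples * 0.975) - 1]

sample = [10, 12, 20, 30, 45]

# How much does the reported endpoint move between reruns that differ only
# in their random seed? Compare the spread for a small r and a large r.
spread_small = statistics.stdev(upper_endpoint(sample, 100, seed) for seed in range(30))
spread_large = statistics.stdev(upper_endpoint(sample, 10_000, seed) for seed in range(30))
print(f"Endpoint spread across reruns: r=100: {spread_small:.2f}, r=10,000: {spread_large:.2f}")
```

The endpoint jitters far more between reruns at r=100 than at r=10,000, so with few resamples, two analysts running the same code on the same data can reach different conclusions purely due to simulation noise.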

      Pitfall #3: Comparison of single-sample confidence intervals

Confidence intervals are commonly used to decide if the difference between two samples is statistically significant. Bootstrapping provides a straightforward way of estimating confidence intervals without making assumptions about the way the data was generated. For example, given two samples, we can obtain confidence intervals for the mean of each sample and plot the two intervals side by side. The temptation is to conclude that the difference isn’t significant whenever the intervals overlap, but this is a mistake: two single-sample intervals can overlap even when the difference between the samples is statistically significant. The sounder approach is to estimate a confidence interval for the difference itself.
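
Rather than eyeballing two separate intervals, the difference in means can be bootstrapped directly: resample each group independently (preserving each group’s size, so uneven sizes are fine) and take percentiles of the resampled differences. A sketch with made-up data (the names and values are mine, and the percentile method’s accuracy caveats from Pitfall #1 still apply):

```python
import random
import statistics

def bootstrap_diff_ci(sample_a, sample_b, num_resamples=10_000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for mean(sample_a) - mean(sample_b).

    Each group is resampled independently and keeps its own size, so the
    two samples don't need to be the same length.
    """
    rng = random.Random(seed)
    diffs = sorted(
        statistics.fmean(rng.choices(sample_a, k=len(sample_a)))
        - statistics.fmean(rng.choices(sample_b, k=len(sample_b)))
        for _ in range(num_resamples)
    )
    lo = diffs[int(num_resamples * alpha / 2)]
    hi = diffs[int(num_resamples * (1 - alpha / 2)) - 1]
    return lo, hi

# Hypothetical measurements from a control and a treatment group
control = [12.1, 9.8, 11.4, 10.2, 13.0, 9.5, 11.9, 10.8]
treatment = [13.5, 12.2, 14.1, 11.8, 12.9, 13.7]
lo, hi = bootstrap_diff_ci(treatment, control)
print(f"95% CI for the difference in means: [{lo:.2f}, {hi:.2f}]")
```

If the resulting interval excludes zero, the difference is significant at the chosen level; no visual comparison of per-sample intervals is needed.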

      Plumbing, Decisions, and Automation: De-hyping Data & AI

[Image: contrasting an amateur and a professional otter; the amateur asks about tools, the professional asks about plumbing, decisions, and automation]

      Data & AI health is hard to define. Recently, it occurred to me that its essence can be distilled with three questions:

      1. Plumbing: What’s the state of your data engineering lifecycles?
      2. Decisions: How do you use descriptive, predictive, and causal modelling to support decisions?
      3. Automation: How do you use AI to automate processes?

      These questions help identify gaps and opportunities. While each question focuses on the present state, it’s natural to follow up with plans for a brighter future.

      In practice, you would go deep on each area. Each question is a door that leads to a corridor with many more doors.

      Amateurs versus professionals

      If you’ve ever worked with data, you’d have a sense of what amateur and professional answers to the above questions may look like. In practice, answers are multifaceted and fall on a continuum. But here are some simplified examples from each end of the continuum:

• Plumbing – Amateur: rudimentary pipelines, manually-populated spreadsheets. Professional: all necessary data is trustworthy and available on tap.
• Decisions – Amateur: relying on one-off charts and models, along with the intuition of HiPPOs (highest-paid persons’ opinions). Professional: relying on relevant data and modelling efforts that are proportional to the gravity of each decision.
• Automation – Amateur: superficial use of off-the-shelf tools. Professional: deep, mindful integration of tech to replace manual work where it delivers the most value.

      Going down the rabbit hole

      The three areas pretty much define my career, but there is always much more to learn. The main message of this post is that little has changed since Harrington Emerson uttered these words in 1911:

      As to methods, there may be a million and then some, but principles are few. The person who grasps principles can successfully select their own methods. The person who tries methods, ignoring principles, is sure to have trouble.

      (OK, one thing did change – Emerson used man rather than person, but I fixed it for him.)

      You can explore further with these posts:

      1. Plumbing: Fully understanding the data engineering lifecycle is more important than mastering a single tool.
      2. Decisions: According to my 2018 definition, this is what data science is all about. There’s endless depth to building descriptive, predictive, and causal models. But the key to rising above tool hype is understanding the why of data science, which is to support decisions.
      3. Automation: The term AI is around peak hype right now. This makes it easy for cynics to dismiss the over-excited claims of AI proponents. Avoid cynicism – simply think of AI as automation and understand that relentless but mindful automation is key to success in our world.

      More questions to probe the Data-to-AI health of startups

      This post is a slight detour from the series on my Data-to-AI Health Check for Startups. I figured it’s a valuable detour since I now see the triad of Plumbing, Decisions, and Automation as the essence of Data & AI health for any organisation.

      Previous posts in the series:

      You can download a guide containing all the questions as a PDF. I’m still planning to cover Processes & Project Management next – hopefully I won’t get detoured again. Feedback is always welcome!

        Causal inference resources

        This is a list of some causal inference resources, which I update from time to time. You can also check out my posts on causal inference and A/B testing.

Books:

• Causal Inference: What if by Miguel Hernán and Jamie Robins: The most practical book I’ve read. Highly recommended.
• Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing by Ron Kohavi, Diane Tang, and Ya Xu: Building on the authors’ decades of industry experience, this is pretty much the bible of online experiments, which is how causal inference is often done in practice.
• Why: A Guide to Finding and Using Causes by Samantha Kleinberg: A high-level intro to the topic. I discussed highlights in Why you should stop worrying about deep learning and deepen your understanding of causality instead.
• Causality, Probability, and Time by Samantha Kleinberg: More technical than Kleinberg’s other book. As the title suggests, the element of time is central to the methods presented in the book. However, I’m still unsure about the practicality of those methods on real data. See my post Diving deeper into causality: Pearl, Kleinberg, Hill, and untested assumptions for more details.
• Causal Inference in Statistics: A Primer by Judea Pearl, Madelyn Glymour, and Nicholas P. Jewell: A fairly accessible introduction to Judea Pearl’s work. I didn’t find it that practical, but I believe it helped me understand the graphical modelling parts of Causal Inference by Hernán and Robins.
• Elements of Causal Inference: Foundations and Learning Algorithms by Jonas Peters, Dominik Janzing, and Bernhard Schölkopf: The name of the book is an obvious reference to the classic book The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani, and Jerome Friedman. Unfortunately, the Elements of Causal Inference isn’t as widely applicable as Hastie et al.’s book – it contains some interesting ideas, but it appears that algorithms for causal learning from data with minimal assumptions aren’t yet scalable enough for practical use. This will probably change in the future.
• Mostly Harmless Econometrics by Joshua D. Angrist and Jörn-Steffen Pischke: I started reading this book on my Kindle and was put off by some formatting issues. It also seemed like a less-general version of Pearl’s work. I may get back to it one day.
• Causality: Models, Reasoning, and Inference by Judea Pearl: I haven’t read it, and I doubt it’d be very practical given the opinions of people who have. But maybe I’ll get to it one day.
• The Book of Why: The New Science of Cause and Effect by Judea Pearl and Dana Mackenzie: An accessible overview of the field, focusing on Pearl’s contributions, but with plenty of historical background. Worth reading to get excited about the causal revolution.
• Causal Machine Learning by Robert Osazuwa Ness: Still a draft as of September 2022, but it looks promising.

Articles:

Courses:


          Deep learning resources

          This page summarises the deep learning resources I’ve consulted in my album cover classification project.

Tutorials and blog posts

• Convolutional Neural Networks for Visual Recognition Stanford course notes: an excellent resource, very up-to-date and useful, despite still being a work in progress
• DeepLearning.net’s Theano-based tutorials: not as up-to-date as the Stanford course notes, but still a good introduction to some of the theory and general Theano usage
• Lasagne’s documentation and tutorials: still a bit lacking, but good when you know what you’re looking for
• lasagne4newbs: Lasagne’s convnet example with richer comments
• Using convolutional neural nets to detect facial keypoints tutorial: the resource that made me want to use Lasagne
• Classifying plankton with deep neural networks: an epic post, which I found while looking for Lasagne examples
• Various Wikipedia pages: a bit disappointing – the above resources are much better

Papers

• Adam: a method for stochastic optimization (Kingma and Ba, 2015): an improvement over SGD with Nesterov momentum, AdaGrad and RMSProp, which I found to be useful in practice
• Algorithms for Hyper-Parameter Optimization (Bergstra et al., 2011): the work behind Hyperopt – pretty useful stuff, not only for deep learning
• Convolutional Neural Networks at Constrained Time Cost (He and Sun, 2014): interesting experimental work on the tradeoffs between number of filters, filter sizes, and depth – deeper is better (but with diminishing returns); smaller filter sizes are better; delayed subsampling and spatial pyramid pooling are helpful
• Deep Learning in Neural Networks: An Overview (Schmidhuber, 2014): 88 pages and 888 references (35 content pages) – good for finding references, but a bit hard to follow; not so good for understanding how the various methods work and how to use or implement them
• Going deeper with convolutions (Szegedy et al., 2014): the GoogLeNet paper – interesting and compelling results, especially given the improvement in performance while reducing computational complexity
• ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky et al., 2012): the classic paper that arguably started (or significantly boosted) the recent buzz around deep learning – many interesting ideas; fairly accessible
• On the importance of initialization and momentum in deep learning (Sutskever et al., 2013): applying Nesterov momentum to deep learning – good read, simple concept, interesting results
• Random Search for Hyper-Parameter Optimization (Bergstra and Bengio, 2012): very compelling reasoning and experiments showing that random search outperforms grid search in many cases
• Recognizing Image Style (Karayev et al., 2014): identifying image style, which is similar to album genre – found that using models pretrained on ImageNet yielded the best results in some cases
• Very deep convolutional networks for large-scale image recognition (Simonyan and Zisserman, 2014): the VGGNet paper – interesting experiments and architectures – deep and homogeneous
• Visualizing and Understanding Convolutional Networks (Zeiler and Fergus, 2013): interesting work on visualisation, but I’ll need to apply it to understand it better

parasocial relationshipshttps://yanirseroussi.com/til/2024/01/08/the-power-of-parasocial-relationships/Mon, 08 Jan 2024 06:00:00 +0000https://yanirseroussi.com/til/2024/01/08/the-power-of-parasocial-relationships/Repeated exposure to media personas creates relationships that help justify premium fees.Positioning is a common problem for data scientistshttps://yanirseroussi.com/til/2023/12/18/positioning-is-a-common-problem-for-data-scientists/Mon, 18 Dec 2023 00:30:00 +0000https://yanirseroussi.com/til/2023/12/18/positioning-is-a-common-problem-for-data-scientists/With the commodification of data scientists, the problem of positioning has become more common: My takeaways from Genevieve Hayes interviewing Jonathan Stark.Transfer learning applies to energy market biddinghttps://yanirseroussi.com/til/2023/12/14/transfer-learning-applies-to-energy-market-bidding/Thu, 14 Dec 2023 00:15:00 +0000https://yanirseroussi.com/til/2023/12/14/transfer-learning-applies-to-energy-market-bidding/An interesting approach to bidding of energy storage assets, showing that training on New York data is transferable to Queensland.Supporting volunteer monitoring of marine biodiversity with modern web and data toolshttps://yanirseroussi.com/2023/11/29/supporting-volunteer-monitoring-of-marine-biodiversity-with-modern-web-and-data-tools/Wed, 29 Nov 2023 02:00:00 +0000https://yanirseroussi.com/2023/11/29/supporting-volunteer-monitoring-of-marine-biodiversity-with-modern-web-and-data-tools/Summarising the work Uri Seroussi and I did to improve Reef Life Survey&rsquo;s Reef Species of the World app.Our Blue Machine is changing, but we are not helplesshttps://yanirseroussi.com/til/2023/11/28/our-blue-machine-is-changing-but-we-are-not-helpless/Tue, 28 Nov 2023 06:40:00 +0000https://yanirseroussi.com/til/2023/11/28/our-blue-machine-is-changing-but-we-are-not-helpless/One of my many highlights from Helen Czerski&rsquo;s Blue Machine.You don't need a proprietary API for static 
mapshttps://yanirseroussi.com/til/2023/11/21/you-dont-need-a-proprietary-api-for-static-maps/Tue, 21 Nov 2023 06:00:00 +0000https://yanirseroussi.com/til/2023/11/21/you-dont-need-a-proprietary-api-for-static-maps/For many use cases, libraries like cartopy are better than the likes of Mapbox and Google Maps.Lessons from reluctant data engineeringhttps://yanirseroussi.com/2023/10/25/lessons-from-reluctant-data-engineering/Wed, 25 Oct 2023 04:45:00 +0000https://yanirseroussi.com/2023/10/25/lessons-from-reluctant-data-engineering/Video and summary of a talk I gave at DataEngBytes Brisbane on what I learned from doing data engineering as part of every data science role I had.Artificial intelligence was a marketing term all along – just call it automationhttps://yanirseroussi.com/til/2023/10/06/artificial-intelligence-was-a-marketing-term-all-along-just-call-it-automation/Fri, 06 Oct 2023 05:00:00 +0000https://yanirseroussi.com/til/2023/10/06/artificial-intelligence-was-a-marketing-term-all-along-just-call-it-automation/Replacing &lsquo;artificial intelligence&rsquo; with &lsquo;automation&rsquo; is a useful trick for cutting through the hype.The lines between solo consulting and product building are blurryhttps://yanirseroussi.com/til/2023/09/25/the-lines-between-solo-consulting-and-product-building-are-blurry/Mon, 25 Sep 2023 00:00:00 +0000https://yanirseroussi.com/til/2023/09/25/the-lines-between-solo-consulting-and-product-building-are-blurry/It turns out that problems like finding a niche and defining the ideal clients are key to any solo business.Google's Rules of Machine Learning still apply in the age of large language modelshttps://yanirseroussi.com/til/2023/09/21/googles-rules-of-machine-learning-still-apply-in-the-age-of-large-language-models/Thu, 21 Sep 2023 21:30:00 +0000https://yanirseroussi.com/til/2023/09/21/googles-rules-of-machine-learning-still-apply-in-the-age-of-large-language-models/Despite the excitement around large language models, building with 
machine learning remains an engineering problem with established best practices.My rediscovery of quiet writing on the open webhttps://yanirseroussi.com/2023/08/28/my-rediscovery-of-quiet-writing-on-the-open-web/Mon, 28 Aug 2023 05:30:00 +0000https://yanirseroussi.com/2023/08/28/my-rediscovery-of-quiet-writing-on-the-open-web/Reflections on publishing on this website: Writing publicly to share thoughts and documentation beats chasing views and likes.The Minimalist Entrepreneur is too prescriptive for mehttps://yanirseroussi.com/til/2023/08/21/the-minimalist-entrepreneur-is-too-prescriptive-for-me/Mon, 21 Aug 2023 03:15:00 +0000https://yanirseroussi.com/til/2023/08/21/the-minimalist-entrepreneur-is-too-prescriptive-for-me/While I found the story of Gumroad interesting, The Minimalist Entrepreneur seems to over-generalise from the founder&rsquo;s experience.Revisiting Start Small, Stay Small in 2023 (Chapter 2)https://yanirseroussi.com/til/2023/08/17/revisiting-start-small-stay-small-in-2023-chapter-2/Thu, 17 Aug 2023 07:45:00 +0000https://yanirseroussi.com/til/2023/08/17/revisiting-start-small-stay-small-in-2023-chapter-2/A summary of the second chapter of Rob Walling&rsquo;s Start Small, Stay Small, along with my thoughts &amp; reflections.Revisiting Start Small, Stay Small in 2023 (Chapter 1)https://yanirseroussi.com/til/2023/08/16/revisiting-start-small-stay-small-in-2023-chapter-1/Wed, 16 Aug 2023 05:45:00 +0000https://yanirseroussi.com/til/2023/08/16/revisiting-start-small-stay-small-in-2023-chapter-1/A summary of the first chapter of Rob Walling&rsquo;s Start Small, Stay Small, along with my thoughts &amp; reflections.Email notifications on public GitHub commitshttps://yanirseroussi.com/til/2023/08/14/email-notifications-on-public-github-commits/Mon, 14 Aug 2023 05:15:00 +0000https://yanirseroussi.com/til/2023/08/14/email-notifications-on-public-github-commits/GitHub publishes an Atom feed, which means you can use any RSS reader to follow commits.The rule of 
thirds can probably be ignoredhttps://yanirseroussi.com/til/2023/08/11/the-rule-of-thirds-can-probably-be-ignored/Fri, 11 Aug 2023 03:15:00 +0000https://yanirseroussi.com/til/2023/08/11/the-rule-of-thirds-can-probably-be-ignored/Turns out that the rule of thirds for composing visuals may not be that important.Using YubiKey for SSH accesshttps://yanirseroussi.com/til/2023/07/23/using-yubikey-for-ssh-access/Sun, 23 Jul 2023 00:07:15 +0000https://yanirseroussi.com/til/2023/07/23/using-yubikey-for-ssh-access/Some pointers for setting up SSH access with YubiKey on Ubuntu 22.04.Making a TIL section with Hugo and PaperModhttps://yanirseroussi.com/til/2023/07/17/making-a-til-section-with-hugo-and-papermod/Mon, 17 Jul 2023 00:06:15 +0000https://yanirseroussi.com/til/2023/07/17/making-a-til-section-with-hugo-and-papermod/How I added a Today I Learned section to my Hugo site with the PaperMod theme.You can't save timehttps://yanirseroussi.com/til/2023/07/11/you-cant-save-time/Tue, 11 Jul 2023 00:00:00 +0000https://yanirseroussi.com/til/2023/07/11/you-cant-save-time/Time can be spent doing different activities, but it can&rsquo;t be stored and saved for later.Was data science a failure mode of software engineering?https://yanirseroussi.com/2023/06/30/was-data-science-a-failure-mode-of-software-engineering/Fri, 30 Jun 2023 00:06:30 +0000https://yanirseroussi.com/2023/06/30/was-data-science-a-failure-mode-of-software-engineering/Yes, data science projects have suffered from classic software engineering mistakes, but the field is maturing with the rise of new engineering roles.How hackable are automated coding assessments?https://yanirseroussi.com/2023/05/26/how-hackable-are-automated-coding-assessments/Fri, 26 May 2023 00:03:00 +0000https://yanirseroussi.com/2023/05/26/how-hackable-are-automated-coding-assessments/Exploring the hackability of speed-based coding tests, using CodeSignal&rsquo;s Industry Coding Framework as a case study.Remaining relevant as a small language 
modelhttps://yanirseroussi.com/2023/04/21/remaining-relevant-as-a-small-language-model/Fri, 21 Apr 2023 00:06:30 +0000https://yanirseroussi.com/2023/04/21/remaining-relevant-as-a-small-language-model/Bing Chat recently quipped that humans are small language models. Here are some of my thoughts on how we small language models can remain relevant (for now).ChatGPT is transformative AIhttps://yanirseroussi.com/2022/12/11/chatgpt-is-transformative-ai/Sun, 11 Dec 2022 00:00:00 +0000https://yanirseroussi.com/2022/12/11/chatgpt-is-transformative-ai/My perspective after a week of using ChatGPT: This is a step change in finding distilled information, and it&rsquo;s only the beginning.Causal Machine Learning is off to a good start, despite some issueshttps://yanirseroussi.com/2022/09/12/causal-machine-learning-book-draft-review/Mon, 12 Sep 2022 02:45:00 +0000https://yanirseroussi.com/2022/09/12/causal-machine-learning-book-draft-review/Reviewing the first three chapters of the book Causal Machine Learning by Robert Osazuwa Ness.The mission matters: Moving to climate tech as a data scientisthttps://yanirseroussi.com/2022/06/06/the-mission-matters-moving-to-climate-tech-as-a-data-scientist/Mon, 06 Jun 2022 00:00:00 +0000https://yanirseroussi.com/2022/06/06/the-mission-matters-moving-to-climate-tech-as-a-data-scientist/Discussing my recent career move into climate tech as a way of doing more to help mitigate dangerous climate change.Building useful machine learning tools keeps getting easier: A fish ID case studyhttps://yanirseroussi.com/2022/03/20/building-useful-machine-learning-tools-keeps-getting-easier-a-fish-id-case-study/Sun, 20 Mar 2022 04:30:00 +0000https://yanirseroussi.com/2022/03/20/building-useful-machine-learning-tools-keeps-getting-easier-a-fish-id-case-study/Lessons learned building a fish ID web app with fast.ai and Streamlit, in an attempt to reduce my fear of missing out on the latest deep learning developments.Analysis strategies in online A/B experiments: 
Intention-to-treat, per-protocol, and other lessons from clinical trialshttps://yanirseroussi.com/2022/01/14/analysis-strategies-in-online-a-b-experiments/Fri, 14 Jan 2022 00:05:40 +0000https://yanirseroussi.com/2022/01/14/analysis-strategies-in-online-a-b-experiments/Epidemiologists analyse clinical trials to estimate the intention-to-treat and per-protocol effects. This post applies their strategies to online experiments.Use your human brain to avoid artificial intelligence disastershttps://yanirseroussi.com/2021/11/22/use-your-human-brain-to-avoid-artificial-intelligence-disasters/Mon, 22 Nov 2021 03:45:00 +0000https://yanirseroussi.com/2021/11/22/use-your-human-brain-to-avoid-artificial-intelligence-disasters/Overview of a talk I gave at a deep learning course, focusing on AI ethics as the need for humans to think about the context and consequences of applying AI.Migrating from WordPress.com to Hugo on GitHub + Cloudflarehttps://yanirseroussi.com/2021/11/10/migrating-from-wordpress-com-to-hugo-on-github-cloudflare/Wed, 10 Nov 2021 06:30:00 +0000https://yanirseroussi.com/2021/11/10/migrating-from-wordpress-com-to-hugo-on-github-cloudflare/My reasons for switching from WordPress.com to Hugo on GitHub + Cloudflare, along with a summary of the solution components and migration process.My work with Automattichttps://yanirseroussi.com/2021/10/07/my-work-with-automattic/Thu, 07 Oct 2021 00:00:00 +0000https://yanirseroussi.com/2021/10/07/my-work-with-automattic/Back-dated meta-post that gathers my posts on Automattic blogs into a summary of the work I&rsquo;ve done with the company.Some highlights from 2020https://yanirseroussi.com/2021/04/05/some-highlights-from-2020/Mon, 05 Apr 2021 06:41:48 +0000https://yanirseroussi.com/2021/04/05/some-highlights-from-2020/Sharing remote teamwork insights, my climate &amp; sustainability activism, Reef Life Survey publications, and progress on Automattic&rsquo;s Experimentation Platform.Many is not enough: Counting simulations to 
bootstrap the right wayhttps://yanirseroussi.com/2020/08/24/many-is-not-enough-counting-simulations-to-bootstrap-the-right-way/Mon, 24 Aug 2020 01:35:17 +0000https://yanirseroussi.com/2020/08/24/many-is-not-enough-counting-simulations-to-bootstrap-the-right-way/Going deeper into correct testing of different methods for bootstrap estimation of confidence intervals.Software commodities are eating interesting data science workhttps://yanirseroussi.com/2020/01/11/software-commodities-are-eating-interesting-data-science-work/Sat, 11 Jan 2020 09:22:35 +0000https://yanirseroussi.com/2020/01/11/software-commodities-are-eating-interesting-data-science-work/Being a data scientist can sometimes feel like a race against software commodities that replace interesting work. What can one do to remain relevant?A day in the life of a remote data scientisthttps://yanirseroussi.com/2019/12/12/a-day-in-the-life-of-a-remote-data-scientist/Wed, 11 Dec 2019 22:06:19 +0000https://yanirseroussi.com/2019/12/12/a-day-in-the-life-of-a-remote-data-scientist/Video of a talk I gave on remote data science work at the Data Science Sydney meetup.Bootstrapping the right way?https://yanirseroussi.com/2019/10/06/bootstrapping-the-right-way/Sun, 06 Oct 2019 06:48:07 +0000https://yanirseroussi.com/2019/10/06/bootstrapping-the-right-way/Video and summary of a talk I gave at YOW! Data on bootstrap estimation of confidence intervals.Hackers beware: Bootstrap sampling may be harmfulhttps://yanirseroussi.com/2019/01/08/hackers-beware-bootstrap-sampling-may-be-harmful/Mon, 07 Jan 2019 21:07:56 +0000https://yanirseroussi.com/2019/01/08/hackers-beware-bootstrap-sampling-may-be-harmful/Bootstrap sampling has been promoted as an easy way of modelling uncertainty to hackers without much statistical knowledge. 
But things aren&rsquo;t that simple.The most practical causal inference book I’ve read (is still a draft)https://yanirseroussi.com/2018/12/24/the-most-practical-causal-inference-book-ive-read-is-still-a-draft/Mon, 24 Dec 2018 02:37:50 +0000https://yanirseroussi.com/2018/12/24/the-most-practical-causal-inference-book-ive-read-is-still-a-draft/Causal Inference by Miguel Hernán and Jamie Robins is a must-read for anyone interested in the area.Reflections on remote data science workhttps://yanirseroussi.com/2018/11/03/reflections-on-remote-data-science-work/Sat, 03 Nov 2018 06:33:13 +0000https://yanirseroussi.com/2018/11/03/reflections-on-remote-data-science-work/Discussing the pluses and minuses of remote work eighteen months after joining Automattic as a data scientist.Defining data science in 2018https://yanirseroussi.com/2018/07/22/defining-data-science-in-2018/Sun, 22 Jul 2018 08:27:43 +0000https://yanirseroussi.com/2018/07/22/defining-data-science-in-2018/Updating my definition of data science to match changes in the field. 
It is now broader than before, but its ultimate goal is still to support decisions.Advice for aspiring data scientists and other FAQshttps://yanirseroussi.com/2017/10/15/advice-for-aspiring-data-scientists-and-other-faqs/Sun, 15 Oct 2017 09:15:25 +0000https://yanirseroussi.com/2017/10/15/advice-for-aspiring-data-scientists-and-other-faqs/Frequently asked questions by visitors to this site, especially around entering the data science field.State of Bandcamp Recommender, Late 2017https://yanirseroussi.com/2017/09/02/state-of-bandcamp-recommender/Sat, 02 Sep 2017 10:19:02 +0000https://yanirseroussi.com/2017/09/02/state-of-bandcamp-recommender/Call for BCRecommender maintainers followed by a decision to shut it down, as I don&rsquo;t have enough time and Bandcamp now offers recommendations.My 10-step path to becoming a remote data scientist with Automattichttps://yanirseroussi.com/2017/07/29/my-10-step-path-to-becoming-a-remote-data-scientist-with-automattic/Sat, 29 Jul 2017 05:39:26 +0000https://yanirseroussi.com/2017/07/29/my-10-step-path-to-becoming-a-remote-data-scientist-with-automattic/I wanted a well-paid data science-y remote job with an established company that offers a good life balance and makes products I care about. 
I got it eventually.Exploring and visualising Reef Life Survey datahttps://yanirseroussi.com/2017/06/03/exploring-and-visualising-reef-life-survey-data/Sat, 03 Jun 2017 00:49:05 +0000https://yanirseroussi.com/2017/06/03/exploring-and-visualising-reef-life-survey-data/Web tools I built to visualise Reef Life Survey data and assist citizen scientists in underwater visual census work.Customer lifetime value and the proliferation of misinformation on the internethttps://yanirseroussi.com/2017/01/08/customer-lifetime-value-and-the-proliferation-of-misinformation-on-the-internet/Sun, 08 Jan 2017 20:02:30 +0000https://yanirseroussi.com/2017/01/08/customer-lifetime-value-and-the-proliferation-of-misinformation-on-the-internet/There&rsquo;s a lot of misleading content on the estimation of customer lifetime value. Here&rsquo;s what I learned about doing it well.Ask Why! Finding motives, causes, and purpose in data sciencehttps://yanirseroussi.com/2016/09/19/ask-why-finding-motives-causes-and-purpose-in-data-science/Mon, 19 Sep 2016 21:28:44 +0000https://yanirseroussi.com/2016/09/19/ask-why-finding-motives-causes-and-purpose-in-data-science/Video and summary of a talk I gave at the Data Science Sydney meetup, about going beyond the what &amp; how of predictive modelling.If you don’t pay attention, data can drive you off a cliffhttps://yanirseroussi.com/2016/08/21/seven-ways-to-be-data-driven-off-a-cliff/Sun, 21 Aug 2016 21:34:17 +0000https://yanirseroussi.com/2016/08/21/seven-ways-to-be-data-driven-off-a-cliff/Seven common mistakes to avoid when working with data, such as ignoring uncertainty and confusing observed and unobserved quantities.Is Data Scientist a useless job title?https://yanirseroussi.com/2016/08/04/is-data-scientist-a-useless-job-title/Thu, 04 Aug 2016 22:26:03 +0000https://yanirseroussi.com/2016/08/04/is-data-scientist-a-useless-job-title/It seems like anyone who touches data can call themselves a data scientist, which makes the title useless. 
The work they do can still be useful, though.Making Bayesian A/B testing more accessiblehttps://yanirseroussi.com/2016/06/19/making-bayesian-ab-testing-more-accessible/Sun, 19 Jun 2016 10:32:15 +0000https://yanirseroussi.com/2016/06/19/making-bayesian-ab-testing-more-accessible/A web tool I built to interpret A/B test results in a Bayesian way, including prior specification, visualisations, and decision rules.Diving deeper into causality: Pearl, Kleinberg, Hill, and untested assumptionshttps://yanirseroussi.com/2016/05/15/diving-deeper-into-causality-pearl-kleinberg-hill-and-untested-assumptions/Sat, 14 May 2016 19:57:03 +0000https://yanirseroussi.com/2016/05/15/diving-deeper-into-causality-pearl-kleinberg-hill-and-untested-assumptions/Discussing the need for untested assumptions and temporality in causal inference. Mostly based on Samantha Kleinberg&rsquo;s Causality, Probability, and Time.The rise of greedy robotshttps://yanirseroussi.com/2016/03/20/the-rise-of-greedy-robots/Sun, 20 Mar 2016 20:33:43 +0000https://yanirseroussi.com/2016/03/20/the-rise-of-greedy-robots/Is artificial/machine intelligence a future threat? 
I argue that it&rsquo;s already here, with greedy robots already dominating our lives.Why you should stop worrying about deep learning and deepen your understanding of causality insteadhttps://yanirseroussi.com/2016/02/14/why-you-should-stop-worrying-about-deep-learning-and-deepen-your-understanding-of-causality-instead/Sun, 14 Feb 2016 11:04:11 +0000https://yanirseroussi.com/2016/02/14/why-you-should-stop-worrying-about-deep-learning-and-deepen-your-understanding-of-causality-instead/Causality is often overlooked but is of much higher relevance to most data scientists than deep learning.The joys of offline data collectionhttps://yanirseroussi.com/2016/01/24/the-joys-of-offline-data-collection/Sun, 24 Jan 2016 00:32:25 +0000https://yanirseroussi.com/2016/01/24/the-joys-of-offline-data-collection/Insights on data collection and machine learning from spending a month sailing, diving, and counting fish with Reef Life Survey.This holiday season, give me real insightshttps://yanirseroussi.com/2015/12/08/this-holiday-season-give-me-real-insights/Tue, 08 Dec 2015 06:57:25 +0000https://yanirseroussi.com/2015/12/08/this-holiday-season-give-me-real-insights/Some companies present raw data or information as &ldquo;insights&rdquo;. 
This post surveys some examples, and discusses how they can be turned into real insights.The hardest parts of data sciencehttps://yanirseroussi.com/2015/11/23/the-hardest-parts-of-data-science/Mon, 23 Nov 2015 04:14:21 +0000https://yanirseroussi.com/2015/11/23/the-hardest-parts-of-data-science/Defining feasible problems and coming up with reasonable ways of measuring solutions is harder than building accurate models or obtaining clean data.Migrating a simple web application from MongoDB to Elasticsearchhttps://yanirseroussi.com/2015/11/04/migrating-a-simple-web-application-from-mongodb-to-elasticsearch/Wed, 04 Nov 2015 03:53:18 +0000https://yanirseroussi.com/2015/11/04/migrating-a-simple-web-application-from-mongodb-to-elasticsearch/Migrating BCRecommender from MongoDB to Elasticsearch made it possible to offer a richer search experience to users at a similar cost, among other benefits.Miscommunicating science: Simplistic models, nutritionism, and the art of storytellinghttps://yanirseroussi.com/2015/10/19/nutritionism-and-the-need-for-complex-models-to-explain-complex-phenomena/Mon, 19 Oct 2015 00:02:32 +0000https://yanirseroussi.com/2015/10/19/nutritionism-and-the-need-for-complex-models-to-explain-complex-phenomena/Nutritionism is a special case of misinterpretation and miscommunication of scientific results – something many data scientists encounter in their work.The wonderful world of recommender systemshttps://yanirseroussi.com/2015/10/02/the-wonderful-world-of-recommender-systems/Fri, 02 Oct 2015 05:25:57 +0000https://yanirseroussi.com/2015/10/02/the-wonderful-world-of-recommender-systems/Giving an overview of the field and common paradigms, and debunking five common myths about recommender systems.You don’t need a data scientist (yet)https://yanirseroussi.com/2015/08/24/you-dont-need-a-data-scientist-yet/Mon, 24 Aug 2015 08:25:30 +0000https://yanirseroussi.com/2015/08/24/you-dont-need-a-data-scientist-yet/Hiring data scientists prematurely is wasteful and 
frustrating. Here are some questions to ask before you hire your first data scientist.Goodbye, Parse.comhttps://yanirseroussi.com/2015/07/31/goodbye-parse-com/Fri, 31 Jul 2015 03:29:50 +0000https://yanirseroussi.com/2015/07/31/goodbye-parse-com/Migrating my web apps away from Parse.com due to reliability issues. Self-hosting is a better solution.Learning about deep learning through album cover classificationhttps://yanirseroussi.com/2015/07/06/learning-about-deep-learning-through-album-cover-classification/Mon, 06 Jul 2015 22:21:42 +0000https://yanirseroussi.com/2015/07/06/learning-about-deep-learning-through-album-cover-classification/Progress on my album cover classification project, highlighting lessons that would be useful to others who are getting started with deep learning.Deep learning resourceshttps://yanirseroussi.com/deep-learning-resources/Mon, 06 Jul 2015 00:38:44 +0000https://yanirseroussi.com/deep-learning-resources/This page summarises the deep learning resources I&rsquo;ve consulted in my album cover classification project. 
Tutorials and blog posts Convolutional Neural Networks for Visual Recognition Stanford course notes: an excellent resource, very up-to-date and useful, despite still being a work in progress DeepLearning.net&rsquo;s Theano-based tutorials: not as up-to-date as the Stanford course notes, but still a good introduction to some of the theory and general Theano usage Lasagne&rsquo;s documentation and tutorials: still a bit lacking, but good when you know what you&rsquo;re looking for lasagne4newbs: Lasagne&rsquo;s convnet example with richer comments Using convolutional neural nets to detect facial keypoints tutorial: the resource that made me want to use Lasagne Classifying plankton with deep neural networks: an epic post, which I found while looking for Lasagne examples Various Wikipedia pages: a bit disappointing – the above resources are much better Papers Adam: a method for stochastic optimization (Kingma and Ba, 2015): an improvement over SGD with Nesterov momentum, AdaGrad and RMSProp, which I found to be useful in practice Algorithms for Hyper-Parameter Optimization (Bergstra et al.Hopping on the deep learning bandwagonhttps://yanirseroussi.com/2015/06/06/hopping-on-the-deep-learning-bandwagon/Sat, 06 Jun 2015 05:00:22 +0000https://yanirseroussi.com/2015/06/06/hopping-on-the-deep-learning-bandwagon/To become proficient at solving data science problems, you need to get your hands dirty. 
Here, I used album cover classification to learn about deep learning.First steps in data science: author-aware sentiment analysishttps://yanirseroussi.com/2015/05/02/first-steps-in-data-science-author-aware-sentiment-analysis/Sat, 02 May 2015 08:31:10 +0000https://yanirseroussi.com/2015/05/02/first-steps-in-data-science-author-aware-sentiment-analysis/I became a data scientist by doing a PhD, but the same steps can be followed without a formal education program.My divestment from fossil fuelshttps://yanirseroussi.com/2015/04/24/my-divestment-from-fossil-fuels/Fri, 24 Apr 2015 00:19:36 +0000https://yanirseroussi.com/2015/04/24/my-divestment-from-fossil-fuels/Recent choices I&rsquo;ve made to reduce my exposure to fossil fuels, including practical steps that can be taken by Australians and generally applicable lessons.My PhD workhttps://yanirseroussi.com/phd-work/Mon, 30 Mar 2015 03:23:33 +0000https://yanirseroussi.com/phd-work/An overview of my PhD in data science / artificial intelligence. 
Thesis title: Text Mining and Rating Prediction with Topical User Models.The long road to a lifestyle businesshttps://yanirseroussi.com/2015/03/22/the-long-road-to-a-lifestyle-business/Sun, 22 Mar 2015 09:43:47 +0000https://yanirseroussi.com/2015/03/22/the-long-road-to-a-lifestyle-business/Progress since leaving my last full-time job and setting out on an independent path that includes data science consulting and work on my own projects.Learning to rank for personalised search (Yandex Search Personalisation – Kaggle Competition Summary – Part 2)https://yanirseroussi.com/2015/02/11/learning-to-rank-for-personalised-search-yandex-search-personalisation-kaggle-competition-summary-part-2/Wed, 11 Feb 2015 06:34:17 +0000https://yanirseroussi.com/2015/02/11/learning-to-rank-for-personalised-search-yandex-search-personalisation-kaggle-competition-summary-part-2/My team&rsquo;s solution to the Yandex Search Personalisation competition (finished 9th out of 194 teams).Is thinking like a search engine possible? 
(Yandex search personalisation – Kaggle competition summary – part 1)https://yanirseroussi.com/2015/01/29/is-thinking-like-a-search-engine-possible-yandex-search-personalisation-kaggle-competition-summary-part-1/Thu, 29 Jan 2015 10:37:39 +0000https://yanirseroussi.com/2015/01/29/is-thinking-like-a-search-engine-possible-yandex-search-personalisation-kaggle-competition-summary-part-1/Insights on search personalisation and SEO from participating in a Kaggle competition (finished 9th out of 194 teams).Automating Parse.com bulk data importshttps://yanirseroussi.com/2015/01/15/automating-parse-com-bulk-data-imports/Thu, 15 Jan 2015 04:41:16 +0000https://yanirseroussi.com/2015/01/15/automating-parse-com-bulk-data-imports/A script for importing data into the Parse backend-as-a-service.Stochastic Gradient Boosting: Choosing the Best Number of Iterationshttps://yanirseroussi.com/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/Mon, 29 Dec 2014 02:30:06 +0000https://yanirseroussi.com/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/Exploring an approach to choosing the optimal number of iterations in stochastic gradient boosting, following a bug I found in scikit-learn.SEO: Mostly about showing up?https://yanirseroussi.com/2014/12/15/seo-mostly-about-showing-up/Mon, 15 Dec 2014 04:25:25 +0000https://yanirseroussi.com/2014/12/15/seo-mostly-about-showing-up/Increasing SEO traffic to BCRecommender by adding content and opening up more pages for crawling. 
It turns out that thin content is better than no content.Fitting noise: Forecasting the sale price of bulldozers (Kaggle competition summary)https://yanirseroussi.com/2014/11/19/fitting-noise-forecasting-the-sale-price-of-bulldozers-kaggle-competition-summary/Wed, 19 Nov 2014 09:17:34 +0000https://yanirseroussi.com/2014/11/19/fitting-noise-forecasting-the-sale-price-of-bulldozers-kaggle-competition-summary/Summary of a Kaggle competition to forecast bulldozer sale price, where I finished 9th out of 476 teams.BCRecommender Traction Updatehttps://yanirseroussi.com/2014/11/05/bcrecommender-traction-update/Wed, 05 Nov 2014 02:29:35 +0000https://yanirseroussi.com/2014/11/05/bcrecommender-traction-update/Update on BCRecommender traction using three channels: blogger outreach, search engine optimisation, and content marketing.What is data science?https://yanirseroussi.com/2014/10/23/what-is-data-science/Thu, 23 Oct 2014 03:22:08 +0000https://yanirseroussi.com/2014/10/23/what-is-data-science/Data science has been a hot term in the past few years. Still, there isn&rsquo;t a single definition of the field. 
This post discusses my favourite definition.Greek Media Monitoring Kaggle competition: My approachhttps://yanirseroussi.com/2014/10/07/greek-media-monitoring-kaggle-competition-my-approach/Tue, 07 Oct 2014 03:21:35 +0000https://yanirseroussi.com/2014/10/07/greek-media-monitoring-kaggle-competition-my-approach/Summary of my approach to the Greek Media Monitoring Kaggle competition, where I finished 6th out of 120 teams.Applying the Traction Book’s Bullseye framework to BCRecommenderhttps://yanirseroussi.com/2014/09/24/applying-the-traction-books-bullseye-framework-to-bcrecommender/Wed, 24 Sep 2014 04:57:39 +0000https://yanirseroussi.com/2014/09/24/applying-the-traction-books-bullseye-framework-to-bcrecommender/Ranking 19 channels with the goal of getting traction for BCRecommender.Bandcamp recommendation and discovery algorithmshttps://yanirseroussi.com/2014/09/19/bandcamp-recommendation-and-discovery-algorithms/Fri, 19 Sep 2014 14:26:55 +0000https://yanirseroussi.com/2014/09/19/bandcamp-recommendation-and-discovery-algorithms/The recommendation backend for my BCRecommender service for personalised Bandcamp music discovery.Building a recommender system on a shoestring budget (or: BCRecommender part 2 – general system layout)https://yanirseroussi.com/2014/09/07/building-a-recommender-system-on-a-shoestring-budget/Sun, 07 Sep 2014 10:48:44 +0000https://yanirseroussi.com/2014/09/07/building-a-recommender-system-on-a-shoestring-budget/Iterating on my BCRecommender service with the goal of keeping costs low while providing a valuable music recommendation service.Building a Bandcamp recommender system (part 1 – motivation)https://yanirseroussi.com/2014/08/30/building-a-bandcamp-recommender-system-part-1-motivation/Sat, 30 Aug 2014 08:11:38 +0000https://yanirseroussi.com/2014/08/30/building-a-bandcamp-recommender-system-part-1-motivation/My motivation behind building BCRecommender, a free recommendation &amp; discovery service for Bandcamp music.How to (almost) win Kaggle 
competitionshttps://yanirseroussi.com/2014/08/24/how-to-almost-win-kaggle-competitions/Sun, 24 Aug 2014 12:40:53 +0000https://yanirseroussi.com/2014/08/24/how-to-almost-win-kaggle-competitions/Summary of a talk I gave at the Data Science Sydney meetup with ten tips on almost-winning Kaggle competitions.Data’s hierarchy of needshttps://yanirseroussi.com/2014/08/17/datas-hierarchy-of-needs/Sun, 17 Aug 2014 13:09:30 +0000https://yanirseroussi.com/2014/08/17/datas-hierarchy-of-needs/Discussing the hierarchy of needs proposed by Jay Kreps. Key takeaway: Data-driven algorithms &amp; insights can only be as good as the underlying data.Kaggle competition tips and summarieshttps://yanirseroussi.com/kaggle/Sat, 05 Apr 2014 23:46:10 +0000https://yanirseroussi.com/kaggle/Pointers to all my Kaggle advice posts and competition summaries.Kaggle beginner tipshttps://yanirseroussi.com/2014/01/19/kaggle-beginner-tips/Sun, 19 Jan 2014 10:34:28 +0000https://yanirseroussi.com/2014/01/19/kaggle-beginner-tips/First post! An email I sent to members of the Data Science Sydney Meetup with tips on how to get started with Kaggle competitions.About Yanir: Startup Data & AI Consultanthttps://yanirseroussi.com/about/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/about/About Yanir Seroussi, a hands-on data tech lead with over a decade of experience. Yanir helps climate/nature tech startups ship data-intensive solutions.Book a free fifteen-minute callhttps://yanirseroussi.com/free-intro-call/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/free-intro-call/Booking form for a quick intro call with Yanir Seroussi.Causal inference resourceshttps://yanirseroussi.com/causal-inference-resources/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/causal-inference-resources/This is a list of some causal inference resources, which I update from time to time. You can also check out my posts on causal inference and A/B testing. 
-Books: -Causal Inference: What if by Miguel Hernán and Jamie Robins: The most practical book I&rsquo;ve read. Highly recommended. Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing by Ron Kohavi, Diane Tang, and Ya Xu: Building on the authors&rsquo; decades of industry experience, this is pretty much the bible of online experiments, which is how causal inference is often done in practice.Free Guide: Data-to-AI Health Check for Startupshttps://yanirseroussi.com/data-to-ai-health-check/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/data-to-ai-health-check/Download a free PDF guide that helps you assess a startup&rsquo;s Data-to-AI health by probing eight key areas.Helping climate & nature tech startups ship data-intensive solutionshttps://yanirseroussi.com/consult/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/consult/Consulting for climate &amp; nature tech startups: Strategic advice, implementation of Data/AI/ML solutions, and hiring help by an experienced tech leader.Speaking engagements by Yanir: Startup Data & AI Consultanthttps://yanirseroussi.com/talks/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/talks/Yanir Seroussi speaks on data science, artificial intelligence, machine learning, and career journey.Stay in touchhttps://yanirseroussi.com/contact/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/contact/Contact me or subscribe to the mailing list. 
\ No newline at end of file +Yanir Seroussi | Data & AI for Startup Impacthttps://yanirseroussi.com/Recent content on Yanir Seroussi | Data & AI for Startup ImpactHugo -- gohugo.ioen-auText and figures licensed under [CC BY-NC-ND 4.0](https://creativecommons.org/licenses/by-nc-nd/4.0/) by [Yanir Seroussi](https://yanirseroussi.com/about/), except where noted otherwiseMon, 02 Sep 2024 02:30:00 +0000Juggling delivery, admin, and leads: Monthly biz recaphttps://yanirseroussi.com/2024/09/02/juggling-delivery-admin-and-leads-monthly-biz-recap/Mon, 02 Sep 2024 02:30:00 +0000https://yanirseroussi.com/2024/09/02/juggling-delivery-admin-and-leads-monthly-biz-recap/Highlights and lessons from my solo expertise biz, including value pricing, fractional cash flow, and distractions from admin &amp; politics.AI hype, AI bullshit, and the real dealhttps://yanirseroussi.com/2024/08/26/ai-hype-ai-bullshit-and-the-real-deal/Mon, 26 Aug 2024 01:00:00 +0000https://yanirseroussi.com/2024/08/26/ai-hype-ai-bullshit-and-the-real-deal/My views on separating AI hype and bullshit from the real deal. 
The general ideas apply to past and future hype waves in tech.Giving up on the minimum viable data stackhttps://yanirseroussi.com/2024/08/19/giving-up-on-the-minimum-viable-data-stack/Mon, 19 Aug 2024 03:30:00 +0000https://yanirseroussi.com/2024/08/19/giving-up-on-the-minimum-viable-data-stack/Exploring why universal advice on startup data stacks is challenging, and the importance of context-specific decisions in data infrastructure.Keep learning: Your career is never truly donehttps://yanirseroussi.com/2024/08/12/keep-learning-your-career-is-never-truly-done/Mon, 12 Aug 2024 01:30:00 +0000https://yanirseroussi.com/2024/08/12/keep-learning-your-career-is-never-truly-done/Podcast chat on my career journey from software engineering to data science and independent consulting.First year lessons from a solo expertise biz in Data & AIhttps://yanirseroussi.com/2024/08/05/first-year-lessons-from-a-solo-expertise-biz-in-data-and-ai/Mon, 05 Aug 2024 08:45:00 +0000https://yanirseroussi.com/2024/08/05/first-year-lessons-from-a-solo-expertise-biz-in-data-and-ai/Reflections on building a solo expertise business in Data &amp; AI, focusing on climate tech startups. 
Lessons learned from the first year of transition.AI/ML lifecycle models versus real-world messhttps://yanirseroussi.com/2024/07/29/ai-ml-lifecycle-models-versus-real-world-mess/Mon, 29 Jul 2024 06:00:00 +0000https://yanirseroussi.com/2024/07/29/ai-ml-lifecycle-models-versus-real-world-mess/The real world of AI/ML doesn&rsquo;t fit into a neat diagram, so I created another diagram and a maturity heatmap to model the mess.Your first Data-to-AI hire: Run a lovable processhttps://yanirseroussi.com/2024/07/22/your-first-data-to-ai-hire-run-a-lovable-process/Mon, 22 Jul 2024 01:00:00 +0000https://yanirseroussi.com/2024/07/22/your-first-data-to-ai-hire-run-a-lovable-process/Video and key points from the second part of a webinar on a startup&rsquo;s first data hire, covering tips for defining the role and running the process.Learn about Dataland to avoid expensive hiring mistakeshttps://yanirseroussi.com/2024/07/15/learn-about-dataland-to-avoid-expensive-hiring-mistakes/Mon, 15 Jul 2024 05:30:00 +0000https://yanirseroussi.com/2024/07/15/learn-about-dataland-to-avoid-expensive-hiring-mistakes/Video and key points from the first part of a webinar on a startup&rsquo;s first data hire, covering data &amp; AI definitions and high-level recommendations.Exploring an AI product idea with the latest ChatGPT, Claude, and Geminihttps://yanirseroussi.com/2024/07/08/exploring-an-ai-product-idea-with-the-latest-chatgpt-claude-and-gemini/Mon, 08 Jul 2024 02:45:00 +0000https://yanirseroussi.com/2024/07/08/exploring-an-ai-product-idea-with-the-latest-chatgpt-claude-and-gemini/Asking identical questions about my MagicGrantMaker idea yielded near-identical responses from the top chatbot models.Stay alert! 
Security is everyone's responsibilityhttps://yanirseroussi.com/2024/07/01/stay-alert-security-is-everyones-responsibility/Mon, 01 Jul 2024 02:00:00 +0000https://yanirseroussi.com/2024/07/01/stay-alert-security-is-everyones-responsibility/Questions to assess the security posture of a startup, focusing on basic hygiene and handling of sensitive data.Five team-building mistakes, according to Patty McCordhttps://yanirseroussi.com/til/2024/06/26/five-team-building-mistakes-according-to-patty-mccord/Wed, 26 Jun 2024 00:00:00 +0000https://yanirseroussi.com/til/2024/06/26/five-team-building-mistakes-according-to-patty-mccord/Takeaways from an interview with Patty McCord on The Startup Podcast.Is your tech stack ready for data-intensive applications?https://yanirseroussi.com/2024/06/24/is-your-tech-stack-ready-for-data-intensive-applications/Mon, 24 Jun 2024 02:00:00 +0000https://yanirseroussi.com/2024/06/24/is-your-tech-stack-ready-for-data-intensive-applications/Questions to assess the quality of tech stacks and lifecycles, with a focus on artificial intelligence, machine learning, and analytics.Dealing with endless data changeshttps://yanirseroussi.com/til/2024/06/22/dealing-with-endless-data-changes/Sat, 22 Jun 2024 22:50:00 +0000https://yanirseroussi.com/til/2024/06/22/dealing-with-endless-data-changes/Quotes from Demetrios Brinkmann on the relationship between MLOps and DevOps, with MLOps allowing for managing changes that come from data.AI ain't gonna save you from bad datahttps://yanirseroussi.com/2024/06/17/ai-aint-gonna-save-you-from-bad-data/Mon, 17 Jun 2024 02:00:00 +0000https://yanirseroussi.com/2024/06/17/ai-aint-gonna-save-you-from-bad-data/Since we&rsquo;re far from a utopia where data issues are fully handled by AI, this post presents six questions humans can use to assess data projects.The rules of the passion economyhttps://yanirseroussi.com/til/2024/06/12/the-rules-of-the-passion-economy/Wed, 12 Jun 2024 02:50:00 
+0000https://yanirseroussi.com/til/2024/06/12/the-rules-of-the-passion-economy/Summary of the main messages from the book The Passion Economy by Adam Davidson.Startup data health starts with healthy event trackinghttps://yanirseroussi.com/2024/06/10/startup-data-health-starts-with-healthy-event-tracking/Mon, 10 Jun 2024 04:00:00 +0000https://yanirseroussi.com/2024/06/10/startup-data-health-starts-with-healthy-event-tracking/Expanding on the startup health check question of tracking Kukuyeva&rsquo;s five business aspects as wide events.How to avoid startups with poor development processeshttps://yanirseroussi.com/2024/06/03/how-to-avoid-startups-with-poor-development-processes/Mon, 03 Jun 2024 02:45:00 +0000https://yanirseroussi.com/2024/06/03/how-to-avoid-startups-with-poor-development-processes/Questions that prospective data specialists and engineers should ask about development processes before accepting a startup role.Plumbing, Decisions, and Automation: De-hyping Data & AIhttps://yanirseroussi.com/2024/05/27/plumbing-decisions-and-automation-de-hyping-data-and-ai/Mon, 27 May 2024 02:00:00 +0000https://yanirseroussi.com/2024/05/27/plumbing-decisions-and-automation-de-hyping-data-and-ai/Three essential questions to understand where an organisation stands when it comes to Data &amp; AI (with zero hype).Adapting to the economy of algorithmshttps://yanirseroussi.com/til/2024/05/25/adapting-to-the-economy-of-algorithms/Sat, 25 May 2024 00:00:00 +0000https://yanirseroussi.com/til/2024/05/25/adapting-to-the-economy-of-algorithms/Overview of the book The Economy of Algorithms by Marek Kowalkiewicz.Question startup culture before accepting a data-to-AI rolehttps://yanirseroussi.com/2024/05/20/question-startup-culture-before-accepting-a-data-to-ai-role/Mon, 20 May 2024 02:25:00 +0000https://yanirseroussi.com/2024/05/20/question-startup-culture-before-accepting-a-data-to-ai-role/Eight questions that prospective data-to-AI employees should ask about a startup&rsquo;s work 
and data culture.Probing the People aspects of an early-stage startuphttps://yanirseroussi.com/2024/05/13/probing-the-people-aspects-of-an-early-stage-startup/Mon, 13 May 2024 02:00:00 +0000https://yanirseroussi.com/2024/05/13/probing-the-people-aspects-of-an-early-stage-startup/Ten questions that prospective employees should ask about a startup&rsquo;s team, especially for data-centric roles.Business questions to ask before taking a startup data rolehttps://yanirseroussi.com/2024/05/06/business-questions-to-ask-before-taking-a-startup-data-role/Mon, 06 May 2024 04:30:00 +0000https://yanirseroussi.com/2024/05/06/business-questions-to-ask-before-taking-a-startup-data-role/Fourteen questions that prospective employees should ask about a startup&rsquo;s business model and product, especially for data-focused roles.Mentorship and the art of actionable advicehttps://yanirseroussi.com/2024/04/29/mentorship-and-the-art-of-actionable-advice/Mon, 29 Apr 2024 06:30:00 +0000https://yanirseroussi.com/2024/04/29/mentorship-and-the-art-of-actionable-advice/Reflections on what it takes to package expertise and deliver timely, actionable advice outside the context of employee relationships.Assessing a startup's data-to-AI healthhttps://yanirseroussi.com/2024/04/22/assessing-a-startups-data-to-ai-health/Mon, 22 Apr 2024 06:00:00 +0000https://yanirseroussi.com/2024/04/22/assessing-a-startups-data-to-ai-health/Reviewing the areas that should be assessed to determine a startup&rsquo;s opportunities and challenges on the data/AI/ML front.AI does not obviate the need for testing and observabilityhttps://yanirseroussi.com/2024/04/15/ai-does-not-obviate-the-need-for-testing-and-observability/Mon, 15 Apr 2024 05:00:00 +0000https://yanirseroussi.com/2024/04/15/ai-does-not-obviate-the-need-for-testing-and-observability/It&rsquo;s easy to prototype with AI, but production-grade AI apps require even more thorough testing and observability than traditional software.LinkedIn is a teachable 
skillhttps://yanirseroussi.com/til/2024/04/11/linkedin-is-a-teachable-skill/Thu, 11 Apr 2024 01:45:25 +0000https://yanirseroussi.com/til/2024/04/11/linkedin-is-a-teachable-skill/A high-level overview of things I learned from Justin Welsh&rsquo;s LinkedIn Operating System course.My experience as a Data Tech Lead with Work on Climatehttps://yanirseroussi.com/2024/04/08/my-experience-as-a-data-tech-lead-with-work-on-climate/Mon, 08 Apr 2024 02:00:00 +0000https://yanirseroussi.com/2024/04/08/my-experience-as-a-data-tech-lead-with-work-on-climate/The story of how I joined Work on Climate as a volunteer and became its data tech lead, with lessons applied to consulting &amp; fractional work.The data engineering lifecycle is not going anywherehttps://yanirseroussi.com/til/2024/04/05/the-data-engineering-lifecycle-is-not-going-anywhere/Fri, 05 Apr 2024 01:00:00 +0000https://yanirseroussi.com/til/2024/04/05/the-data-engineering-lifecycle-is-not-going-anywhere/My key takeaways from reading Fundamentals of Data Engineering by Joe Reis and Matt Housley.Artificial intelligence, automation, and the art of counting fishhttps://yanirseroussi.com/2024/04/01/artificial-intelligence-automation-and-the-art-of-counting-fish/Mon, 01 Apr 2024 06:00:00 +0000https://yanirseroussi.com/2024/04/01/artificial-intelligence-automation-and-the-art-of-counting-fish/Discussing the use of AI to automate underwater marine surveys as an example of the uneven distribution of technological advancement.Atomic Habits is full of actionable advicehttps://yanirseroussi.com/til/2024/03/12/atomic-habits-is-full-of-actionable-advice/Tue, 12 Mar 2024 06:19:31 +0000https://yanirseroussi.com/til/2024/03/12/atomic-habits-is-full-of-actionable-advice/I put the book to use after the first listen, and will definitely revisit it in the future to form better habits.Questions to consider when using AI for PDF data 
extractionhttps://yanirseroussi.com/2024/03/11/questions-to-consider-when-using-ai-for-pdf-data-extraction/Mon, 11 Mar 2024 00:00:00 +0000https://yanirseroussi.com/2024/03/11/questions-to-consider-when-using-ai-for-pdf-data-extraction/Discussing considerations that arise when attempting to automate the extraction of structured data from PDFs and similar documents.Two types of startup data problemshttps://yanirseroussi.com/2024/03/04/two-types-of-startup-data-problems/Mon, 04 Mar 2024 02:00:00 +0000https://yanirseroussi.com/2024/03/04/two-types-of-startup-data-problems/Classifying startups as ML-centric or non-ML is a helpful exercise to uncover the data challenges they&rsquo;re likely to face.Avoiding AI complexity: First, write no codehttps://yanirseroussi.com/2024/02/26/avoiding-ai-complexity-first-write-no-code/Mon, 26 Feb 2024 01:45:00 +0000https://yanirseroussi.com/2024/02/26/avoiding-ai-complexity-first-write-no-code/Two stories of getting AI functionality to production, which demonstrate the risks inherent in custom development versus starting with a no-code approach.Building your startup's minimum viable data stackhttps://yanirseroussi.com/2024/02/19/building-your-startups-minimum-viable-data-stack/Mon, 19 Feb 2024 00:00:00 +0000https://yanirseroussi.com/2024/02/19/building-your-startups-minimum-viable-data-stack/First post in a series on building a minimum viable data stack for startups, introducing key definitions, components, and considerations.The three Cs of indie consulting: Confidence, Cash, and Connectionshttps://yanirseroussi.com/til/2024/02/17/the-three-cs-of-indie-consulting-confidence-cash-and-connections/Sat, 17 Feb 2024 02:00:00 +0000https://yanirseroussi.com/til/2024/02/17/the-three-cs-of-indie-consulting-confidence-cash-and-connections/Jonathan Stark makes a compelling argument why you should have the three Cs before quitting your job to go solo consulting.Nudging ChatGPT to invent books you have no time to 
readhttps://yanirseroussi.com/2024/02/12/nudging-chatgpt-to-invent-books-you-have-no-time-to-read/Mon, 12 Feb 2024 05:00:00 +0000https://yanirseroussi.com/2024/02/12/nudging-chatgpt-to-invent-books-you-have-no-time-to-read/Getting ChatGPT Plus to elaborate on possible book content and produce a PDF cheatsheet, with the goal of learning about its capabilities.Future software development may require fewer humanshttps://yanirseroussi.com/til/2024/02/06/future-software-development-may-require-fewer-humans/Tue, 06 Feb 2024 06:15:00 +0000https://yanirseroussi.com/til/2024/02/06/future-software-development-may-require-fewer-humans/Reflecting on an interview with Jason Warner, CEO of poolside.Substance over titles: Your first data hire may be a data scientisthttps://yanirseroussi.com/2024/02/05/substance-over-titles-your-first-data-hire-may-be-a-data-scientist/Mon, 05 Feb 2024 02:45:00 +0000https://yanirseroussi.com/2024/02/05/substance-over-titles-your-first-data-hire-may-be-a-data-scientist/Advice for hiring a startup&rsquo;s first data person: match skills to business needs, consider contractors, and get help from data people.New decade, new tagline: Data & AI for Impacthttps://yanirseroussi.com/2024/01/19/new-decade-new-tagline-data-and-ai-for-impact/Fri, 19 Jan 2024 00:00:00 +0000https://yanirseroussi.com/2024/01/19/new-decade-new-tagline-data-and-ai-for-impact/Shifting focus to &lsquo;Data &amp; AI for Impact&rsquo;, with more startup-related content, increased posting frequency, and deeper audience engagement.Psychographic specialisations may work for discipline generalistshttps://yanirseroussi.com/til/2024/01/09/psychographic-specialisations-may-work-for-discipline-generalists/Tue, 09 Jan 2024 03:00:00 +0000https://yanirseroussi.com/til/2024/01/09/psychographic-specialisations-may-work-for-discipline-generalists/When focusing on a market segment defined by personal beliefs, it&rsquo;s often fine to position yourself as a generalist in your craft.The power of 
parasocial relationshipshttps://yanirseroussi.com/til/2024/01/08/the-power-of-parasocial-relationships/Mon, 08 Jan 2024 06:00:00 +0000https://yanirseroussi.com/til/2024/01/08/the-power-of-parasocial-relationships/Repeated exposure to media personas creates relationships that help justify premium fees.Positioning is a common problem for data scientistshttps://yanirseroussi.com/til/2023/12/18/positioning-is-a-common-problem-for-data-scientists/Mon, 18 Dec 2023 00:30:00 +0000https://yanirseroussi.com/til/2023/12/18/positioning-is-a-common-problem-for-data-scientists/With the commodification of data scientists, the problem of positioning has become more common: My takeaways from Genevieve Hayes interviewing Jonathan Stark.Transfer learning applies to energy market biddinghttps://yanirseroussi.com/til/2023/12/14/transfer-learning-applies-to-energy-market-bidding/Thu, 14 Dec 2023 00:15:00 +0000https://yanirseroussi.com/til/2023/12/14/transfer-learning-applies-to-energy-market-bidding/An interesting approach to bidding of energy storage assets, showing that training on New York data is transferable to Queensland.Supporting volunteer monitoring of marine biodiversity with modern web and data toolshttps://yanirseroussi.com/2023/11/29/supporting-volunteer-monitoring-of-marine-biodiversity-with-modern-web-and-data-tools/Wed, 29 Nov 2023 02:00:00 +0000https://yanirseroussi.com/2023/11/29/supporting-volunteer-monitoring-of-marine-biodiversity-with-modern-web-and-data-tools/Summarising the work Uri Seroussi and I did to improve Reef Life Survey&rsquo;s Reef Species of the World app.Our Blue Machine is changing, but we are not helplesshttps://yanirseroussi.com/til/2023/11/28/our-blue-machine-is-changing-but-we-are-not-helpless/Tue, 28 Nov 2023 06:40:00 +0000https://yanirseroussi.com/til/2023/11/28/our-blue-machine-is-changing-but-we-are-not-helpless/One of my many highlights from Helen Czerski&rsquo;s Blue Machine.You don't need a proprietary API for static 
mapshttps://yanirseroussi.com/til/2023/11/21/you-dont-need-a-proprietary-api-for-static-maps/Tue, 21 Nov 2023 06:00:00 +0000https://yanirseroussi.com/til/2023/11/21/you-dont-need-a-proprietary-api-for-static-maps/For many use cases, libraries like cartopy are better than the likes of Mapbox and Google Maps.Lessons from reluctant data engineeringhttps://yanirseroussi.com/2023/10/25/lessons-from-reluctant-data-engineering/Wed, 25 Oct 2023 04:45:00 +0000https://yanirseroussi.com/2023/10/25/lessons-from-reluctant-data-engineering/Video and summary of a talk I gave at DataEngBytes Brisbane on what I learned from doing data engineering as part of every data science role I had.Artificial intelligence was a marketing term all along – just call it automationhttps://yanirseroussi.com/til/2023/10/06/artificial-intelligence-was-a-marketing-term-all-along-just-call-it-automation/Fri, 06 Oct 2023 05:00:00 +0000https://yanirseroussi.com/til/2023/10/06/artificial-intelligence-was-a-marketing-term-all-along-just-call-it-automation/Replacing &lsquo;artificial intelligence&rsquo; with &lsquo;automation&rsquo; is a useful trick for cutting through the hype.The lines between solo consulting and product building are blurryhttps://yanirseroussi.com/til/2023/09/25/the-lines-between-solo-consulting-and-product-building-are-blurry/Mon, 25 Sep 2023 00:00:00 +0000https://yanirseroussi.com/til/2023/09/25/the-lines-between-solo-consulting-and-product-building-are-blurry/It turns out that problems like finding a niche and defining the ideal clients are key to any solo business.Google's Rules of Machine Learning still apply in the age of large language modelshttps://yanirseroussi.com/til/2023/09/21/googles-rules-of-machine-learning-still-apply-in-the-age-of-large-language-models/Thu, 21 Sep 2023 21:30:00 +0000https://yanirseroussi.com/til/2023/09/21/googles-rules-of-machine-learning-still-apply-in-the-age-of-large-language-models/Despite the excitement around large language models, building with 
machine learning remains an engineering problem with established best practices.My rediscovery of quiet writing on the open webhttps://yanirseroussi.com/2023/08/28/my-rediscovery-of-quiet-writing-on-the-open-web/Mon, 28 Aug 2023 05:30:00 +0000https://yanirseroussi.com/2023/08/28/my-rediscovery-of-quiet-writing-on-the-open-web/Reflections on publishing on this website: Writing publicly to share thoughts and documentation beats chasing views and likes.The Minimalist Entrepreneur is too prescriptive for mehttps://yanirseroussi.com/til/2023/08/21/the-minimalist-entrepreneur-is-too-prescriptive-for-me/Mon, 21 Aug 2023 03:15:00 +0000https://yanirseroussi.com/til/2023/08/21/the-minimalist-entrepreneur-is-too-prescriptive-for-me/While I found the story of Gumroad interesting, The Minimalist Entrepreneur seems to over-generalise from the founder&rsquo;s experience.Revisiting Start Small, Stay Small in 2023 (Chapter 2)https://yanirseroussi.com/til/2023/08/17/revisiting-start-small-stay-small-in-2023-chapter-2/Thu, 17 Aug 2023 07:45:00 +0000https://yanirseroussi.com/til/2023/08/17/revisiting-start-small-stay-small-in-2023-chapter-2/A summary of the second chapter of Rob Walling&rsquo;s Start Small, Stay Small, along with my thoughts &amp; reflections.Revisiting Start Small, Stay Small in 2023 (Chapter 1)https://yanirseroussi.com/til/2023/08/16/revisiting-start-small-stay-small-in-2023-chapter-1/Wed, 16 Aug 2023 05:45:00 +0000https://yanirseroussi.com/til/2023/08/16/revisiting-start-small-stay-small-in-2023-chapter-1/A summary of the first chapter of Rob Walling&rsquo;s Start Small, Stay Small, along with my thoughts &amp; reflections.Email notifications on public GitHub commitshttps://yanirseroussi.com/til/2023/08/14/email-notifications-on-public-github-commits/Mon, 14 Aug 2023 05:15:00 +0000https://yanirseroussi.com/til/2023/08/14/email-notifications-on-public-github-commits/GitHub publishes an Atom feed, which means you can use any RSS reader to follow commits.The rule of 
thirds can probably be ignoredhttps://yanirseroussi.com/til/2023/08/11/the-rule-of-thirds-can-probably-be-ignored/Fri, 11 Aug 2023 03:15:00 +0000https://yanirseroussi.com/til/2023/08/11/the-rule-of-thirds-can-probably-be-ignored/Turns out that the rule of thirds for composing visuals may not be that important.Using YubiKey for SSH accesshttps://yanirseroussi.com/til/2023/07/23/using-yubikey-for-ssh-access/Sun, 23 Jul 2023 00:07:15 +0000https://yanirseroussi.com/til/2023/07/23/using-yubikey-for-ssh-access/Some pointers for setting up SSH access with YubiKey on Ubuntu 22.04.Making a TIL section with Hugo and PaperModhttps://yanirseroussi.com/til/2023/07/17/making-a-til-section-with-hugo-and-papermod/Mon, 17 Jul 2023 00:06:15 +0000https://yanirseroussi.com/til/2023/07/17/making-a-til-section-with-hugo-and-papermod/How I added a Today I Learned section to my Hugo site with the PaperMod theme.You can't save timehttps://yanirseroussi.com/til/2023/07/11/you-cant-save-time/Tue, 11 Jul 2023 00:00:00 +0000https://yanirseroussi.com/til/2023/07/11/you-cant-save-time/Time can be spent doing different activities, but it can&rsquo;t be stored and saved for later.Was data science a failure mode of software engineering?https://yanirseroussi.com/2023/06/30/was-data-science-a-failure-mode-of-software-engineering/Fri, 30 Jun 2023 00:06:30 +0000https://yanirseroussi.com/2023/06/30/was-data-science-a-failure-mode-of-software-engineering/Yes, data science projects have suffered from classic software engineering mistakes, but the field is maturing with the rise of new engineering roles.How hackable are automated coding assessments?https://yanirseroussi.com/2023/05/26/how-hackable-are-automated-coding-assessments/Fri, 26 May 2023 00:03:00 +0000https://yanirseroussi.com/2023/05/26/how-hackable-are-automated-coding-assessments/Exploring the hackability of speed-based coding tests, using CodeSignal&rsquo;s Industry Coding Framework as a case study.Remaining relevant as a small language 
modelhttps://yanirseroussi.com/2023/04/21/remaining-relevant-as-a-small-language-model/Fri, 21 Apr 2023 00:06:30 +0000https://yanirseroussi.com/2023/04/21/remaining-relevant-as-a-small-language-model/Bing Chat recently quipped that humans are small language models. Here are some of my thoughts on how we small language models can remain relevant (for now).ChatGPT is transformative AIhttps://yanirseroussi.com/2022/12/11/chatgpt-is-transformative-ai/Sun, 11 Dec 2022 00:00:00 +0000https://yanirseroussi.com/2022/12/11/chatgpt-is-transformative-ai/My perspective after a week of using ChatGPT: This is a step change in finding distilled information, and it&rsquo;s only the beginning.Causal Machine Learning is off to a good start, despite some issueshttps://yanirseroussi.com/2022/09/12/causal-machine-learning-book-draft-review/Mon, 12 Sep 2022 02:45:00 +0000https://yanirseroussi.com/2022/09/12/causal-machine-learning-book-draft-review/Reviewing the first three chapters of the book Causal Machine Learning by Robert Osazuwa Ness.The mission matters: Moving to climate tech as a data scientisthttps://yanirseroussi.com/2022/06/06/the-mission-matters-moving-to-climate-tech-as-a-data-scientist/Mon, 06 Jun 2022 00:00:00 +0000https://yanirseroussi.com/2022/06/06/the-mission-matters-moving-to-climate-tech-as-a-data-scientist/Discussing my recent career move into climate tech as a way of doing more to help mitigate dangerous climate change.Building useful machine learning tools keeps getting easier: A fish ID case studyhttps://yanirseroussi.com/2022/03/20/building-useful-machine-learning-tools-keeps-getting-easier-a-fish-id-case-study/Sun, 20 Mar 2022 04:30:00 +0000https://yanirseroussi.com/2022/03/20/building-useful-machine-learning-tools-keeps-getting-easier-a-fish-id-case-study/Lessons learned building a fish ID web app with fast.ai and Streamlit, in an attempt to reduce my fear of missing out on the latest deep learning developments.Analysis strategies in online A/B experiments: 
Intention-to-treat, per-protocol, and other lessons from clinical trialshttps://yanirseroussi.com/2022/01/14/analysis-strategies-in-online-a-b-experiments/Fri, 14 Jan 2022 00:05:40 +0000https://yanirseroussi.com/2022/01/14/analysis-strategies-in-online-a-b-experiments/Epidemiologists analyse clinical trials to estimate the intention-to-treat and per-protocol effects. This post applies their strategies to online experiments.Use your human brain to avoid artificial intelligence disastershttps://yanirseroussi.com/2021/11/22/use-your-human-brain-to-avoid-artificial-intelligence-disasters/Mon, 22 Nov 2021 03:45:00 +0000https://yanirseroussi.com/2021/11/22/use-your-human-brain-to-avoid-artificial-intelligence-disasters/Overview of a talk I gave at a deep learning course, focusing on AI ethics as the need for humans to think on the context and consequences of applying AI.Migrating from WordPress.com to Hugo on GitHub + Cloudflarehttps://yanirseroussi.com/2021/11/10/migrating-from-wordpress-com-to-hugo-on-github-cloudflare/Wed, 10 Nov 2021 06:30:00 +0000https://yanirseroussi.com/2021/11/10/migrating-from-wordpress-com-to-hugo-on-github-cloudflare/My reasons for switching from WordPress.com to Hugo on GitHub + Cloudflare, along with a summary of the solution components and migration process.My work with Automattichttps://yanirseroussi.com/2021/10/07/my-work-with-automattic/Thu, 07 Oct 2021 00:00:00 +0000https://yanirseroussi.com/2021/10/07/my-work-with-automattic/Back-dated meta-post that gathers my posts on Automattic blogs into a summary of the work I&rsquo;ve done with the company.Some highlights from 2020https://yanirseroussi.com/2021/04/05/some-highlights-from-2020/Mon, 05 Apr 2021 06:41:48 +0000https://yanirseroussi.com/2021/04/05/some-highlights-from-2020/Sharing remote teamwork insights, my climate &amp; sustainability activism, Reef Life Survey publications, and progress on Automattic&rsquo;s Experimentation Platform.Many is not enough: Counting simulations to 
bootstrap the right wayhttps://yanirseroussi.com/2020/08/24/many-is-not-enough-counting-simulations-to-bootstrap-the-right-way/Mon, 24 Aug 2020 01:35:17 +0000https://yanirseroussi.com/2020/08/24/many-is-not-enough-counting-simulations-to-bootstrap-the-right-way/Going deeper into correct testing of different methods for bootstrap estimation of confidence intervals.Software commodities are eating interesting data science workhttps://yanirseroussi.com/2020/01/11/software-commodities-are-eating-interesting-data-science-work/Sat, 11 Jan 2020 09:22:35 +0000https://yanirseroussi.com/2020/01/11/software-commodities-are-eating-interesting-data-science-work/Being a data scientist can sometimes feel like a race against software commodities that replace interesting work. What can one do to remain relevant?A day in the life of a remote data scientisthttps://yanirseroussi.com/2019/12/12/a-day-in-the-life-of-a-remote-data-scientist/Wed, 11 Dec 2019 22:06:19 +0000https://yanirseroussi.com/2019/12/12/a-day-in-the-life-of-a-remote-data-scientist/Video of a talk I gave on remote data science work at the Data Science Sydney meetup.Bootstrapping the right way?https://yanirseroussi.com/2019/10/06/bootstrapping-the-right-way/Sun, 06 Oct 2019 06:48:07 +0000https://yanirseroussi.com/2019/10/06/bootstrapping-the-right-way/Video and summary of a talk I gave at YOW! Data on bootstrap estimation of confidence intervals.Hackers beware: Bootstrap sampling may be harmfulhttps://yanirseroussi.com/2019/01/08/hackers-beware-bootstrap-sampling-may-be-harmful/Mon, 07 Jan 2019 21:07:56 +0000https://yanirseroussi.com/2019/01/08/hackers-beware-bootstrap-sampling-may-be-harmful/Bootstrap sampling has been promoted as an easy way of modelling uncertainty to hackers without much statistical knowledge. 
But things aren&rsquo;t that simple.The most practical causal inference book I’ve read (is still a draft)https://yanirseroussi.com/2018/12/24/the-most-practical-causal-inference-book-ive-read-is-still-a-draft/Mon, 24 Dec 2018 02:37:50 +0000https://yanirseroussi.com/2018/12/24/the-most-practical-causal-inference-book-ive-read-is-still-a-draft/Causal Inference by Miguel Hernán and Jamie Robins is a must-read for anyone interested in the area.Reflections on remote data science workhttps://yanirseroussi.com/2018/11/03/reflections-on-remote-data-science-work/Sat, 03 Nov 2018 06:33:13 +0000https://yanirseroussi.com/2018/11/03/reflections-on-remote-data-science-work/Discussing the pluses and minuses of remote work eighteen months after joining Automattic as a data scientist.Defining data science in 2018https://yanirseroussi.com/2018/07/22/defining-data-science-in-2018/Sun, 22 Jul 2018 08:27:43 +0000https://yanirseroussi.com/2018/07/22/defining-data-science-in-2018/Updating my definition of data science to match changes in the field. 
It is now broader than before, but its ultimate goal is still to support decisions.Advice for aspiring data scientists and other FAQshttps://yanirseroussi.com/2017/10/15/advice-for-aspiring-data-scientists-and-other-faqs/Sun, 15 Oct 2017 09:15:25 +0000https://yanirseroussi.com/2017/10/15/advice-for-aspiring-data-scientists-and-other-faqs/Frequently asked questions by visitors to this site, especially around entering the data science field.State of Bandcamp Recommender, Late 2017https://yanirseroussi.com/2017/09/02/state-of-bandcamp-recommender/Sat, 02 Sep 2017 10:19:02 +0000https://yanirseroussi.com/2017/09/02/state-of-bandcamp-recommender/Call for BCRecommender maintainers followed by a decision to shut it down, as I don&rsquo;t have enough time and Bandcamp now offers recommendations.My 10-step path to becoming a remote data scientist with Automattichttps://yanirseroussi.com/2017/07/29/my-10-step-path-to-becoming-a-remote-data-scientist-with-automattic/Sat, 29 Jul 2017 05:39:26 +0000https://yanirseroussi.com/2017/07/29/my-10-step-path-to-becoming-a-remote-data-scientist-with-automattic/I wanted a well-paid data science-y remote job with an established company that offers a good life balance and makes products I care about. 
I got it eventually.Exploring and visualising Reef Life Survey datahttps://yanirseroussi.com/2017/06/03/exploring-and-visualising-reef-life-survey-data/Sat, 03 Jun 2017 00:49:05 +0000https://yanirseroussi.com/2017/06/03/exploring-and-visualising-reef-life-survey-data/Web tools I built to visualise Reef Life Survey data and assist citizen scientists in underwater visual census work.Customer lifetime value and the proliferation of misinformation on the internethttps://yanirseroussi.com/2017/01/08/customer-lifetime-value-and-the-proliferation-of-misinformation-on-the-internet/Sun, 08 Jan 2017 20:02:30 +0000https://yanirseroussi.com/2017/01/08/customer-lifetime-value-and-the-proliferation-of-misinformation-on-the-internet/There&rsquo;s a lot of misleading content on the estimation of customer lifetime value. Here&rsquo;s what I learned about doing it well.Ask Why! Finding motives, causes, and purpose in data sciencehttps://yanirseroussi.com/2016/09/19/ask-why-finding-motives-causes-and-purpose-in-data-science/Mon, 19 Sep 2016 21:28:44 +0000https://yanirseroussi.com/2016/09/19/ask-why-finding-motives-causes-and-purpose-in-data-science/Video and summary of a talk I gave at the Data Science Sydney meetup, about going beyond the what &amp; how of predictive modelling.If you don’t pay attention, data can drive you off a cliffhttps://yanirseroussi.com/2016/08/21/seven-ways-to-be-data-driven-off-a-cliff/Sun, 21 Aug 2016 21:34:17 +0000https://yanirseroussi.com/2016/08/21/seven-ways-to-be-data-driven-off-a-cliff/Seven common mistakes to avoid when working with data, such as ignoring uncertainty and confusing observed and unobserved quantities.Is Data Scientist a useless job title?https://yanirseroussi.com/2016/08/04/is-data-scientist-a-useless-job-title/Thu, 04 Aug 2016 22:26:03 +0000https://yanirseroussi.com/2016/08/04/is-data-scientist-a-useless-job-title/It seems like anyone who touches data can call themselves a data scientist, which makes the title useless. 
The work they do can still be useful, though.Making Bayesian A/B testing more accessiblehttps://yanirseroussi.com/2016/06/19/making-bayesian-ab-testing-more-accessible/Sun, 19 Jun 2016 10:32:15 +0000https://yanirseroussi.com/2016/06/19/making-bayesian-ab-testing-more-accessible/A web tool I built to interpret A/B test results in a Bayesian way, including prior specification, visualisations, and decision rules.Diving deeper into causality: Pearl, Kleinberg, Hill, and untested assumptionshttps://yanirseroussi.com/2016/05/15/diving-deeper-into-causality-pearl-kleinberg-hill-and-untested-assumptions/Sat, 14 May 2016 19:57:03 +0000https://yanirseroussi.com/2016/05/15/diving-deeper-into-causality-pearl-kleinberg-hill-and-untested-assumptions/Discussing the need for untested assumptions and temporality in causal inference. Mostly based on Samantha Kleinberg&rsquo;s Causality, Probability, and Time.The rise of greedy robotshttps://yanirseroussi.com/2016/03/20/the-rise-of-greedy-robots/Sun, 20 Mar 2016 20:33:43 +0000https://yanirseroussi.com/2016/03/20/the-rise-of-greedy-robots/Is artificial/machine intelligence a future threat? 
I argue that it&rsquo;s already here, with greedy robots already dominating our lives.Why you should stop worrying about deep learning and deepen your understanding of causality insteadhttps://yanirseroussi.com/2016/02/14/why-you-should-stop-worrying-about-deep-learning-and-deepen-your-understanding-of-causality-instead/Sun, 14 Feb 2016 11:04:11 +0000https://yanirseroussi.com/2016/02/14/why-you-should-stop-worrying-about-deep-learning-and-deepen-your-understanding-of-causality-instead/Causality is often overlooked but is of much higher relevance to most data scientists than deep learning.The joys of offline data collectionhttps://yanirseroussi.com/2016/01/24/the-joys-of-offline-data-collection/Sun, 24 Jan 2016 00:32:25 +0000https://yanirseroussi.com/2016/01/24/the-joys-of-offline-data-collection/Insights on data collection and machine learning from spending a month sailing, diving, and counting fish with Reef Life Survey.This holiday season, give me real insightshttps://yanirseroussi.com/2015/12/08/this-holiday-season-give-me-real-insights/Tue, 08 Dec 2015 06:57:25 +0000https://yanirseroussi.com/2015/12/08/this-holiday-season-give-me-real-insights/Some companies present raw data or information as &ldquo;insights&rdquo;. 
This post surveys some examples, and discusses how they can be turned into real insights.The hardest parts of data sciencehttps://yanirseroussi.com/2015/11/23/the-hardest-parts-of-data-science/Mon, 23 Nov 2015 04:14:21 +0000https://yanirseroussi.com/2015/11/23/the-hardest-parts-of-data-science/Defining feasible problems and coming up with reasonable ways of measuring solutions is harder than building accurate models or obtaining clean data.Migrating a simple web application from MongoDB to Elasticsearchhttps://yanirseroussi.com/2015/11/04/migrating-a-simple-web-application-from-mongodb-to-elasticsearch/Wed, 04 Nov 2015 03:53:18 +0000https://yanirseroussi.com/2015/11/04/migrating-a-simple-web-application-from-mongodb-to-elasticsearch/Migrating BCRecommender from MongoDB to Elasticsearch made it possible to offer a richer search experience to users at a similar cost, among other benefits.Miscommunicating science: Simplistic models, nutritionism, and the art of storytellinghttps://yanirseroussi.com/2015/10/19/nutritionism-and-the-need-for-complex-models-to-explain-complex-phenomena/Mon, 19 Oct 2015 00:02:32 +0000https://yanirseroussi.com/2015/10/19/nutritionism-and-the-need-for-complex-models-to-explain-complex-phenomena/Nutritionism is a special case of misinterpretation and miscommunication of scientific results – something many data scientists encounter in their work.The wonderful world of recommender systemshttps://yanirseroussi.com/2015/10/02/the-wonderful-world-of-recommender-systems/Fri, 02 Oct 2015 05:25:57 +0000https://yanirseroussi.com/2015/10/02/the-wonderful-world-of-recommender-systems/Giving an overview of the field and common paradigms, and debunking five common myths about recommender systems.You don’t need a data scientist (yet)https://yanirseroussi.com/2015/08/24/you-dont-need-a-data-scientist-yet/Mon, 24 Aug 2015 08:25:30 +0000https://yanirseroussi.com/2015/08/24/you-dont-need-a-data-scientist-yet/Hiring data scientists prematurely is wasteful and 
frustrating. Here are some questions to ask before you hire your first data scientist.Goodbye, Parse.comhttps://yanirseroussi.com/2015/07/31/goodbye-parse-com/Fri, 31 Jul 2015 03:29:50 +0000https://yanirseroussi.com/2015/07/31/goodbye-parse-com/Migrating my web apps away from Parse.com due to reliability issues. Self-hosting is a better solution.Learning about deep learning through album cover classificationhttps://yanirseroussi.com/2015/07/06/learning-about-deep-learning-through-album-cover-classification/Mon, 06 Jul 2015 22:21:42 +0000https://yanirseroussi.com/2015/07/06/learning-about-deep-learning-through-album-cover-classification/Progress on my album cover classification project, highlighting lessons that would be useful to others who are getting started with deep learning.Deep learning resourceshttps://yanirseroussi.com/deep-learning-resources/Mon, 06 Jul 2015 00:38:44 +0000https://yanirseroussi.com/deep-learning-resources/<p>This page summarises the deep learning resources I&rsquo;ve consulted in <a href="https://yanirseroussi.com/2015/06/06/hopping-on-the-deep-learning-bandwagon/">my album cover classification project</a>.</p> +<h3 id="tutorials-and-blog-posts">Tutorials and blog posts</h3> +<ul> +<li><a href="http://cs231n.github.io/" target="_blank" rel="noopener">Convolutional Neural Networks for Visual Recognition Stanford course notes</a>: an excellent resource, very up-to-date and useful, despite still being a work in progress</li> +<li><a href="http://deeplearning.net/tutorial/" target="_blank" rel="noopener">DeepLearning.net&rsquo;s Theano-based tutorials</a>: not as up-to-date as the Stanford course notes, but still a good introduction to some of the theory and general Theano usage</li> +<li><a href="http://lasagne.readthedocs.org/en/latest/" target="_blank" rel="noopener">Lasagne&rsquo;s documentation and tutorials</a>: still a bit lacking, but good when you know what you&rsquo;re looking for</li> +<li><a 
href="https://github.com/enlitic/lasagne4newbs" target="_blank" rel="noopener">lasagne4newbs</a>: Lasagne&rsquo;s convnet example with richer comments</li> +<li><a href="http://danielnouri.org/notes/2014/12/17/using-convolutional-neural-nets-to-detect-facial-keypoints-tutorial/" target="_blank" rel="noopener">Using convolutional neural nets to detect facial keypoints tutorial</a>: the resource that made me want to use Lasagne</li> +<li><a href="http://benanne.github.io/2015/03/17/plankton.html" target="_blank" rel="noopener">Classifying plankton with deep neural networks</a>: an epic post, which I found while looking for Lasagne examples</li> +<li><a href="https://en.wikipedia.org/wiki/Main_Page" target="_blank" rel="noopener">Various Wikipedia pages</a>: a bit disappointing – the above resources are much better</li> +</ul> +<h3 id="papers">Papers</h3> +<ul> +<li><a href="http://arxiv.org/abs/1412.6980" target="_blank" rel="noopener">Adam: a method for stochastic optimization (Kingma and Ba, 2015)</a>: an improvement over SGD with Nesterov momentum, AdaGrad and RMSProp, which I found to be useful in practice</li> +<li><a href="http://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization" target="_blank" rel="noopener">Algorithms for Hyper-Parameter Optimization (Bergstra et al., 2011)</a>: the work behind <a href="https://github.com/hyperopt/hyperopt" target="_blank" rel="noopener">Hyperopt</a> – pretty useful stuff, not only for deep learning</li> +<li><a href="http://arxiv.org/abs/1412.1710" target="_blank" rel="noopener">Convolutional Neural Networks at Constrained Time Cost (He and Sun, 2014)</a>: interesting experimental work on the tradeoffs between number of filters, filter sizes, and depth – deeper is better (but with diminishing returns); smaller filter sizes are better; delayed subsampling and spatial pyramid pooling are helpful</li> +<li><a href="http://arxiv.org/abs/1404.7828" target="_blank" rel="noopener">Deep Learning in Neural 
Networks: An Overview (Schmidhuber, 2014)</a>: 88 pages and 888 references (35 content pages) – good for finding references, but a bit hard to follow; not so good for understanding how the various methods work and how to use or implement them</li> +<li><a href="http://arxiv.org/abs/1409.4842" target="_blank" rel="noopener">Going deeper with convolutions (Szegedy et al., 2014)</a>: the GoogLeNet paper – interesting and compelling results, especially given the improvement in performance while reducing computational complexity</li> +<li><a href="http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks" target="_blank" rel="noopener">ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky et al., 2012)</a>: the classic paper that arguably started (or significantly boosted) the recent buzz around deep learning – many interesting ideas; fairly accessible</li> +<li><a href="http://www.cs.toronto.edu/~gdahl/papers/momentumNesterovDeepLearning.pdf" target="_blank" rel="noopener">On the importance of initialization and momentum in deep learning (Sutskever et al., 2013)</a>: applying Nesterov momentum to deep learning – good read, simple concept, interesting results</li> +<li><a href="http://jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf" target="_blank" rel="noopener">Random Search for Hyper-Parameter Optimization (Bergstra and Bengio, 2012)</a>: very compelling reasoning and experiments showing that random search outperforms grid search in many cases</li> +<li><a href="http://sergeykarayev.com/files/1311.3715v3.pdf" target="_blank" rel="noopener">Recognizing Image Style (Karayev et al., 2014)</a>: identifying image style, which is similar to album genre – found that using models pretrained on ImageNet yielded the best results in some cases</li> +<li><a href="http://arxiv.org/abs/1409.1556" target="_blank" rel="noopener">Very deep convolutional networks for large scale image recognition (Simonyan and Zisserman, 
2014)</a>: VGGNet paper – interesting experiments and architectures – deep and homogeneous</li> +<li><a href="http://arxiv.org/abs/1311.2901" target="_blank" rel="noopener">Visualizing and Understanding Convolutional Networks (Zeiler and Fergus, 2013)</a>: interesting work on visualisation, but I&rsquo;ll need to apply it to understand it better</li> +</ul>Hopping on the deep learning bandwagonhttps://yanirseroussi.com/2015/06/06/hopping-on-the-deep-learning-bandwagon/Sat, 06 Jun 2015 05:00:22 +0000https://yanirseroussi.com/2015/06/06/hopping-on-the-deep-learning-bandwagon/To become proficient at solving data science problems, you need to get your hands dirty. Here, I used album cover classification to learn about deep learning.First steps in data science: author-aware sentiment analysishttps://yanirseroussi.com/2015/05/02/first-steps-in-data-science-author-aware-sentiment-analysis/Sat, 02 May 2015 08:31:10 +0000https://yanirseroussi.com/2015/05/02/first-steps-in-data-science-author-aware-sentiment-analysis/I became a data scientist by doing a PhD, but the same steps can be followed without a formal education program.My divestment from fossil fuelshttps://yanirseroussi.com/2015/04/24/my-divestment-from-fossil-fuels/Fri, 24 Apr 2015 00:19:36 +0000https://yanirseroussi.com/2015/04/24/my-divestment-from-fossil-fuels/Recent choices I&rsquo;ve made to reduce my exposure to fossil fuels, including practical steps that can be taken by Australians and generally applicable lessons.My PhD workhttps://yanirseroussi.com/phd-work/Mon, 30 Mar 2015 03:23:33 +0000https://yanirseroussi.com/phd-work/An overview of my PhD in data science / artificial intelligence. 
Thesis title: Text Mining and Rating Prediction with Topical User Models.The long road to a lifestyle businesshttps://yanirseroussi.com/2015/03/22/the-long-road-to-a-lifestyle-business/Sun, 22 Mar 2015 09:43:47 +0000https://yanirseroussi.com/2015/03/22/the-long-road-to-a-lifestyle-business/Progress since leaving my last full-time job and setting on an independent path that includes data science consulting and work on my own projects.Learning to rank for personalised search (Yandex Search Personalisation – Kaggle Competition Summary – Part 2)https://yanirseroussi.com/2015/02/11/learning-to-rank-for-personalised-search-yandex-search-personalisation-kaggle-competition-summary-part-2/Wed, 11 Feb 2015 06:34:17 +0000https://yanirseroussi.com/2015/02/11/learning-to-rank-for-personalised-search-yandex-search-personalisation-kaggle-competition-summary-part-2/My team&rsquo;s solution to the Yandex Search Personalisation competition (finished 9th out of 194 teams).Is thinking like a search engine possible? 
(Yandex search personalisation – Kaggle competition summary – part 1)https://yanirseroussi.com/2015/01/29/is-thinking-like-a-search-engine-possible-yandex-search-personalisation-kaggle-competition-summary-part-1/Thu, 29 Jan 2015 10:37:39 +0000https://yanirseroussi.com/2015/01/29/is-thinking-like-a-search-engine-possible-yandex-search-personalisation-kaggle-competition-summary-part-1/Insights on search personalisation and SEO from participating in a Kaggle competition (finished 9th out of 194 teams).Automating Parse.com bulk data importshttps://yanirseroussi.com/2015/01/15/automating-parse-com-bulk-data-imports/Thu, 15 Jan 2015 04:41:16 +0000https://yanirseroussi.com/2015/01/15/automating-parse-com-bulk-data-imports/A script for importing data into the Parse backend-as-a-service.Stochastic Gradient Boosting: Choosing the Best Number of Iterationshttps://yanirseroussi.com/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/Mon, 29 Dec 2014 02:30:06 +0000https://yanirseroussi.com/2014/12/29/stochastic-gradient-boosting-choosing-the-best-number-of-iterations/Exploring an approach to choosing the optimal number of iterations in stochastic gradient boosting, following a bug I found in scikit-learn.SEO: Mostly about showing up?https://yanirseroussi.com/2014/12/15/seo-mostly-about-showing-up/Mon, 15 Dec 2014 04:25:25 +0000https://yanirseroussi.com/2014/12/15/seo-mostly-about-showing-up/Increasing SEO traffic to BCRecommender by adding content and opening up more pages for crawling. 
It turns out that thin content is better than no content.Fitting noise: Forecasting the sale price of bulldozers (Kaggle competition summary)https://yanirseroussi.com/2014/11/19/fitting-noise-forecasting-the-sale-price-of-bulldozers-kaggle-competition-summary/Wed, 19 Nov 2014 09:17:34 +0000https://yanirseroussi.com/2014/11/19/fitting-noise-forecasting-the-sale-price-of-bulldozers-kaggle-competition-summary/Summary of a Kaggle competition to forecast bulldozer sale price, where I finished 9th out of 476 teams.BCRecommender Traction Updatehttps://yanirseroussi.com/2014/11/05/bcrecommender-traction-update/Wed, 05 Nov 2014 02:29:35 +0000https://yanirseroussi.com/2014/11/05/bcrecommender-traction-update/Update on BCRecommender traction using three channels: blogger outreach, search engine optimisation, and content marketing.What is data science?https://yanirseroussi.com/2014/10/23/what-is-data-science/Thu, 23 Oct 2014 03:22:08 +0000https://yanirseroussi.com/2014/10/23/what-is-data-science/Data science has been a hot term in the past few years. Still, there isn&rsquo;t a single definition of the field. 
This post discusses my favourite definition.Greek Media Monitoring Kaggle competition: My approachhttps://yanirseroussi.com/2014/10/07/greek-media-monitoring-kaggle-competition-my-approach/Tue, 07 Oct 2014 03:21:35 +0000https://yanirseroussi.com/2014/10/07/greek-media-monitoring-kaggle-competition-my-approach/Summary of my approach to the Greek Media Monitoring Kaggle competition, where I finished 6th out of 120 teams.Applying the Traction Book’s Bullseye framework to BCRecommenderhttps://yanirseroussi.com/2014/09/24/applying-the-traction-books-bullseye-framework-to-bcrecommender/Wed, 24 Sep 2014 04:57:39 +0000https://yanirseroussi.com/2014/09/24/applying-the-traction-books-bullseye-framework-to-bcrecommender/Ranking 19 channels with the goal of getting traction for BCRecommender.Bandcamp recommendation and discovery algorithmshttps://yanirseroussi.com/2014/09/19/bandcamp-recommendation-and-discovery-algorithms/Fri, 19 Sep 2014 14:26:55 +0000https://yanirseroussi.com/2014/09/19/bandcamp-recommendation-and-discovery-algorithms/The recommendation backend for my BCRecommender service for personalised Bandcamp music discovery.Building a recommender system on a shoestring budget (or: BCRecommender part 2 – general system layout)https://yanirseroussi.com/2014/09/07/building-a-recommender-system-on-a-shoestring-budget/Sun, 07 Sep 2014 10:48:44 +0000https://yanirseroussi.com/2014/09/07/building-a-recommender-system-on-a-shoestring-budget/Iterating on my BCRecommender service with the goal of keeping costs low while providing a valuable music recommendation service.Building a Bandcamp recommender system (part 1 – motivation)https://yanirseroussi.com/2014/08/30/building-a-bandcamp-recommender-system-part-1-motivation/Sat, 30 Aug 2014 08:11:38 +0000https://yanirseroussi.com/2014/08/30/building-a-bandcamp-recommender-system-part-1-motivation/My motivation behind building BCRecommender, a free recommendation &amp; discovery service for Bandcamp music.How to (almost) win Kaggle 
competitionshttps://yanirseroussi.com/2014/08/24/how-to-almost-win-kaggle-competitions/Sun, 24 Aug 2014 12:40:53 +0000https://yanirseroussi.com/2014/08/24/how-to-almost-win-kaggle-competitions/Summary of a talk I gave at the Data Science Sydney meetup with ten tips on almost-winning Kaggle competitions.Data’s hierarchy of needshttps://yanirseroussi.com/2014/08/17/datas-hierarchy-of-needs/Sun, 17 Aug 2014 13:09:30 +0000https://yanirseroussi.com/2014/08/17/datas-hierarchy-of-needs/Discussing the hierarchy of needs proposed by Jay Kreps. Key takeaway: Data-driven algorithms &amp; insights can only be as good as the underlying data.Kaggle competition tips and summarieshttps://yanirseroussi.com/kaggle/Sat, 05 Apr 2014 23:46:10 +0000https://yanirseroussi.com/kaggle/Pointers to all my Kaggle advice posts and competition summaries.Kaggle beginner tipshttps://yanirseroussi.com/2014/01/19/kaggle-beginner-tips/Sun, 19 Jan 2014 10:34:28 +0000https://yanirseroussi.com/2014/01/19/kaggle-beginner-tips/First post! An email I sent to members of the Data Science Sydney Meetup with tips on how to get started with Kaggle competitions.About Yanir: Startup Data & AI Consultanthttps://yanirseroussi.com/about/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/about/About Yanir Seroussi, a hands-on data tech lead with over a decade of experience. Yanir helps climate/nature tech startups ship data-intensive solutions.Book a free fifteen-minute callhttps://yanirseroussi.com/free-intro-call/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/free-intro-call/Booking form for a quick intro call with Yanir Seroussi.Causal inference resourceshttps://yanirseroussi.com/causal-inference-resources/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/causal-inference-resources/<p>This is a list of some causal inference resources, which I update from time to time. 
You can also check out my posts on <a href="https://yanirseroussi.com/tags/causal-inference/">causal inference</a> and <a href="https://yanirseroussi.com/tags/a/b-testing/">A/B testing</a>.</p> +<p><strong>Books</strong>:</p> +<ul> +<li><a href="https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/" target="_blank" rel="noopener"><em>Causal Inference: What if</em></a> by Miguel Hernán and Jamie Robins: <a href="https://yanirseroussi.com/2018/12/24/the-most-practical-causal-inference-book-ive-read-is-still-a-draft/">The most practical book I&rsquo;ve read</a>. Highly recommended.</li> +<li><a href="https://experimentguide.com/" target="_blank" rel="noopener"><em>Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing</em></a> by Ron Kohavi, Diane Tang, and Ya Xu: Building on the authors&rsquo; decades of industry experience, this is pretty much the bible of online experiments, which is how causal inference is often done in practice.</li> +<li><a href="http://www.skleinberg.org/why/" target="_blank" rel="noopener"><em>Why: A Guide to Finding and Using Causes</em></a> by Samantha Kleinberg: A high-level intro to the topic. I discussed highlights in <a href="https://yanirseroussi.com/2016/02/14/why-you-should-stop-worrying-about-deep-learning-and-deepen-your-understanding-of-causality-instead/"><em>Why you should stop worrying about deep learning and deepen your understanding of causality instead</em></a>.</li> +<li><a href="http://www.skleinberg.org/causality_book/index.html" target="_blank" rel="noopener"><em>Causality, Probability, and Time</em></a> by Samantha Kleinberg: More technical than Kleinberg&rsquo;s other book. As the title suggests, the element of time is central to the methods presented in the book. However, I&rsquo;m still unsure about the practicality of those methods on real data. 
See my post <a href="https://yanirseroussi.com/2016/05/15/diving-deeper-into-causality-pearl-kleinberg-hill-and-untested-assumptions/"><em>Diving deeper into causality: Pearl, Kleinberg, Hill, and untested assumptions</em></a> for more details.</li> +<li><a href="http://bayes.cs.ucla.edu/PRIMER/" target="_blank" rel="noopener"><em>Causal Inference in Statistics: A Primer</em></a> by Judea Pearl, Madelyn Glymour, Nicholas P. Jewell: A fairly accessible introduction to Judea Pearl&rsquo;s work. I didn&rsquo;t find it that practical, but I believe it helped me understand the graphical modelling parts of <em>Causal Inference</em> by Hernán and Robins.</li> +<li><a href="https://mitpress.mit.edu/books/elements-causal-inference" target="_blank" rel="noopener"><em>Elements of Causal Inference: Foundations and Learning Algorithms</em></a> by Jonas Peters, Dominik Janzing, and Bernhard Schölkopf: The name of the book is an obvious reference to the classic book <a href="https://web.stanford.edu/~hastie/ElemStatLearn/" target="_blank" rel="noopener"><em>The Elements of Statistical Learning</em></a> by Trevor Hastie, Robert Tibshirani, and Jerome Friedman. Unfortunately, the <em>Elements of Causal Inference</em> isn&rsquo;t as widely applicable as Hastie et al.&rsquo;s book – it contains some interesting ideas, but it appears that algorithms for causal learning from data with minimal assumptions aren&rsquo;t yet scalable enough for practical use. This will probably change in the future.</li> +<li><a href="http://www.mostlyharmlesseconometrics.com/" target="_blank" rel="noopener"><em>Mostly Harmless Econometrics</em></a> by Joshua D. Angrist and Jörn-Steffen Pischke: I started reading this book on my Kindle and was put off by some formatting issues. It also seemed like a less-general version of Pearl&rsquo;s work. 
I may get back to it one day.</li> +<li><a href="http://bayes.cs.ucla.edu/BOOK-2K/index.html" target="_blank" rel="noopener"><em>Causality: Models, Reasoning, and Inference</em></a> by Judea Pearl: I haven&rsquo;t read it, and I doubt it&rsquo;d be very practical given <a href="https://www.reddit.com/r/statistics/comments/8lu1sr/causal_inference_book_recommendations/" target="_blank" rel="noopener">the opinions of people who have</a>. But maybe I&rsquo;ll get to it one day.</li> +<li><a href="http://bayes.cs.ucla.edu/WHY/" target="_blank" rel="noopener"><em>The Book of Why: The New Science of Cause and Effect</em></a> by Judea Pearl and Dana Mackenzie: An accessible overview of the field, focusing on Pearl&rsquo;s contributions, but with plenty of historical background. Worth reading to get excited about the causal revolution.</li> +<li><a href="https://www.manning.com/books/causal-machine-learning" target="_blank" rel="noopener"><em>Causal Machine Learning</em></a> by Robert Osazuwa Ness: Still a draft as of September 2022, but <a href="https://yanirseroussi.com/2022/09/12/causal-machine-learning-book-draft-review/">it looks promising</a>.</li> +</ul> +<p><strong>Articles</strong>:</p>Free Guide: Data-to-AI Health Check for Startupshttps://yanirseroussi.com/data-to-ai-health-check/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/data-to-ai-health-check/Download a free PDF guide that helps you assess a startup&rsquo;s Data-to-AI health by probing eight key areas.Helping climate & nature tech startups ship data-intensive solutionshttps://yanirseroussi.com/consult/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/consult/Consulting for climate &amp; nature tech startups: Strategic advice, implementation of Data/AI/ML solutions, and hiring help by an experienced tech leader.Speaking engagements by Yanir: Startup Data & AI Consultanthttps://yanirseroussi.com/talks/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/talks/Yanir Seroussi speaks on 
data science, artificial intelligence, machine learning, and career journey.Stay in touchhttps://yanirseroussi.com/contact/Mon, 01 Jan 0001 00:00:00 +0000https://yanirseroussi.com/contact/Contact me or subscribe to the mailing list. \ No newline at end of file