<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Data Modernisation Journey: Data Engineer's Journey🧑‍💻]]></title><description><![CDATA[Professional lessons and development

Being a Data Engineer is more than just a job; it's a lively journey. Fifteen years ago, I began my career as a data consultant, eager for guidance to help me succeed in this ever-changing field. 

I encountered many challenges along the way, but I turned those experiences into valuable lessons that I am eager to share with you, particularly the human side of data engineering, including life experiences, emotional intelligence, and productivity tips. 

Let's embark on this journey together to elevate your data skills to the next level!]]></description><link>https://blog.bigdatadig.com/s/data-engineers-journey</link><image><url>https://substackcdn.com/image/fetch/$s_!LYrU!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72911628-39d5-427d-9c35-9a874ca2c15e_300x300.png</url><title>Data Modernisation Journey: Data Engineer&apos;s Journey🧑‍💻</title><link>https://blog.bigdatadig.com/s/data-engineers-journey</link></image><generator>Substack</generator><lastBuildDate>Wed, 08 Apr 2026 10:32:14 GMT</lastBuildDate><atom:link href="https://blog.bigdatadig.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[BigDataDig Limited]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[datamodernisationjourney@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[datamodernisationjourney@substack.com]]></itunes:email><itunes:name><![CDATA[Muhammad Khurram]]></itunes:name></itunes:owner><itunes:author><![CDATA[Muhammad Khurram]]></itunes:author><googleplay:owner><![CDATA[datamodernisationjourney@substack.com]]></googleplay:owner><googleplay:email><![CDATA[datamodernisationjourney@substack.com]]></googleplay:email><googleplay:author><![CDATA[Muhammad Khurram]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[#039 - No Budget, No Insights]]></title><description><![CDATA[Why struggling companies can&#8217;t build great data teams]]></description><link>https://blog.bigdatadig.com/p/037-no-budget-no-insights</link><guid isPermaLink="false">https://blog.bigdatadig.com/p/037-no-budget-no-insights</guid><dc:creator><![CDATA[Muhammad Khurram]]></dc:creator><pubDate>Sat, 27 Sep 2025 03:00:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!iFj9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hey there,</p><p>After 10+ data migrations across three continents, I have noticed something uncomfortable.</p><p><strong>The organisations with the strongest balance sheets consistently build the most sophisticated data capabilities.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iFj9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iFj9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 424w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 848w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 1272w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iFj9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png" width="1200" height="630" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:630,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:285237,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.bigdatadig.com/i/174488445?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iFj9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 424w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 848w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 1272w, https://substackcdn.com/image/fetch/$s_!iFj9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5417af20-98b8-410d-a23b-c6ad2066858b_1200x630.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>At a major bank, we had dedicated data quality teams and budgets for cutting-edge platforms. At a mid-sized company, we struggled to get approval for basic monitoring tools.</p><p>The difference wasn&#8217;t technical expertise. It was Maslow&#8217;s hierarchy playing out in real-time.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading the Data Modernisation Journey! Subscribe for free to receive new posts and support my work &#128522;&#128591;</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>This Week&#8217;s Deep Dive:</strong></p><ul><li><p>The 5-stage data maturity model mirrors human psychology</p></li><li><p>Why &#8220;AI-first&#8221; strategies fail for cash-strapped organisations</p></li><li><p>A practical assessment you can use on Monday morning</p></li><li><p>Real patterns from my enterprise implementations.</p></li></ul><p>Let&#8217;s break down the uncomfortable truth about data capability.</p><div><hr></div><h2>The Story: Two Migrations, Two Worlds Apart</h2><p>I have worked on two strikingly similar projects.</p><p>Both were migrations. Both had identical technical complexity and timelines.</p><p>One at a major bank. One at a mid-sized retailer.</p><p><strong>Completely different outcomes.</strong></p><h3>The Bank Project:</h3><ul><li><p>Dedicated teams for data quality</p></li><li><p>Automated testing pipelines</p></li><li><p>Redundant systems during migration</p></li><li><p>Performance issues? Additional resources approved in 24 hours</p></li></ul><h3>The Retailer Project:</h3><ul><li><p>Every decision went through three approval layers</p></li><li><p>One analyst wearing multiple hats as the &#8220;data quality team&#8221;</p></li><li><p>Extra processing power during critical migration? Two weeks for sign-off</p></li></ul><p><strong>Result:</strong></p><ul><li><p>Bank: Delivered ahead of schedule, zero data loss</p></li><li><p>Retailer: Six months over, lost two team members to burnout</p></li></ul><p><strong>What I realised:</strong>&nbsp;Organisations can&#8217;t skip levels in the data maturity pyramid.</p><p>Just like humans can&#8217;t skip basic needs.</p><div><hr></div><h2>The Data Maturity Hierarchy: 5 Stages Every Organisation Climbs</h2><p>Here&#8217;s the framework that explains everything:</p><h3><strong>Level 1: Survival (Basic Data Collection)</strong></h3><p><em>&#8220;We need reports to operate&#8221;</em></p><p><strong>What it looks like:</strong></p><ul><li><p>Spreadsheet-driven reporting</p></li><li><p>Manual data extracts</p></li><li><p>Basic storage systems</p></li><li><p>Reactive &#8220;reporting as requested&#8221;</p></li></ul><p><strong>Organisations here:</strong> Startups, struggling companies, resource-constrained teams</p><p><strong>Investment required:</strong> $50K-$200K annually</p><p><strong>Time to progress:</strong> 6-18 months with dedicated effort</p><p>I&#8217;ve seen companies spend years stuck in this position. They think they need AI when they actually need consistent month-end reporting.</p><h3><strong>Level 2: Security &amp; Trust (Reliable Foundation)</strong></h3><p><em>&#8220;Our data needs to be accurate and safe&#8221;</em></p><p><strong>What it looks like:</strong></p><ul><li><p>Data validation processes</p></li><li><p>Backup and recovery systems</p></li><li><p>Basic governance frameworks</p></li><li><p>Consistent data definitions</p></li></ul><p><strong>Organisations here:</strong> Established businesses with regulatory requirements</p><p><strong>Investment required:</strong> $200K-$500K annually</p><p><strong>Time to progress:</strong> 12-24 months</p><p><strong>Real example:</strong> One financial services project spent eight months establishing data lineage before touching analytics. Worth every hour.</p><h3><strong>Level 3: Collaboration (Breaking Silos)</strong></h3><p><em>&#8220;Everyone needs access to the same truth&#8221;</em></p><p><strong>What it looks like:</strong></p><ul><li><p>Cross-departmental data sharing</p></li><li><p>Self-service analytics tools</p></li><li><p>Standardized dashboards</p></li><li><p>Data literacy programs</p></li></ul><p><strong>Organisations here:</strong> Growing companies with multiple business units</p><p><strong>Investment required:</strong> $500K-$1M annually</p><p><strong>Time to progress:</strong> 18-36 months</p><h3><strong>Level 4: Insights &amp; Recognition (Advanced Analytics)</strong></h3><p><em>&#8220;We&#8217;re making data-driven decisions&#8221;</em></p><p><strong>What it looks like:</strong></p><ul><li><p>Predictive modeling</p></li><li><p>Real-time dashboards</p></li><li><p>ML-powered recommendations</p></li><li><p>Industry recognition for data practices</p></li></ul><p><strong>Organisations here:</strong> Market leaders, well-funded scale-ups</p><p><strong>Investment required:</strong> $1M-$3M annually</p><p><strong>Time to progress:</strong> 2-4 years</p><p><strong>Key insight:</strong> This is where most &#8220;AI transformation&#8221; projects actually begin. Not where they&#8217;re sold to start.</p><h3><strong>Level 5: Innovation (Data-Driven Transformation)</strong></h3><p><em>&#8220;Data is our competitive advantage&#8221;</em></p><p><strong>What it looks like:</strong></p><ul><li><p>AI-powered product features</p></li><li><p>Real-time personalization</p></li><li><p>Automated decision systems</p></li><li><p>New revenue streams from data</p></li></ul><p><strong>Organisations here:</strong> for example Netflix, Amazon, Google, and mature fintech companies</p><p><strong>Investment required:</strong> $3M+ annually</p><p><strong>Time to maintain:</strong> Continuous evolution required</p><div><hr></div><h2>The Uncomfortable Truth: Money Talks, Data Walks</h2><p>Here&#8217;s what I have observed across 15 years:</p><p><strong>Level 1-2 organisations ask:</strong> &#8220;What&#8217;s the cheapest way to get insights?&#8221;</p><p><strong>Level 3-4 organisations ask:</strong> &#8220;How do we scale our data capabilities?&#8221;</p><p><strong>Level 5 organisations ask:</strong> &#8220;How do we stay ahead of disruption?&#8221;</p><h3>The Pattern:</h3><p>Financially stable companies don&#8217;t just have better data.</p><p>They can <strong>afford to fail fast, learn quickly, and iterate.</strong></p><p>At one of our enterprise clients, we tested three different approaches to forecasting simultaneously.</p><p>At a struggling company, we had to pick one approach and hope it worked.</p><p><strong>Resource availability directly correlates with the progression of data maturity.</strong></p><div><hr></div><h2>Your Monday Morning Assessment</h2><p>Rate your organisation (1-5) on each dimension:</p><h3><strong>Financial Stability:</strong></h3><ul><li><p><strong>Budget approval timelines:</strong> Immediate (5) &#8594; 6+ months (1)</p></li><li><p><strong>Risk tolerance:</strong> High (5) &#8594; None (1)</p></li><li><p><strong>Team investment:</strong> Growing (5) &#8594; Shrinking (1)</p></li></ul><h3><strong>Data Maturity:</strong></h3><ul><li><p><strong>Data quality:</strong> Automated validation (5) &#8594; Manual checking (1)</p></li><li><p><strong>Analytics capability:</strong> Predictive models (5) &#8594; Excel reports (1)</p></li><li><p><strong>Decision speed:</strong> Real-time (5) &#8594; Monthly reviews (1)</p></li></ul><h3><strong>The Pattern:</strong></h3><p>Your financial score predicts your data maturity ceiling.</p><p>Organisations consistently score within 1 point between financial stability and data maturity.</p><p>I have never seen a Level 1 financial organisation sustain Level 4 data practices.</p><div><hr></div><h2>What This Means for Your Career</h2><h3><strong>If you&#8217;re at a Level 1-2 organisation:</strong></h3><ul><li><p>Focus on foundational skills: SQL, data quality, basic automation</p></li><li><p>Build systems that show immediate ROI</p></li><li><p>Document everything; you&#8217;re building credibility for future investment</p></li></ul><h3><strong>If you&#8217;re at a Level 3-4 organisation:</strong></h3><ul><li><p>This is where careers accelerate</p></li><li><p>Learn advanced analytics and cloud architectures</p></li><li><p>Lead cross-functional initiatives</p></li><li><p>Position yourself for the transition to Level 5</p></li></ul><h3><strong>If you&#8217;re at a Level 5 organisation:</strong></h3><ul><li><p>Stay ahead of the curve&#8212;real-time systems, AI integration</p></li><li><p>Share knowledge externally&#8212;speaking, writing, thought leadership</p></li><li><p>Consider consulting back to lower levels</p></li></ul><h3><strong>The key insight:</strong></h3><p>Align your skill development with your organisation&#8217;s realistic trajectory.</p><p>Not their aspirational goals.</p><div><hr></div><h2>The Bottom Line</h2><p>Data maturity follows a similar progression to human development.</p><p>Organisations trying to jump from basic reporting to AI are like startups trying to offer luxury products.</p><p>The foundation isn&#8217;t there.</p><p><strong>Your next promotion depends on understanding where your organisation really sits on this pyramid.</strong></p><p>Not where leadership thinks they are.</p><p>I have seen talented data professionals burn out trying to build Level 5 capabilities with Level 2 resources.</p><h3><strong>The Solution:</strong></h3><p>Match your ambitions to your organisation&#8217;s actual maturity level.</p><p>Then systematically help them climb the pyramid.</p><div><hr></div><p>What level is your organisation really operating at?</p><p>PS...If you&#8217;re enjoying this newsletter, please consider referring this edition to a friend. They&#8217;ll receive weekly insights, backed by industry research, on making data modernisation more predictable and profitable.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share&quot;,&quot;text&quot;:&quot;Share Data Modernisation Journey&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share"><span>Share Data Modernisation Journey</span></a></p><div><hr></div><p>And whenever you&#8217;re ready, there are 3 ways I can help you:</p><ol><li><p><strong>Migration Readiness Assessment &amp; Roadmap</strong> - The essential first step to clarity. Fixed-fee, 3-4 week engagement for leaders who need a comprehensive architectural blueprint and de-risked plan before committing to full-scale legacy data migration.</p></li><li><p><strong>Fractional Data Architect Retainer</strong> - Ongoing senior architectural leadership to guide your team through major projects. Consistent expert oversight that ensures design integrity, manages technical risk, and keeps complex initiatives aligned with core business goals.</p></li><li><p><strong>Advisory &amp; Review Sessions</strong> - Expert guidance on demand. Prepaid hours are ideal for reviewing internal plans, evaluating vendor proposals, or workshopping specific architectural challenges with a seasoned expert who has over 15 years of experience in the field.</p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading the Data Modernisation Journey! Subscribe for free to receive new posts and support my work &#128522;&#128591;</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[#032 - Beyond Dashboards: How Orchestration-Native Observability Is Saving Data Engineering]]></title><description><![CDATA[5 tools + 1 framework that cut incident time by 60%]]></description><link>https://blog.bigdatadig.com/p/032-beyond-dashboards-how-orchestration</link><guid isPermaLink="false">https://blog.bigdatadig.com/p/032-beyond-dashboards-how-orchestration</guid><dc:creator><![CDATA[Muhammad Khurram]]></dc:creator><pubDate>Sat, 09 Aug 2025 13:02:23 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sk6Z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hey there,</p><p>I've been asking data teams the same question for months: <em>"How long does it take you to figure out why something broke?"</em></p><p>The answer is always some version of "too long."</p><p>Here's what I learned from analysing this pattern across teams, plus 5 tools and 1 framework you can start using today.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sk6Z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sk6Z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sk6Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1775450,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.bigdatadig.com/i/170509781?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sk6Z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sk6Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc111fed4-fc4e-434a-8457-857138c63bc9_2816x1536.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>&#128293; <strong>This Week's Big Insight</strong></h2><p><strong>The Problem:</strong> 73% of data teams monitor <em>when</em> things break, but have no visibility into <em>why</em> they break.</p><p><strong>The Solution:</strong> Orchestration-native observability (monitoring pipelines, not just warehouses).</p><p><strong>The Impact:</strong> Teams implementing this see 60% faster incident resolution on average.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Data Modernisation Journey is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><h2>&#128736;&#65039; <strong>5 Tools You Should Bookmark This Week</strong></h2><h3><strong>For Task-Level Monitoring:</strong></h3><ol><li><p><strong><a href="https://dagster.io/">Dagster</a></strong> - Asset-oriented orchestration with built-in observability</p><ul><li><p><em>Use case:</em> See exactly which pipeline task failed and why</p></li><li><p><em>Quick start:</em> Their 10-minute tutorial shows task-level visibility</p></li></ul></li><li><p><strong><a href="https://prefect.io/">Prefect</a></strong> - Modern workflow orchestration with detailed logging</p><ul><li><p><em>Use case:</em> Clear failure information and automated retries</p></li><li><p><em>Quick start:</em> Free tier for teams under 3 users</p></li></ul></li></ol><h3><strong>For Data Quality Monitoring:</strong></h3><ol start="3"><li><p><strong><a href="https://getmontecarlo.com/">Monte Carlo</a></strong> - Data observability platform focusing on the "5 pillars"</p><ul><li><p><em>Use case:</em> Automated anomaly detection across freshness, volume, and schema</p></li><li><p><em>Quick start:</em> They offer free data health assessments</p></li></ul></li><li><p><strong><a href="https://metaplane.dev/">Metaplane</a></strong> - Lightweight data monitoring with fast setup</p><ul><li><p><em>Use case:</em> Real-time alerts without heavy configuration</p></li><li><p><em>Quick start:</em> Connect in under 30 minutes</p></li></ul></li></ol><h3><strong>For Schema Change Detection:</strong></h3><ol start="5"><li><p><strong><a href="https://greatexpectations.io/">Great Expectations</a></strong> - Open source data validation framework</p><ul><li><p><em>Use case:</em> Catch schema drift before it breaks downstream</p></li><li><p><em>Quick start:</em> Their schema validation tutorial takes 15 minutes</p></li></ul></li></ol><div><hr></div><h2>&#128203; <strong>Copy This: 5-Point Readiness Checklist</strong></h2><p>Rate yourself 1-5 on each:</p><p>&#9745;&#65038; <strong>Task Visibility:</strong> Can you see individual pipeline task status in real-time?<br>&#9745;&#65038; <strong>Schema Monitoring:</strong> Do you catch schema changes before they break downstream?<br>&#9745;&#65038; <strong>Impact Analysis:</strong> Do you know what breaks when a pipeline fails?<br>&#9745;&#65038; <strong>Cost Attribution:</strong> Can you track compute costs per pipeline?<br>&#9745;&#65038; <strong>SLA Tracking:</strong> Do you monitor data freshness and completeness automatically?</p><p><strong>Score:</strong></p><ul><li><p><strong>20-25:</strong> You're in the top 10% of data teams</p></li><li><p><strong>15-19:</strong> Solid foundation, some gaps to fill</p></li><li><p><strong>10-14:</strong> Standard setup, big improvement opportunity</p></li><li><p><strong>Below 10:</strong> High risk zone - prioritise this</p></li></ul><div><hr></div><h2>&#127919; <strong>Quick Win: 15-Minute Task Monitoring Setup</strong></h2><p>If you're using Airflow, add this to any DAG for instant task-level visibility:</p><pre><code><code>import logging
from datetime import datetime, timedelta
from airflow.models.dag import DAG

def log_task_metrics(**context):
    """Production-ready callback for task monitoring"""
    try:
        task_instance = context["task_instance"]
        start_time = task_instance.start_date
        end_time = task_instance.end_date
        
        if start_time and end_time:
            duration = (end_time - start_time).total_seconds()
            logging.info(f"Task '{task_instance.task_id}' finished. "
                        f"State: {task_instance.state}. Duration: {duration:.2f}s")
        else:
            logging.warning(f"Task '{task_instance.task_id}' finished. "
                           f"State: {task_instance.state}. No timing data.")
    except Exception as e:
        logging.error(f"Monitoring callback error: {e}")

# Apply to ALL tasks in the DAG via default_args
default_args = {
    "owner": "data_team",
    "retries": 1,
    "retry_delay": timedelta(minutes=5),
    "on_success_callback": log_task_metrics,
    "on_failure_callback": log_task_metrics,
    "on_retry_callback": log_task_metrics,  # Track retries too!
}

# Now every task gets monitoring automatically
with DAG(
    dag_id="monitored_pipeline",
    start_date=datetime(2025, 8, 9),
    schedule="@daily",
    default_args=default_args,  # This is the magic line
) as dag:
    # All tasks inherit the monitoring callbacks
    extract_task = BashOperator(
        task_id="extract_data",
        bash_command="your_extract_script.sh"
    )</code></code></pre><p><strong>Implementation time:</strong> 15 minutes<br><strong>Value:</strong> Immediate visibility into task performance patterns</p><div><hr></div><h2>&#128218; <strong>This Week's Reads</strong></h2><p><strong>Must-read articles I bookmarked:</strong></p><ul><li><p><a href="https://netflixtechblog.com/lessons-from-building-observability-tools-at-netflix-7cfafed6ab17">Netflix's observability lessons</a> - How they built tools to monitor petabytes of data</p></li><li><p><a href="https://medium.com/airbnb-engineering/data-quality-score-the-next-chapter-of-data-quality-at-airbnb-851dccda19c3">Airbnb's data quality evolution</a> - Their innovative DQ Score approach</p></li><li><p><a href="https://engineering.atspotify.com/2024/04/data-platform-explained">Spotify's data platform architecture</a> - How they process 1.4 trillion data points daily</p></li></ul><p><strong>Tools/Resources:</strong></p><ul><li><p><a href="https://dataengineeringweekly.substack.com/">Data Engineering Weekly</a> - Best curated data engineering content</p></li><li><p><a href="https://roundup.getdbt.com/">dbt's Analytics Engineering Roundup</a> - Weekly analytics engineering insights</p></li></ul><div><hr></div><h2>&#128161; <strong>Industry Intel</strong></h2><p><strong>What's trending this week:</strong></p><ul><li><p><strong>Snowflake just launched enhanced pipeline monitoring</strong> with built-in telemetry and better Monte Carlo integration</p></li><li><p><strong>Monte Carlo raised $135M Series D</strong>, pushing valuation beyond $1.6B (validates the data observability market explosion)</p></li><li><p><strong>Major companies migrating from Airflow to Dagster</strong> specifically for observability features (names confidential, but momentum is real)</p></li></ul><p><strong>Who's hiring data engineers with observability skills:</strong></p><ul><li><p>Netflix (Senior Data Platform Engineers + Observability specialists)</p></li><li><p>Stripe (Data Infrastructure Engineers with platform reliability focus)</p></li><li><p>Databricks (Solutions Architects - observability expertise increasingly required even when not in job title)</p></li></ul><div><hr></div><h2>&#128640; <strong>Next Week Preview</strong></h2><p><strong>"The $500K Snowflake Bill: 3 Cost Optimisations That Cut Warehouse Spend by 40%"</strong></p><p>Including:</p><ul><li><p>The warehouse query that's probably costing you $10K/month</p></li><li><p>5-minute optimisation checklist</p></li><li><p>Cost monitoring dashboard template you can copy</p></li></ul><div><hr></div><h2>&#128172; <strong>Community Question</strong></h2><p><strong>This week:</strong> What's your biggest pipeline monitoring blind spot right now?</p><p>Reply and tell me - I'll feature the best insights in next week's issue (with your permission).</p><div><hr></div><h2>&#127873; <strong>Subscriber Exclusive</strong></h2><p><strong>Free resource:</strong> My "Orchestration Observability Setup Guide" (10-page Guide)</p><p><strong>Includes:</strong></p><ul><li><p>Tool comparison matrix with scoring framework</p></li><li><p>Implementation timeline template</p></li><li><p>Cost-benefit calculation spreadsheet</p></li><li><p>30+ monitoring queries you can copy-paste</p></li></ul><h4><em><strong>Newsletter subscribers only</strong></em> - Subscribe OR if you are already subscriber comment with &#8220;Observability&#8221;  and I will send you the link.</h4><div><hr></div><p><strong>Found this useful?</strong> Forward to a colleague who's tired of debugging data incidents at 2 AM.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share&quot;,&quot;text&quot;:&quot;Share Data Modernisation Journey&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share"><span>Share Data Modernisation Journey</span></a></p><p><strong>Want more like this?</strong> Hit reply and let me know what data engineering topics you want me to dive into next.</p><div><hr></div><h3><strong>That&#8217;s it for this week. If you found this helpful, leave a comment to let me know &#9994;</strong></h3><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/p/032-beyond-dashboards-how-orchestration/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/p/032-beyond-dashboards-how-orchestration/comments"><span>Leave a comment</span></a></p><h2><strong>About the Author</strong></h2><p>Khurram, founder of BigDataDig and a former Teradata Global Data Consultant, brings over 15 years of deep expertise in data integration and data processing. Leveraging this extensive background, he now specialises in organisational financial services, telecommunications, retail, and government sectors, implementing <strong>cutting-edge, AI-ready data solutions</strong>. His methodology prioritises value-driven implementations that effectively manage risk while ensuring that data is prepared, optimised, and advanced analytics.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Data Modernisation Journey is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[#26 - Key Insights from dbt CEO on AI and Data Engineering]]></title><description><![CDATA[Simple breakdown of the trends shaping data teams in 2025]]></description><link>https://blog.bigdatadig.com/p/26-key-insights-from-dbt-ceo-on-ai</link><guid isPermaLink="false">https://blog.bigdatadig.com/p/26-key-insights-from-dbt-ceo-on-ai</guid><dc:creator><![CDATA[Muhammad Khurram]]></dc:creator><pubDate>Tue, 01 Jul 2025 05:24:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!toWQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Read time:</em> 4 minutes.</p><p>Hi Data Modernisers,</p><p>I just listened to a great conversation between Tristan Handy (CEO of dbt Labs) and the team at a16z about where data engineering is headed.</p><p>Instead of the usual AI hype, Tristan shared some really practical insights about what's actually working in data teams right now.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!toWQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!toWQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!toWQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:928366,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.bigdatadig.com/i/166685404?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!toWQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!toWQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F577397ef-3187-49d4-837e-e6e694c33340_1920x1080.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Here's what I learned that might be useful for your team:</p><ul><li><p>How AI is changing data work (but not replacing it) </p></li><li><p>Why do some AI projects succeed while others create problems </p></li><li><p>What dbt is building for the future of data engineering</p></li></ul><p>Let me break down the key points in simple terms.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><h1>3 Key Ideas from the Conversation</h1><p>Based on the podcast discussion, here are the main points that stood out:</p><h2>1. AI Helps Data People Work Better (It Doesn't Replace Them)</h2><p><strong>What Tristan said:</strong> </p><ul><li><p>80% of data professionals now use AI in their daily work</p></li><li><p>AI is best at automating routine tasks, not making business decisions</p></li><li><p>Data teams are growing because AI creates more demand for quality data</p></li></ul><p><strong>What this means for your team:</strong></p><ul><li><p>Focus on AI tools that help your current team be more productive</p></li><li><p>Don't expect AI to replace the need for people who understand your business</p></li><li><p> The most valuable skill remains knowing what questions to ask and how to interpret the results.</p></li></ul><p><strong>Examples of what's working:</strong> </p><ul><li><p>AI is helping write SQL code that humans then review</p></li><li><p>Automated documentation generation</p></li><li><p>AI-assisted debugging of data pipeline failures</p></li><li><p>Tools that suggest optimisations for existing queries</p></li></ul><h2>2. The Success Factor: Human-in-the-Loop vs Human-out-of-the-Loop</h2><p><strong>Tristan's framework:</strong></p><ul><li><p><strong>Human-in-the-loop:</strong> AI generates something, and an experienced person reviews it</p></li><li><p><strong>Human-out-of-the-loop:</strong> AI gives answers directly to people who can't verify if they're correct</p></li></ul><p><strong>Why this matters:</strong> </p><ul><li><p>Most successful AI projects keep humans involved in validation</p></li><li><p>The dangerous projects are ones where non-technical users get AI results they can't check</p></li><li><p>As Tristan put it: "Without a human to verify the result, that's a very scary thing"</p></li></ul><p><strong>Questions to ask about your AI projects:</strong></p><ul><li><p>Who's checking if the AI output is correct?</p></li><li><p>Do they have the skills to catch mistakes?</p></li><li><p>What happens if the AI gives the wrong answer?</p></li></ul><p><strong>Real example from the conversation:</strong> </p><ul><li><p>dbt built an AI system that can write SQL to answer business questions</p></li><li><p>But it only works because it connects to their "semantic layer&#8221;- a system that defines exactly how your company measures things like revenue.</p></li><li><p>Without that context, AI just guesses at what you mean.</p></li></ul><h2>3. Data Engineering is Becoming More Like Software Engineering</h2><p><strong>What DBT is building:</strong></p><ul><li><p>They acquired a company called SDF that built a SQL compiler in Rust</p></li><li><p>This lets data engineers test their code locally instead of only in the cloud</p></li><li><p>It can translate between different SQL dialects automatically</p></li><li><p>They're adding features like automated refactoring and better error checking</p></li></ul><p><strong>Why this matters:</strong> </p><ul><li><p>Right now, data engineering is more complicated than it needs to be</p></li><li><p>You can't easily reuse code between different data platforms</p></li><li><p>Testing changes is slow and risky</p></li><li><p>The new tools will make data work more reliably and faster</p></li></ul><p><strong>What Tristan said:</strong> </p><ul><li><p>"Software engineering tool stack was maybe two decades ahead of data"</p></li><li><p>The goal is to give data engineers the same quality tools that software engineers have</p></li><li><p>This includes things like package management, version control, and local development</p></li></ul><p><strong>Changes coming:</strong></p><ul><li><p>Better debugging tools that can automatically find problems in data pipelines</p></li><li><p>Reusable components that work across different cloud platforms</p></li><li><p>Faster feedback loops when developing new data transformations</p></li><li><p>More standardised ways of building and sharing data logic</p></li></ul><div><hr></div><p><strong>Key takeaways:</strong></p><ul><li><p><strong>AI works best when it helps skilled people do more</strong> - Don't try to replace expertise, amplify it</p></li><li><p><strong>Keep humans involved in validating AI outputs</strong>, primarily when business decisions depend on the results</p></li><li><p><strong>Data engineering tools are continually improving</strong>. The next few years are expected to bring significant productivity improvements.</p></li></ul><p><strong>What to do next:</strong></p><ul><li><p>Listen to the full podcast if you want more details on any of these topics</p></li><li><p>Look at your current AI projects and ask if they're human-in-the-loop or human-out-of-the-loop</p></li><li><p>Consider how better development tools might help your data team work more efficiently</p></li></ul><p>The main message: AI isn't going to replace data teams, but it will change how they work. Companies that use it thoughtfully will have significant advantages.</p><div><hr></div><p>What did you think of this breakdown?</p><ul><li><p>Helpful summary?</p></li><li><p>Too basic?</p></li><li><p>Want more technical details?</p></li></ul><p>Let me know what would be most useful.</p><div><hr></div><p>PS... If you found this summary helpful, feel free to share it with your team. These kinds of industry insights are worth discussing.</p><p>And whenever you are ready, there are 3 ways I can help you:</p><ol><li><p><strong>Free Data Flow Audit</strong> - 60-minute deep-dive where we map your current data flows and identify exactly where chaos is killing your AI initiatives</p></li><li><p><strong>Modular Pipeline Migration</strong> - Complete rebuild from spaghetti scripts to dbt + Airflow architecture that your AI systems can actually depend on</p></li><li><p><strong>AI-Ready Data Platform</strong> - Full implementation of version-controlled, tested, modular data pipeline with real-time capabilities designed for production AI workloads</p></li></ol><div><hr></div><h3>That&#8217;s it for this week. If you found this helpful, leave a comment to let me know &#9994;</h3><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/p/26-key-insights-from-dbt-ceo-on-ai/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/p/26-key-insights-from-dbt-ceo-on-ai/comments"><span>Leave a comment</span></a></p><h2><strong>About the Author</strong></h2><p>Khurram, founder of BigDataDig and a former Teradata Global Data Consultant, brings over 15 years of deep expertise in data integration and robust data processing. Leveraging this extensive background, he now specialises in helping organisations in the financial services, telecommunications, retail, and government sectors implement&nbsp;<strong>cutting-edge, AI-ready data solutions</strong>. His methodology prioritises pragmatic, value-driven implementations that effectively manage risk while ensuring that data is prepared and optimised for AI and advanced analytics.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[#17 - Serverless is Eating ETL: How AWS Glue Leads the Data Integration Shift in 2025]]></title><description><![CDATA[The server-less revolution is transforming ETL as we know it... here's why AWS Glue is at the forefront of this change.]]></description><link>https://blog.bigdatadig.com/p/17-serverless-is-eating-etl-how-aws</link><guid isPermaLink="false">https://blog.bigdatadig.com/p/17-serverless-is-eating-etl-how-aws</guid><dc:creator><![CDATA[Muhammad Khurram]]></dc:creator><pubDate>Mon, 28 Apr 2025 23:42:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!8EyF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hello Data Modernizers,</p><p>I've spent over 15 years in the trenches of data modernization, and I'm observing something fascinating: serverless is absolutely consuming the ETL world.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8EyF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8EyF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8EyF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:371590,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.bigdatadig.com/i/154566265?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8EyF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!8EyF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8712443-019e-4ff2-b202-d5e737c35ab9_1920x1080.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I appreciate elegant SQL and thoughtful migrations, yet the developments in serverless ETL for 2025 are extraordinary. After exploring numerous intricate migrations and observing the emerging patterns across various sectors, I'm identifying clear trends:</p><ul><li><p><strong>Conventional ETL tools are becoming less popular</strong>&nbsp;as new initiatives predominantly prefer serverless solutions.</p></li><li><p><strong>The adoption of AWS Glue is rapidly increasing</strong>, particularly among mid-market companies seeking greater agility.</p></li><li><p><strong>The industry is clearly moving</strong>&nbsp;towards serverless-first strategies for data integration.</p></li><li><p><strong>Organizations are moving away from the legacy middleware</strong>&nbsp;they have depended on for years.</p></li><li><p><strong>Even historically conservative industries</strong>&nbsp;such as finance and government are embracing change.</p></li></ul><p>Today, I'm exploring how AWS Glue is spearheading the serverless integration revolution and its implications for your 2025 data strategy:</p><ul><li><p>Why the transition to serverless ETL has become essential for competitive businesses.</p></li><li><p>How AWS Glue's maturity has created a compelling leadership position</p></li><li><p>The 2025 data integration landscape for innovative organizations.</p></li></ul><p>Let's dive in.</p><div><hr></div><h1>3 Reasons Serverless is Eating Traditional ETL in 2025</h1><p>The transformation of data integration through serverless technology is not merely a trend; it's a profound shift in how organizations handle ETL. As we approach 2025, here are the reasons AWS Glue is at the forefront of this movement:</p><h2>1. Serverless Economics are Reducing Traditional ETL Expenses</h2><p>The cost dynamics of serverless ETL have reached an essential threshold that cannot be overlooked. Traditional ETL solutions now serve as a financial burden on IT budgets.</p><p>AWS Glue's serverless approach excels on multiple fronts:</p><ul><li><p><strong>Pricing by the second</strong>&nbsp;is taking over costly annual licensing models.</p></li><li><p><strong>Infrastructure expenses have drastically decreased,</strong>&nbsp;eliminating the need to maintain idle capacity.</p></li><li><p><strong>The overall ownership cost dropped by 60%</strong>&nbsp;in my recent projects.</p></li><li><p><strong>Capital expenses are transformed into operational expenses</strong>&nbsp;to enhance budgeting.</p></li><li><p><strong>Elastic scaling removes the overprovisioning</strong>&nbsp;issues found in traditional ETL.</p></li></ul><h2>2. The Democratization of Enterprise-Level Data Integration</h2><p>By 2025, the technical barriers that previously distinguished enterprise ETL capabilities from mid-market organizations will have entirely diminished.</p><p>AWS Glue has dramatically advanced its visual development methodology:</p><ul><li><p><strong>Non-technical users are creating intricate ETL workflows</strong>&nbsp;without the need for coding.</p></li><li><p><strong>Domain experts execute business logic smoothly,</strong>&nbsp;without bottlenecks from developers.</p></li><li><p><strong>Visual data lineage and impact analysis</strong>&nbsp;compete with conventional enterprise tools.</p></li><li><p><strong>Templates and patterns speed up implementation</strong>&nbsp;by 5 to 7 times.</p></li><li><p><strong>The learning curve has significantly flattened due to an</strong>&nbsp;enhanced user experience.</p></li></ul><h2>3. AI-Powered Automation is Transforming Possibilities</h2><p>Incorporating AI features into AWS Glue marks the most transformative development in ETL since its inception.</p><p>By 2025, these capabilities have matured significantly, transforming possibilities.</p><ul><li><p><strong>Generative AI helps enhance ETL development</strong>&nbsp;by recommending the best transformations.</p></li><li><p><strong>Automated data quality rules</strong>&nbsp;automatically identify and correct anomalies without human input.</p></li><li><p><strong>Self-optimizing job performance</strong>&nbsp;enhances automatically, eliminating the need for manual adjustments.</p></li><li><p><strong>Natural language data transformation</strong>&nbsp;enables business users to articulate modifications.</p></li><li><p><strong>Automated record-keeping and lineage monitoring</strong>&nbsp;lower governance burdens.</p></li></ul><div><hr></div><p>That's it.</p><p>Here's what you learned today:</p><ul><li><p><strong>In 2025, serverless technology will have significantly transformed the ETL market.</strong></p></li><li><p><strong>AWS Glue drives this change</strong>&nbsp;with robust, enterprise-ready features.</p></li><li><p><strong>The economic benefits far surpass those of</strong>&nbsp;conventional methods.</p></li><li><p><strong>Democratization has removed technical obstacles</strong>&nbsp;for non-experts.</p></li><li><p><strong>AI automation signifies a new chapter in the</strong>&nbsp;evolution of data integration.</p></li></ul><p>In 2025, organizations thriving with data won't be those holding onto outdated ETL tools; they'll be adopting serverless integration platforms such as AWS Glue, which fit seamlessly with contemporary data needs.</p><div><hr></div><p>PS&#8230; If you're benefiting from these insights on the serverless data revolution, consider sharing this edition with a colleague struggling with traditional ETL issues. For every five referrals, I'll provide exclusive access to my 2025 AWS Glue Performance Optimization Playbook.</p><h3>That&#8217;s it for this week. If you found this helpful, leave a comment to let me know &#9994;</h3><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/p/17-serverless-is-eating-etl-how-aws/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/p/17-serverless-is-eating-etl-how-aws/comments"><span>Leave a comment</span></a></p><h2><strong>About the Author</strong></h2><p>With 15+ years of experience implementing data integration solutions across financial services, telecommunications, retail, and government sectors, I've helped dozens of organizations implement robust ETL processing. My approach emphasizes pragmatic implementations that deliver business value while effectively managing risk.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[#014 - 5 Steps To Choose Between ETL/ELT With Confidence Even if You're Not a Data Architect]]></title><description><![CDATA[Stop following outdated rules in your data migration. Here's how to choose what actually works for YOUR business...]]></description><link>https://blog.bigdatadig.com/p/014-5-steps-to-choose-between-etlelt</link><guid isPermaLink="false">https://blog.bigdatadig.com/p/014-5-steps-to-choose-between-etlelt</guid><dc:creator><![CDATA[Muhammad Khurram]]></dc:creator><pubDate>Tue, 08 Apr 2025 08:58:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sMap!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hey there,</p><p>Welcome to this week's edition, where we tackle one of the most challenging decisions in modern data architecture: ETL vs. ELT? This is an ongoing debate among data leaders, and most organizations waste millions on the wrong data transformation approach. The debate has become more confusing than helpful.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>Introduction</h2><p>According to a 2023 IDC survey on Data Integration Strategies, 68% of organizations reported making ETL/ELT architectural decisions without thoroughly evaluating their specific workload needs. The same study found that these organizations were 2.5x more likely to experience budget overruns and performance issues. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sMap!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sMap!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 424w, https://substackcdn.com/image/fetch/$s_!sMap!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 848w, https://substackcdn.com/image/fetch/$s_!sMap!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 1272w, https://substackcdn.com/image/fetch/$s_!sMap!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sMap!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png" width="806" height="638" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:638,&quot;width&quot;:806,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73013,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.bigdatadig.com/i/160745979?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sMap!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 424w, https://substackcdn.com/image/fetch/$s_!sMap!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 848w, https://substackcdn.com/image/fetch/$s_!sMap!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 1272w, https://substackcdn.com/image/fetch/$s_!sMap!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a01b7d9-c1e4-42a1-a007-806e82a8969d_806x638.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The stakes are high&#8212;choosing without a proper evaluation framework often results in costly rework, extended timelines, and team frustration.</p><p>Let's cut through the noise with research-backed guidance:</p><ul><li><p>A practical framework based on industry benchmarks</p></li><li><p>A simple decision tree validated by successful projects</p></li><li><p>The hidden costs that vendors don't talk about</p></li></ul><p>Ready to transform your approach to data transformation?</p><p>Let&#8217;s dive in.</p><div><hr></div><h1>5 Steps To Choose Between ETL/ELT With Confidence Even if You're Not a Data Architect</h1><p>To achieve a successful migration and data transformation strategy, you'll need a systematic approach based on verifiable evidence rather than following technology trends.</p><p>Based on aggregated insights from industry analysts and successful enterprise migrations, here's the structured framework that consistently delivers better outcomes:</p><h1>1. Define Your Migration Goals &amp; Constraints</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bxu4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bxu4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 424w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 848w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 1272w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bxu4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png" width="986" height="608" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:608,&quot;width&quot;:986,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bxu4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 424w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 848w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 1272w, https://substackcdn.com/image/fetch/$s_!bxu4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d1348e2-1c4b-40cf-afb2-69dc27e418ab_986x608.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Start by clearly articulating what success looks like for your data modernization project. List your objectives in order of priority:</p><ul><li><p><strong>Operational cost reduction</strong> &#8594; Favors ELT if your target platform is cloud-based with optimized pricing</p></li><li><p><strong>Real-time analytics capability</strong> &#8594; Often better with ELT due to faster raw data availability</p></li><li><p><strong>Enhanced data quality</strong> &#8594; May lean toward ETL for upfront cleansing and validation</p></li><li><p><strong>Regulatory compliance</strong> &#8594; Usually better with ETL for audit trails and controlled transformations</p></li><li><p><strong>Increased business agility</strong> &#8594; Typically favors ELT for faster iterations and explorations</p></li></ul><p><strong>Industry Insight:</strong> Financial services organizations prioritize compliance and data quality over cost reduction, often favoring ETL in regulatory-sensitive workflows. Meanwhile, retail and e-commerce businesses prioritize speed-to-insight, favoring ELT approaches for customer analytics.</p><p><strong>Actionable Example:</strong> A financial services client created this simple prioritization matrix:</p><ul><li><p>Priority 1: Compliance with GDPR and financial regulations</p></li><li><p>Priority 2: Improved data quality for customer reporting</p></li><li><p>Priority 3: Reduced operational costs</p></li><li><p>Priority 4: Faster analytics capabilities</p></li></ul><p><strong>Quick Action Item:</strong> Create a simple table with your priorities (1-5) and constraints (budget, timeline, team size) to reference during architecture discussions.</p><h1>2. Map Your Data Landscape Honestly</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4WgA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4WgA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 424w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 848w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 1272w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4WgA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png" width="794" height="686" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:686,&quot;width&quot;:794,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4WgA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 424w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 848w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 1272w, https://substackcdn.com/image/fetch/$s_!4WgA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F503dda06-c681-44f6-87d3-a1e089ae9753_794x686.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Create an inventory of your data sources and transformation needs with these critical metrics:</p><ul><li><p><strong>Data volume per source</strong> &#8594; Flag anything over 5TB as potentially problematic for pure ELT</p></li><li><p><strong>Data change frequency</strong> &#8594; Real-time/streaming sources may need specialized ETL pipelines</p></li><li><p><strong>Current data quality score</strong> &#8594; Sources with &lt;80% quality scores usually need ETL preprocessing</p></li><li><p><strong>Transformation complexity</strong> &#8594; Count steps required for each significant transformation</p></li><li><p><strong>Target platform limits</strong> &#8594; Document compute/storage costs and scalability boundaries</p></li></ul><p><strong>Industry Insight:</strong> Organizations processing over 10TB daily often experience significantly higher cloud computing costs when using pure ELT approaches versus hybrid ETL/ELT architectures. Telecommunications companies typically show the most dramatic difference, with pure ELT approaches costing substantially more for call detail record processing due to the high transformation complexity.</p><p><strong>Quick Action Item:</strong> Identify your 3 largest data sources and run a 1-day sample through your ETL tool and target system to compare processing times and costs.</p><h1>3. Evaluate Your Transformation Complexity</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kwJ_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kwJ_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 424w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 848w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 1272w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kwJ_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png" width="788" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:788,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kwJ_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 424w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 848w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 1272w, https://substackcdn.com/image/fetch/$s_!kwJ_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d140147-df46-432c-86ba-fd477cbdaf84_788x428.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Catalog your transformation requirements and categorize each into these complexity tiers:</p><ul><li><p><strong>Simple transforms</strong> (ELT-friendly)</p><ul><li><p>Basic filtering (WHERE clauses)</p></li><li><p>Column renaming and basic type conversion</p></li><li><p>Simple calculations (currency conversion, etc.)</p></li><li><p>Direct 1:1 mappings</p></li></ul></li><li><p><strong>Moderate transforms</strong> (ELT-possible but evaluate)</p><ul><li><p>Aggregation and grouping operations</p></li><li><p>Basic joins between 2-3 tables</p></li><li><p>Window functions and time-based calculations</p></li><li><p>Data quality checks and simple validations</p></li></ul></li><li><p><strong>Complex transforms</strong> (Usually ETL-better)</p><ul><li><p>Multi-step data cleansing routines</p></li><li><p>Machine learning feature engineering</p></li><li><p>Fuzzy matching or record linkage</p></li><li><p>Complex business rules with multiple dependencies</p></li><li><p>Slowly changing dimensions (Type 2+)</p></li></ul></li></ul><p><strong>Industry Insight:</strong> Complex transformations in ELT pipelines frequently consume substantially more compute resources than equivalent ETL processes for datasets over 1TB. Particularly notable is that slowly changing dimension (SCD) Type 2 operations on large customer tables are significantly more expensive when performed in-database than in traditional ETL tools. However, simple and moderate transformations typically show minimal cost differences and faster development cycles in ELT implementations.</p><p><strong>Quick Action Item:</strong> Use this complexity scale to rate your top 10 most frequently used data transformations. If more than 50% are complex, consider retaining ETL for those specific workflows.</p><h1>4. Assess Your Team's Readiness</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CnQX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CnQX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 424w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 848w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 1272w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CnQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png" width="782" height="615" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:615,&quot;width&quot;:782,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CnQX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 424w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 848w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 1272w, https://substackcdn.com/image/fetch/$s_!CnQX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60f7d53f-e39b-4246-bc8d-727e5226bbcc_782x615.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Conduct an honest skills assessment of your team across these critical dimensions:</p><ul><li><p><strong>Technical Skills Inventory</strong></p><ul><li><p>ETL tool proficiency (Informatica, SSIS, etc.)</p></li><li><p>SQL expertise level (basic, intermediate, advanced)</p></li><li><p>Cloud platform experience (AWS, Azure, GCP)</p></li><li><p>Modern data stack familiarity (dbt, Fivetran, Airflow, etc.)</p></li><li><p>Programming skills (Python, Scala, Java)</p></li></ul></li><li><p><strong>Working Style Assessment</strong></p><ul><li><p>Comfort with agile, iterative approaches (ELT-friendly)</p></li><li><p>Experience with test-driven development</p></li><li><p>Preference for visual tools vs. code-based solutions</p></li><li><p>Adaptability to new technologies and frameworks</p></li></ul></li><li><p><strong>Readiness Gap Analysis</strong></p><ul><li><p>Identify the 3 most significant skills gaps for your desired architecture</p></li><li><p>Determine training requirements vs. hiring needs</p></li><li><p>Estimate the ramp-up time for new technologies</p></li></ul></li></ul><p><strong>Industry Insight:</strong> Organizations that assess team readiness before migration are much more likely to meet project timelines. Banking teams with strong traditional ETL backgrounds typically need several months to become proficient with modern ELT tools like dbt. Interestingly, teams with strong SQL skills but limited ETL experience tend to adapt to ELT approaches in just 6-8 weeks. Companies that implement structured upskilling programs generally report higher satisfaction with migration outcomes versus those that rely solely on hiring new talent.</p><p><strong>Quick Action Item:</strong> Ask team members to self-assess their comfort level with ETL and modern ELT tools on a scale of 1-5. Use the results to identify potential champions and areas needing focused training.</p><h1>5. Calculate Total Cost of Ownership</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MWBh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MWBh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 424w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 848w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 1272w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MWBh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png" width="674" height="602" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8952d707-251c-492b-a245-d36d69f55a23_674x602.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:602,&quot;width&quot;:674,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MWBh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 424w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 848w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 1272w, https://substackcdn.com/image/fetch/$s_!MWBh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8952d707-251c-492b-a245-d36d69f55a23_674x602.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Build a comprehensive TCO model comparing ETL vs. ELT approaches with these specific line items:</p><ul><li><p><strong>Direct Infrastructure Costs</strong></p><ul><li><p>ETL server costs (on-prem) or compute resources (Cloud)</p></li><li><p>Data storage costs (staging areas, final destinations)</p></li><li><p>Network egress chaCloudfor data movement</p></li><li><p>Scaling costs during peak processing periods</p></li></ul></li><li><p><strong>Software and Tooling Costs</strong></p><ul><li><p>ETL tool licensing (often per-core or per-server)</p></li><li><p>ELT tool subscriptions (often per-user or consumption-based)</p></li><li><p>Monitoring and observability solutions</p></li><li><p>Version control and deployment tools</p></li></ul></li><li><p><strong>Human Resource Costs</strong></p><ul><li><p>Development time (hours per pipeline &#215; hourly rate)</p></li><li><p>Ongoing maintenance (hours per month &#215; hourly rate)</p></li><li><p>Training and upskilling costs</p></li><li><p>Support and operations staffing</p></li></ul></li><li><p><strong>Hidden Costs</strong></p><ul><li><p>Data quality remediation costs</p></li><li><p>Business impact of delayed insights</p></li><li><p>Compliance and governance overhead</p></li><li><p>Technical debt accumulation</p></li></ul></li></ul><p><strong>Industry Insight:</strong> Three three-year TCO comparisons show varied industry results for enterprises processing daily 5-15TB of data. Financial services organizations with strict compliance requirements often see better economics with ETL approaches, while retail and media companies typically benefit from ELT approaches. Notably, hybrid approaches frequently yield the lowest overall TCO across all sectors when workloads are optimally distributed based on transformation complexity.</p><p><strong>Quick Action Item:</strong> Run a 2-week sample of your most significant data flow through both approaches and multiply the resource consumption by 26 to get an annual estimate. Add this to your licensing costs for a quick TCO comparison.</p><p>That's it.</p><p>Here's what you learned today:</p><ul><li><p>The ETL vs. ELT decision isn't one-size-fits-all&#8212;it requires a systematic evaluation of your specific context</p></li><li><p>Your migration goals, data landscape, transformation complexity, team skills, and total costs all factor into the optimal approach</p></li><li><p>A hybrid strategy combining elements of both approaches is often the most practical solution.</p></li></ul><h2>Decision Tree Cheat Sheet</h2><p>I promised you a decision tree, so here's a simplified version you can apply immediately:</p><p><strong>1. Need Super Clean Data </strong><em><strong>Before</strong></em><strong> Loading (e.g., compliance)?</strong></p><ul><li><p>YES &#8594; Start with ETL for that data.</p></li><li><p>NO &#8594; Keep going.</p></li></ul><p><strong>2. Got a Super Fast, Powerful Target Platform (like the Cloud)?</strong></p><ul><li><p>YES &#8594; ELT could be a great fit!</p></li><li><p>NO &#8594; Think more about ETL or a mix.</p></li></ul><p><strong>3. Do you Let Your Data Team Explore and play with the Data Easily?</strong></p><ul><li><p>YES &#8594; ELT helps with that!</p></li><li><p>NO &#8594; Either way works.</p></li></ul><p><strong>4. Is Your Team Really Good with SQL and Cloud Tools?</strong></p><ul><li><p>YES &#8594; ELT might be smoother.</p></li><li><p>NO &#8594; Stick with what you know (ETL) while learning new skills.</p></li></ul><p><strong>5. Dealing with HUGE amounts of data </strong><em><strong>and</strong></em><strong> really tricky transformations?</strong></p><ul><li><p>YES &#8594; Maybe use ETL for those tricky parts.</p></li><li><p>NO &#8594; Keep going!</p></li></ul><p><strong>Bottom Line:</strong></p><ul><li><p><strong>Mostly "YES" to ELT questions?</strong> You can probably lean toward ELT!</p></li><li><p><strong>Mostly "YES" to ETL questions?</strong> ETL might be your sweet spot.</p></li><li><p><strong>Mix of answers?</strong> A hybrid approach (using both) is likely the best way to go!</p></li></ul><p>Industry observations consistently show that organizations following a structured decision framework are much more likely to meet or exceed their data modernization ROI targets than those making technology-driven decisions. Most successful migrations implement hybrid approaches that optimize for specific workload characteristics rather than forcing all data through a single pattern.</p><h3>That&#8217;s it for this week. If you found this helpful, leave a comment to let me know. &#9994;</h3><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/p/014-5-steps-to-choose-between-etlelt/comments&quot;,&quot;text&quot;:&quot;Leave a comment&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.bigdatadig.com/p/014-5-steps-to-choose-between-etlelt/comments"><span>Leave a comment</span></a></p><div><hr></div><p>PS...If you're enjoying Data Transformation Insights, please refer this edition to a colleague struggling with their modernization strategy. They'll thank you when they save thousands on their migration project.</p><h2>About the Author</h2><p>With 15+ years of experience implementing data integration solutions across financial services, telecommunications, retail, and government sectors, I've helped dozens of organizations implement robust ETL processing. My approach emphasizes pragmatic implementations that deliver business value while effectively managing risk.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.bigdatadig.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">The Data Modernisation Playbook is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>