Clinic Records 42
Digital Supplements for a Healthy Mind
The Midnight Job: Why Batch Processing Has an Expiry Date
Note
DIAGNOSTIC BRIEF: Scheduled batch jobs are the drip-feed IV of the data world — …
Terraform Hardening Protocol: 5 Safeguards That Could Have Saved Everything
Important
PRE-OPERATIVE CHECKLIST INITIATED: Before any infrastructure surgery, verify all …
The State File Incident: One Missing File, One Destroyed Database
Caution
CRITICAL SYSTEM FAILURE: Patient vitals flatlined. Root cause: state amnesia …
PyFlink II: Windows, Watermarks, and Late Events
Goal: Understand and apply the core mechanisms of Flink stream processing—Windows and …
PyFlink I: Architecture, Checkpoints, and Pass-Through Jobs
Goal: Understand Flink’s internal architecture (JobManager and TaskManager). Build …
Streaming Foundations II: Python Consumers and PostgreSQL
Goal: Write a Python Consumer to read and deserialize Kafka messages. Setup a target …
Streaming Foundations I: Redpanda and Python Producers
Goal: Understand the fundamentals of message brokers, use Redpanda to simplify Kafka …
Running Spark in the Cloud: GCS, Standalone Clusters, and Dataproc
Goal: Move from local Spark development to cloud execution — connecting to Google Cloud …
Spark Internals: Clusters, Shuffles, Joins, and RDDs
Goal: Understand how Spark executes jobs across a cluster, how operations like GROUP BY …
Spark SQL: Running SQL Queries on DataFrames
Goal: Learn how to combine multiple datasets in Spark, register DataFrames as temporary …