Clinic Records
Digital Supplements for a Healthy Mind
The State File Incident: One Missing File, One Destroyed Database
Caution
CRITICAL SYSTEM FAILURE: Patient vitals flatlined. Root cause: state amnesia …
PyFlink II: Windows, Watermarks, and Late Events
Goal: Understand and apply the core mechanisms of Flink stream processing—Windows and …
PyFlink I: Architecture, Checkpoints, and Pass-Through Jobs
Goal: Understand Flink’s internal architecture (JobManager and TaskManager). Build …
Streaming Foundations II: Python Consumers and PostgreSQL
Goal: Write a Python Consumer to read and deserialize Kafka messages. Set up a target …
Streaming Foundations I: Redpanda and Python Producers
Goal: Understand the fundamentals of message brokers, use Redpanda to simplify Kafka …
Running Spark in the Cloud: GCS, Standalone Clusters, and Dataproc
Goal: Move from local Spark development to cloud execution — connecting to Google Cloud …
Spark Internals: Clusters, Shuffles, Joins, and RDDs
Goal: Understand how Spark executes jobs across a cluster, how operations like GROUP BY …
Spark SQL: Running SQL Queries on DataFrames
Goal: Learn how to combine multiple datasets in Spark, register DataFrames as temporary …
Getting Started with PySpark: Sessions, DataFrames, and Transformations
Goal: Set up a PySpark environment, read data into Spark DataFrames, understand …
Batch Processing Fundamentals & Introduction to Apache Spark
Goal: Understand what batch processing is, how it compares to streaming, and why Apache …