Streaming Foundations II: Python Consumers and PostgreSQL
Goal: Write a Python Consumer to read and deserialize Kafka messages. Set up a target …
Streaming Foundations I: Redpanda and Python Producers
Goal: Understand the fundamentals of message brokers, use Redpanda to simplify Kafka …
Running Spark in the Cloud: GCS, Standalone Clusters, and Dataproc
Goal: Move from local Spark development to cloud execution — connecting to Google Cloud …
Spark Internals: Clusters, Shuffles, Joins, and RDDs
Goal: Understand how Spark executes jobs across a cluster, how operations like GROUP BY …
Spark SQL: Running SQL Queries on DataFrames
Goal: Learn how to combine multiple datasets in Spark, register DataFrames as temporary …
Getting Started with PySpark: Sessions, DataFrames, and Transformations
Goal: Set up a PySpark environment, read data into Spark DataFrames, understand …
Batch Processing Fundamentals & Introduction to Apache Spark
Goal: Understand what batch processing is, how it compares to streaming, and why Apache …
From API to Warehouse in Minutes: Building a dlt Pipeline with AI
Goal: Build a complete, working data pipeline — from the Open Library API to a local …
Understanding dlt: The Modern Python Library for Data Ingestion
Goal: Understand what dlt (data load tool) is, why it exists, and how its core Extract → …
Advanced Bruin Workflows: AI Agents & Cloud Deployment
Goal: Explore advanced Bruin capabilities by integrating AI agents via the Model Context …