How I combined Apache Flink, Redpanda, Airflow, dbt Cloud, and Grafana to track Bitcoin, Ethereum, Solana, BNB, and Cardano in real time — all running on Google Cloud for less than a cup of coffee per month.
If you've been learning data engineering...
The database world has always needed a reliable bridge between operational systems and analytical or AI workloads. For decades, Oracle GoldenGate has served that role — silently moving terabytes of transactional changes across databases, clouds, and ...
Introduction
Global sovereign debt is one of the most consequential datasets in existence. It shapes foreign policy, determines credit ratings, drives IMF bailout decisions, and affects the daily lives of billions of people in developing countries....
Oracle's Exadata platform has always been synonymous with extreme database performance. But with the release of Exadata System Software 24ai and 25ai, alongside the debut of Oracle Exadata X11M in January 2025, Oracle has taken a decisive step into t...
Just completed my DEZ Zoomcamp 2026 capstone project — the Tech Ecosystem Observatory
Built a full cloud-native batch data pipeline from scratch that answers: which industries are shedding the most jobs, and how does that correlate with YC startup a...
Week 7 of Data Engineering Zoomcamp by @DataTalksClub complete
Just finished Module 7 - Streaming with PyFlink. Learned how to:
Set up Redpanda as a Kafka replacement
Build Kafka producers and consumers in Python
Create tumbling and session windo...
Oracle Database 23ai's AI Vector Search provides multiple strategies for finding similar vectors, each with different trade-offs between accuracy, speed, and resource usage. Understanding when to use exact search, approximate search, or multi-vector ...
Oracle Database 23ai introduces native vector capabilities that enable semantic search directly within SQL. Understanding how to create vectors, calculate distances, and choose appropriate metrics is fundamental to building effective AI-powered appli...
Week 6 of Data Engineering Zoomcamp by @DataTalksClub complete
Just finished Module 6 - Batch Processing with Spark. Learned how to:
✅ Set up PySpark and create Spark sessions
✅ Read and process Parquet files at scale
✅ Repartition data for opti...
dlt Workshop of Data Engineering Zoomcamp by @DataTalksClub complete
Just finished the Data Ingestion workshop with @dltHub. Learned how to:
✅ Build REST API data pipelines with dlt
✅ Use AI-assisted development with dlt MCP Server
✅ Load paginated...
Oracle Database 23ai introduces the VECTOR data type, enabling you to store AI embeddings alongside traditional business data. Understanding how to perform Data Manipulation Language DML and Data Definition Language DDL operations on vector columns i...
Week 5 of Data Engineering Zoomcamp by @DataTalksClub complete
Just finished Module 5 - Data Platforms with Bruin. Learned how to:
✅ Build end-to-end ELT pipelines with Bruin
✅ Configure environments and connections
✅ Use materialization strategies...
Oracle Database 23ai's AI Vector Search enables powerful semantic search capabilities, but to truly harness its potential, you need to understand how to query vector data efficiently and when to use vector indexes. This guide explores querying vector...
Week 4 of Data Engineering Zoomcamp by @DataTalksClub complete
Just finished Module 4 - Analytics Engineering with dbt. Learned how to:
✅ Build transformation models with dbt
✅ Create staging, intermediate, and fact tables
✅ Write tests to ensure d...
Week 3 of Data Engineering Zoomcamp by @DataTalksClub complete!
Just finished Module 3 - Data Warehousing with BigQuery. Learned how to:
Create external tables from GCS bucket data
Build materialized tables in BigQuery
Partition and cluster tables...
Oracle AI Vector Search Workflow: From Data to Semantic Search
Oracle Database 23ai introduces AI Vector Search, enabling semantic search capabilities directly within the database. Unlike traditional keyword-based searches, vector search understand...
Semantic search represents a fundamental shift in how we retrieve information from databases and search engines. Unlike traditional keyword-based search that relies on exact text matches, semantic search understands the meaning and context behind que...
Just completed Module 2 of the Data Engineering Zoomcamp 2026. Built production-ready data pipelines using Kestra to process 26 million NYC taxi trip records.
What I accomplished:
Orchestrated ETL workflows with Kestra
Ingested data from GitHub to G...
Oracle Database 23ai formerly 23c represents a significant leap forward in integrating artificial intelligence capabilities directly into the database. One of its most groundbreaking features is AI Vector Search, which enables the database to underst...
Oracle Cloud Infrastructure OCI Generative AI Agents represents a paradigm shift in building AI applications—enabling enterprises to deploy production-grade RAG Retrieval-Augmented Generation systems without writing a single line of code. This compre...