tuni56

@tuni56

Rocio Baigorria

Data Engineer | AWS & Data Platforms | Co-leader AWS Girls Argentina
Buenos Aires, Argentina dxaokewn60u4i.cloudfront.net Joined March 2026
2k Points23 Badges11 Connections11 Followers16 Following

About

Pivoted from Industrial Engineering to Data. I’ve traded factory floors for well-tuned clusters. Technical Rebel dedicated to turning "messy flows" into elegant, event-driven architectures✨

Top Skills

AWSEvent-drivenKafkaSQL

Experience

Data Engineer

Independent Data Engineer Consultant

Data Engineer specializing in AWS-based data platforms, event-driven architectures, and cost-efficient analytics infrastructure for SMEs and distributed teams.
Key impact:

• Designed streaming pipelines using Kinesis and AWS Glue Streaming ETL, reducing data freshness from hours to minutes.
• Built centralized S3 data lakes integrating multiple sources through Glue and Athena for unified analytics.
• Reduced pipeline latency through Spark job tuning, optimized partitioning, and columnar storage strategies.
• Implemented event-driven architectures using Java, Spring Boot, and Apache Kafka for real-time data exchange.
• Improved pipeline reliability through monitoring and alerting with CloudWatch, Prometheus, and Grafana.
• Implemented infrastructure as code with Terraform to standardize ingestion layers and deployment workflows.
• Automated analytics pipelines reducing manual reporting effort by ~85%.
• Designed serverless architectures prioritizing low operational cost and maintainability.

Business Analyst

Electromecánica Bomeq SA

Business-facing analytical role focused on cost optimization, operational efficiency, and decision support.

Key impact:

Conducted root-cause analysis on operational and maintenance costs, reducing annual budgets by 15–20%.

Designed KPI dashboards for operational decision-making, reducing waste by USD 12K+ per quarter.

Managed the full customer lifecycle for 50+ SME accounts, achieving 95%+ retention.

Worked cross-functionally with engineering, procurement, and finance teams to align technical decisions with profitability.

Education

Universidad Tecnologica Nacional

Industrial Engineer, Engineering

Projects

Real-Time Event-Driven Data Pipeline

Visit Project open_in_new

Real-Time Stream Processing
Kafka Streams: Complex event processing with windowing and aggregations
Exactly-Once Semantics: Idempotent producers with transactional guarantees
Schema Evolution: Avro schemas with backward/forward compatibility
Fraud Detection: Real-time anomaly detection using sliding windows
???? Production-Grade Observability
Prometheus Metrics: Custom business and technical metrics
Grafana Dashboards: Real-time visualization and alerting
Circuit Breakers: Resilience4j for fault tolerance
Distributed Tracing: Request correlation across services
⚡ High-Performance Optimizations
Batch Processing: Optimized producer batching (32KB, 10ms linger)
Compression: Snappy compression for 40% bandwidth reduction
Parallel Consumers: Multi-threaded processing with manual acknowledgment
Connection Pooling: Optimized database and Redis connections
????️ Enterprise Security & Reliability
Health Checks: Comprehensive service health monitoring
Graceful Degradation: Circuit breaker patterns with fallbacks
Data Validation: Schema registry enforcement
Audit Logging: Structured logging with correlation IDs

Ecommerce Data Warehouse

Visit Project open_in_new

A production-grade data warehouse implementation for ecommerce analytics, built on AWS using modern data engineering practices. This project demonstrates end-to-end data pipeline design, from raw ingestion to analytics-ready dimensional models.

Business Value
For C-Level:

Single source of truth for ecommerce metrics (revenue, customer lifetime value, product performance)
Serverless architecture reduces operational overhead and scales automatically with demand
Cost-optimized design with pay-per-query pricing model
Historical tracking enables trend analysis and forecasting
For Technical Teams:

Medallion architecture (Bronze → Silver → Gold) ensures data quality and lineage
Infrastructure as Code enables reproducible deployments across environments
Incremental processing minimizes compute costs and latency
Star schema design optimized for BI tool performance

Licenses & Certifications

Data Streaming Engineer Foundations

Confluent
Credential ID 177247999
Show Credential open_in_new

Language & Tools

- Data Engineering & Cloud:
AWS: S3, Glue, Lambda, Athena, Redshift, Step Functions.
- Infrastructure as Code: Terraform (the only way to build).
- Streaming & Event-Driven: Apache Kafka, Flink (Stateful Processing).
- Environment Management: LXD (System containers), `uv` (Fast Python management).
- Languages:
Python: Data processing, automation, and Boto3 wizardry.
SQL: Complex queries and database sandboxing.

Currently Exploring

Diving deep into the AWS Solutions Architect Associate (SAA) exam (scheduled for May 19th!). Tinkering with Agentic AI on AWS using Bedrock, mastering stateful processing with Apache Flink.

Achievements

Preparing upcoming talks on event-driven architectures for the AWS User Group Arequipa and GCP User Group Tijuana this May and June⚡

Fun Fact

I have to confess: I’m so deep into the ecosystem that if I were a piece of Java code, I’d definitely be dating Kafka. Our relationship would be low-latency, high-throughput, and we’d never have sync issues thanks to perfect partitioning. ☕️

Sorry RabbitMQ, you're great, but my heart (and my offsets) belong to Kafka.

Random Dev Quote

Build on, Always!

User Activities

JunJulAugSepOctNovDecJanFebMarAprMay
Mon
Tue
Wed
Thu
Fri
Sat
Sun
Less More
Joined: 2 months (since Mar 19)
Extra privileges: Editing any comment
Full Name: Rocio Baigorria
Headline: Data Engineer | AWS & Data Platforms | Co-leader AWS Girls Argentina
About: Pivoted from Industrial Engineering to Data. I’ve traded factory floors for well-tuned clusters. Technical Rebel dedicated to turning "messy flows" into elegant, event-driven architectures✨
Location: Buenos Aires, Argentina
Website: https://dxaokewn60u4i.cloudfront.net/
Languges & Tools: - Data Engineering & Cloud:
AWS: S3, Glue, Lambda, Athena, Redshift, Step Functions.
- Infrastructure as Code: Terraform (the only way to build).
- Streaming & Event-Driven: Apache Kafka, Flink (Stateful Processing).
- Environment Management: LXD (System containers), `uv` (Fast Python management).
- Languages:
Python: Data processing, automation, and Boto3 wizardry.
SQL: Complex queries and database sandboxing.
Currently Exploring: Diving deep into the AWS Solutions Architect Associate (SAA) exam (scheduled for May 19th!). Tinkering with Agentic AI on AWS using Bedrock, mastering stateful processing with Apache Flink.
Achievements: Preparing upcoming talks on event-driven architectures for the AWS User Group Arequipa and GCP User Group Tijuana this May and June⚡
Fun Fact: I have to confess: I’m so deep into the ecosystem that if I were a piece of Java code, I’d definitely be dating Kafka. Our relationship would be low-latency, high-throughput, and we’d never have sync issues thanks to perfect partitioning. ☕️

Sorry RabbitMQ, you're great, but my heart (and my offsets) belong to Kafka.
Random Dev Quote: Build on, Always!
Latest Video:
chevron_left

Latest Jobs

View all jobs →