Senior Data Engineer / AI Data Platform Engineer
patriciabaez.net/jobs
- Santo Domingo
- Permanente
- Tiempo completo
Employment Type: Full-Time
Industry: AI / Software Engineering / Data PlatformsAbout the RoleWe are looking for a Senior Data Engineer / AI Data Platform Engineer to design and build high-performance, scalable data pipelines that power AI/ML systems and data-driven applications.This role goes beyond traditional ETL — it focuses on distributed computing, large-scale data processing, and data infrastructure for AI workloads using technologies like Apache Spark, Python, and cloud-native architectures.You will play a key role in enabling machine learning pipelines, real-time data processing, and high-volume data systems.Key Responsibilities
- Design and build scalable data pipelines for AI/ML workloads
- Develop distributed data processing systems using Apache Spark (batch & streaming)
- Optimize large-scale data transformations using Python and SQL
- Architect and maintain data platforms on AWS, Azure, or Google Cloud
- Implement parallel processing, partitioning strategies, and performance tuning
- Enable data ingestion pipelines for structured and unstructured data (logs, events, APIs)
- Collaborate with ML Engineers and Software Engineers to support AI models
- Ensure data reliability, observability, and system scalability
- Strong experience with Apache Spark (core + performance optimization)
- Advanced SQL (analytical + optimization level)
- Strong programming in Python (data + performance oriented)
- Proven experience building ETL / ELT pipelines at scale
- Experience with cloud-native architectures (AWS, Azure, or GCP)
- Deep understanding of distributed systems and data processing at scale
- Experience handling large datasets (10M–1B+ records)
- Experience supporting ML pipelines (feature engineering, data prep)
- Familiarity with Spark Streaming / Kafka / real-time pipelines
- Experience with Databricks / Snowflake / BigQuery
- Knowledge of data lakehouse architectures
- Experience with containerization (Docker, Kubernetes)
- Exposure to MLOps workflows
- Competitive salary aligned with AI/Engineering market (USD-based)
- Flexible work model (Hybrid Remote)
- Opportunity to work on AI-driven systems and scalable platforms
- High-impact engineering environment (not BI / not reporting-focused)
- Updated LinkedIn profile link
- A short written response including:
- Your experience with Apache Spark and distributed data systems
- A description of the most complex data pipeline or system you have built
- Your experience working with large-scale data or AI-related systems
- Why you are a strong fit for this position