We are looking for an experienced Data Engineer to join the Yoshi en team and take ownership of our data infrastructure, ensuring that data flows reliably, efficiently, and in a usable state across the organization.
In this role, you will design, build, and maintain the pipelines and systems that enable analysts, data scientists, and business teams to work effectively with data.
This is a pivotal position with high impact on how data is structured, stored, and used. Beyond building and maintaining pipelines, you will act as a key expert for cloud infrastructure, containerization, CI/CD practices, and MLOps, helping to continuously improve our data platform and technical standards.
Design, develop, and maintain scalable data pipelines, ETL/ELT processes, and data integration workflows.
Build and optimize data models, schemas, partitioning strategies, and fact/dimension tables for a scalable architecture.
Manage batch jobs, streaming data, and API integrations across multiple systems.
Monitor the performance and cost-efficiency of the data platform and introduce improvements.
Identify and troubleshoot data-related issues and implement effective solutions.
Collaborate closely with data analysts, software engineers, and business stakeholders to translate data requirements into technical solutions.
Support integrations with tools such as Bloomreach/Exponea, Linkster, Zendesk, Magento, and Odoo.
Contribute to improving cloud infrastructure, orchestration, CI/CD, and MLOps practices.
Stay up to date with modern data engineering technologies and best practices.
4+ years of experience in Data Engineering, Data Analysis, Data Science, Data Architecture, or similar roles involving complex data pipelines.
A degree in Computer Science, Engineering, Information Technology, or a related field.
Strong proficiency in Python and SQL and proven experience building ETL pipelines.
Hands-on experience with Google Cloud Platform, especially BigQuery, Cloud Storage, and Datastream.
Experience with dbt (including testing) and Prefect for orchestration.
Solid understanding of broader technical infrastructure and how a data platform integrates with other systems (e.g., martech, APIs, reporting tools).
Strong problem-solving skills, attention to detail, and the ability to work independently.
Excellent communication and collaboration skills.
Experience with tools such as Bloomreach Exponea, Linkster, Tableau, Magento, or Odoo.
Experience with containerization, CI/CD, or MLOps practices.