We are looking for a highly skilled and dedicated Data Engineer to join a new AI solutions development squad that will be building cutting-edge applications leveraging Large Language Models (LLMs). We will be building AI solutions end-to-end: from concept, through prototyping, productization, to operations.

The Data Engineer will be responsible for designing, building, and maintaining a robust data infrastructure to support AI applications.

The ideal candidate will have expertise in handling structured and unstructured data, vector databases, real-time data processing, and cloud-based AI solutions (AWS or Azure).

Responsibilities

  • Generative AI Application Co-creation: Collaborate with AI engineers, data scientists, product owners, and other developers in Agile teams to integrate LLMs into scalable, robust, fair, and ethical end-user applications, focusing on user experience, relevance, and real-time performance
  • Data Infrastructure Development and Data Integration: Design and implement scalable, high-performance data pipelines for AI/GenAI applications, ensuring efficient data ingestion, transformation, storage, and retrieval; integrate different databases, requiring understanding of data architectures / Domain data ecosystem
  • Vector Database Management: Work with vector databases (e.g., AWS OpenSearch or Azure AI Search) to store and retrieve high-dimensional data for Generative AI workloads
  • Cloud-Based Data Engineering: Build and maintain cloud-based data solutions using AWS (OpenSearch, S3) or Azure (Azure AI Search, Azure Blob Storage)
  • Snowflake Implementation: Design and optimize data storage and processing using Snowflake for scalable, cloud-native analytics solutions
  • Data Processing & Transformation: Develop ETL/ELT pipelines to enable real-time and batch data processing
  • Support AI Model Workflows: Collaborate with AI/ML Engineers and Data Scientists to ensure seamless integration of data pipelines with AI finetuning, inference, and training workflows
  • Performance Optimization: Optimize data storage, retrieval, and processing strategies for efficiency, scalability, and cost-effectiveness
  • Security & Compliance: Implement data governance, security best practices, and compliance measures aligned with Roche’s standards
  • Monitoring & Maintenance: Set up monitoring, alerting, and logging for data pipelines, ensuring high availability and reliability

Requirements

  • 3+ years in data engineering, preferably supporting AI/ML applications
  • Proficiency in Python, SQL, and vector database native languages
  • Experience with relational, NoSQL, vector databases, and Snowflake in particular
  • Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (Azure AI Search, Azure Blob Storage, Azure Automation)
  • Experience building scalable ETL/ELT workflows using DBT, Apache Airflow, or similar
  • Ability to design and integrate RESTful APIs for data exchange
  • Understanding of encryption and role-based access controls
  • Familiarity with Git, CI/CD, containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, CloudFormation)
  • Experience working with AI-specific data needs, such as embeddings, RAG (Retrieval Augmented Generation), and LLM fine-tuning data preparation
  • Proficiency in the best practices of software engineering
  • Excellent analytical skills and the ability to tackle complex challenges with innovative solutions
  • Hold a B.Sc., B.Eng., or higher, or equivalent in Computer Science, Data Engineering, or related fields.
  • Be able to communicate in English at the level of C1+

We offer

  • Paid Time Off (Vacation, Sick & Public Holidays in your country)
  • Long-term B2B contract fully remote
  • Friendly atmosphere and Trust-based managerial culture
  • 100% remote work
  • Innovative Environment: Work on cutting-edge AI technologies in a highly impactful program
  • Growth Opportunities: Opportunities for professional development and learning in the rapidly evolving field of AI
  • Collaborative Culture: Be a part of a diverse and inclusive team that values collaboration and innovation
  • Participate only in international projects
  • Referral bonuses for recommending your friends

If interested, please share your CV at iuliana@euroasiarecruiting.com.