We are looking for a highly skilled and dedicated Data Engineer to join a new AI solutions development squad that will be building cutting-edge applications leveraging Large Language Models (LLMs). We will be building AI solutions end-to-end: from concept, through prototyping, productization, to operations.
The Data Engineer will be responsible for designing, building, and maintaining a robust data infrastructure to support AI applications.
The ideal candidate will have expertise in handling structured and unstructured data, vector databases, real-time data processing, and cloud-based AI solutions (AWS or Azure).
Responsibilities
- Generative AI Application Co-creation: Collaborate with AI engineers, data scientists, product owners, and other developers in Agile teams to integrate LLMs into scalable, robust, fair, and ethical end-user applications, focusing on user experience, relevance, and real-time performance
- Data Infrastructure Development and Data Integration: Design and implement scalable, high-performance data pipelines for AI/GenAI applications, ensuring efficient data ingestion, transformation, storage, and retrieval; integrate different databases, requiring understanding of data architectures / Domain data ecosystem
- Vector Database Management: Work with vector databases (e.g., AWS OpenSearch or Azure AI Search) to store and retrieve high-dimensional data for Generative AI workloads
- Cloud-Based Data Engineering: Build and maintain cloud-based data solutions using AWS (OpenSearch, S3) or Azure (Azure AI Search, Azure Blob Storage)
- Snowflake Implementation: Design and optimize data storage and processing using Snowflake for scalable, cloud-native analytics solutions
- Data Processing & Transformation: Develop ETL/ELT pipelines to enable real-time and batch data processing
- Support AI Model Workflows: Collaborate with AI/ML Engineers and Data Scientists to ensure seamless integration of data pipelines with AI finetuning, inference, and training workflows
- Performance Optimization: Optimize data storage, retrieval, and processing strategies for efficiency, scalability, and cost-effectiveness
- Security & Compliance: Implement data governance, security best practices, and compliance measures aligned with Roche’s standards
- Monitoring & Maintenance: Set up monitoring, alerting, and logging for data pipelines, ensuring high availability and reliability
Requirements
- 3+ years in data engineering, preferably supporting AI/ML applications
- Proficiency in Python, SQL, and vector database native languages
- Experience with relational, NoSQL, vector databases, and Snowflake in particular
- Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (Azure AI Search, Azure Blob Storage, Azure Automation)
- Experience building scalable ETL/ELT workflows using DBT, Apache Airflow, or similar
- Ability to design and integrate RESTful APIs for data exchange
- Understanding of encryption and role-based access controls
- Familiarity with Git, CI/CD, containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, CloudFormation)
- Experience working with AI-specific data needs, such as embeddings, RAG (Retrieval Augmented Generation), and LLM fine-tuning data preparation
- Proficiency in the best practices of software engineering
- Excellent analytical skills and the ability to tackle complex challenges with innovative solutions
- Hold a B.Sc., B.Eng., or higher, or equivalent in Computer Science, Data Engineering, or related fields.
- Be able to communicate in English at the level of C1+
We offer
- Paid Time Off (Vacation, Sick & Public Holidays in your country)
- Long-term B2B contract fully remote
- Friendly atmosphere and Trust-based managerial culture
- 100% remote work
- Innovative Environment: Work on cutting-edge AI technologies in a highly impactful program
- Growth Opportunities: Opportunities for professional development and learning in the rapidly evolving field of AI
- Collaborative Culture: Be a part of a diverse and inclusive team that values collaboration and innovation
- Participate only in international projects
- Referral bonuses for recommending your friends
If interested, please share your CV at iuliana@euroasiarecruiting.com.