We are looking for a Data Engineer to design and maintain scalable data solutions that power advanced analytics and AI-driven insights. This role combines expertise in big data engineering and web scraping, enabling you to work on high-impact projects involving large, complex, and unstructured datasets.
Responsibilities
- Architect, develop, and maintain high-throughput data pipelines in Databricks and AWS
- Ingest, normalize, and enrich large volumes of structured and unstructured data
- Collaborate with AI Engineers, ML scientists, and software teams to translate requirements into scalable data architectures
- Optimize pipeline performance and cost using distributed processing techniques
- Enforce data governance, privacy, and lineage standards
- Build automated validation, testing, and monitoring frameworks to ensure data quality and freshness
- Support onboarding and integration of new external data vendors
- Continuously evaluate emerging GenAI tooling and drive proofs of concept
- Own the creation of tools and workflows for web crawling and scraping using compliance-approved technologies
Benefits
- Equal opportunities employer
- Inclusive workplace