We are seeking a skilled Data Engineer with expertise in PySpark and SQL to join our Data Engineering team. The role involves designing, building, and maintaining scalable data pipelines and processing systems to support business intelligence, analytics, and machine learning initiatives. The ideal candidate will collaborate with data scientists, analysts, and other engineers to understand data requirements and deliver efficient solutions.
Requirements
- Develop and maintain scalable, robust data pipelines using Scala and big data technologies.
- Work with large datasets to ingest, transform, and make data available for analytics and reporting.
- Collaborate with Data Scientists, Analysts, and other engineers to understand data requirements and deliver efficient solutions.
- Optimize ETL jobs for performance and cost.
- Ensure data quality, governance, and consistency across all environments.
- Monitor production jobs, troubleshoot issues, and ensure system reliability.
- Implement best practices for data engineering, including code reviews, testing, and documentation.