We are seeking a skilled and motivated Data Engineer with strong expertise in PySpark, SQL to join our Data Engineering team. This role involves designing, building, and maintaining scalable data pipelines and processing systems to support business intelligence, analytics, and machine learning initiatives. The successful candidate will work closely with Data Scientists, Analysts, and other engineers to deliver efficient solutions.
Requirements
- Develop and maintain scalable, robust data pipelines using Scala and big data technologies.
- Work with large datasets from multiple sources to ingest, transform, and make data available for analytics and reporting.
- Collaborate with Data Scientists, Analysts, and other engineers to understand data requirements.
- Optimize ETL jobs for performance and cost.
- Ensure data quality, governance, and consistency across all environments.
- Monitor production jobs, troubleshoot issues, and ensure system reliability.
- Implement best practices for data engineering, including code reviews, testing, and documentation.