Senior Engineer - Data Engineering
Date: 28 Mar 2026
Location: Pune, MH, IN, 411006
Company: Altimetrik
Engineer - Data, Python, Pyspark, 5-7 yrs, Lead the design, development, and optimization of data pipelines using Python and PySpark to ensure efficient data processing and integration
- Advanced proficiency in Python for data manipulation, analysis, and development of machine learning algorithms.
- Utilization of libraries such as Pandas, NumPy, and scikit-learn for extracting insights from complex datasets and automating data processing tasks.
- Strong background in PySpark for efficient handling of large-scale data processing and analytics across distributed computing environments.
- Adept at writing optimized PySpark code to execute transformations and actions on large datasets, facilitating real-time data processing and analysis.
- Proficiency in both Python and PySpark expected at an advanced level, with the ability to integrate these technologies into production-grade data pipelines.
- Education: Master of Technology (M.Tech) in Data Science, and Bachelor of Engineering (B.E.) or Bachelor of Technology (B.Tech) in Computer Science or Information Technology.
- Strong foundation in both theoretical and practical aspects of data science and engineering from educational background.
- Certifications preferred: Databricks Certified Data Engineer Associate and AWS Certified Big Data – Specialty.
- Demonstrates commitment to continuous learning and expertise in cloud-based data solutions.
- Lead the design, development, and optimization of data pipelines using Python and PySpark to ensure efficient data processing and integration.
- Collaborate with cross-functional teams, including data scientists and analysts, to understand data requirements and provide robust data solutions that meet business needs.
- Develop scalable data models and architectures to support large-scale data processing and storage.
- Implement data validation and testing processes to ensure high data quality and integrity throughout the data lifecycle.
- Mentor junior data engineers by providing technical guidance and support, fostering a culture of continuous learning and improvement within the team.
- Conduct performance tuning and troubleshooting of data workflows to enhance processing efficiency.
- Stay updated with the latest industry trends and best practices in data engineering and big data technologies, applying this knowledge to drive improvements in processes and tools.
- Document data engineering processes and workflows for knowledge sharing and operational continuity.
Long Description
Engineer - Data, Python, Pyspark, 5-7 yrs