Senior Specialist, Lead Data Engineer
Software Engineering, Data Science
Hyderabad, Telangana, India · Pune, Maharashtra, India
Job Description
Senior Data Engineer, Specialist
The Opportunity
- Based in Hyderabad, join a global healthcare biopharma company and be part of a 130- year legacy of success backed by ethical integrity, forward momentum, and an inspiring mission to achieve new milestones in global healthcare.
- Be part of an organisation driven by digital technology and data-backed approaches that support a diversified portfolio of prescription medicines, vaccines, and animal health products.
- Drive innovation and execution excellence. Be a part of a team with passion for using data, analytics, and insights to drive decision-making, and which creates custom software, allowing us to tackle some of the world's greatest health threats.
Our Technology Centers focus on creating a space where teams can come together to deliver business solutions that save and improve lives. An integral part of our company's’ IT operating model, Tech Centers are globally distributed locations where each IT division has employees to enable our digital transformation journey and drive business outcomes. These locations, in addition to the other sites, are essential to supporting our business and strategy.
A focused group of leaders in each Tech Center helps to ensure we can manage and improve each location, from investing in growth, success, and well-being of our people, to making sure colleagues from each IT division feel a sense of belonging to managing critical emergencies. And together, we must leverage the strength of our team to collaborate globally to optimize connections and share best practices across the Tech Centers.
Role Overview
We are seeking an experienced Lead Data Engineer to architect, build, and lead scalable data platforms. This role requires strong hands-on experience in AWS, Spark, Databricks, and Airflow, along with technical leadership capabilities to guide teams and establish best practices. This role requires deep expertise in data modelling, modern data architectures, along with exposure to Agentic AI systems and AI-driven data workflows.
What will you do in this role
- Lead the design and implementation of end-to-end data engineering solutions
- Architect and optimize large-scale data pipelines using Spark, Python, and SQL
- Design and manage orchestration frameworks using Apache Airflow
- Build scalable and cost-efficient solutions on AWS ecosystem (S3, Glue, Redshift, Lambda)
- Should have worked on Redshift Tables and must be aware of different distribution techniques.
- Drive development and optimization of Databricks-based data platforms . Unity Catalog and access control knowledge is required
- Mentor and guide data engineering teams
- Collaborate with cross-functional stakeholders to deliver business value
- Implement and drive CI/CD practices, including GitHub Actions
- Ensure data platform scalability, performance, and reliability
- Design and implement scalable data models (conceptual, logical, physical) for enterprise data platforms
- Develop and maintain dimensional models (Star/Snowflake schemas) for analytics and reporting
- Build and optimize data Lakehouse architectures using Databricks & Delta Lake
- Define data standards, naming conventions, and modeling best practices
What Should you have
- Strong expertise in SQL, Python, and Apache Spark
- Extensive hands-on experience with Apache Airflow
- Deep experience with AWS cloud services (S3, Glue, Redshift, Lambda)
- Strong experience building solutions on Databricks platform
- Should have experience in DBT as well.
- Proven experience in data architecture, modeling, and big data systems
- Experience leading engineering teams and projects
- Strong analytical, problem-solving, and communication skills
🔹 Good to Have
- Experience with CI/CD tools, especially GitHub Actions
- Knowledge of Pharmaceutical / Life Sciences domain
- Experience with real-time/streaming pipelines (Kafka, etc.)
- Exposure to ML/data science workflows
🔹 Education & Experience
- Bachelor’s/Master’s degree in Computer Science or related discipline
- Typically 9+ years of experience in data engineering (Lead role)
Primary Skills.
- SQL, Pyton,
- PySpark
- Aws cloud Platform
Who we are
We are known as well-known org Inc., Rahway, New Jersey, USA in the United States and Canada and MSD everywhere else. For more than a century, bringing forward medicines and vaccines for many of the world's most challenging diseases. Today, our company continues to be at the forefront of research to deliver innovative health solutions and advance the prevention and treatment of diseases that threaten people and animals around the world.
What we look for
Imagine getting up in the morning for a job as important as helping to save and improve lives around the world. Here, you have that opportunity. You can put your empathy, creativity, digital mastery, or scientific genius to work in collaboration with a diverse group of colleagues who pursue and bring hope to countless people who are battling some of the most challenging diseases of our time. Our team is constantly evolving, so if you are among the intellectually curious, join us—and start making your impact today.
#HYDIT2025
Required Skills:
Amazon Web Services (AWS), Apache Airflow, Best Practices Research, Business Intelligence (BI), Collaborative Development, Computer Science, Data Architecture Development, Database Administration, Databricks Platform, Databricks Unified Data Analytics Platform, Data Engineering, Data Management, Data Modeling, Data Science, Data Visualization, Design Applications, Enterprise Data, GitHub Actions, Information Management, Software Development, Software Development Life Cycle (SDLC), System Designs, Technical LeadershipPreferred Skills:
Current Employees apply HERE
Current Contingent Workers apply HERE
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status:
RegularRelocation:
VISA Sponsorship:
Travel Requirements:
Flexible Work Arrangements:
HybridShift:
Valid Driving License:
Hazardous Material(s):
Job Posting End Date:
06/29/2026*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.