AWS Data Engineer · Mumbai
2+ years crafting high-performance data pipelines, data mesh architectures, and cloud-native ETL solutions at Capgemini — with a focus on AWS, PySpark, and Infrastructure as Code.
About
I'm Sarthak, an AWS Data Engineer based in Mumbai with a B.E. in Computer Science (9.46 CGPA) from St. Francis Institute of Technology.
At Capgemini, I've designed and deployed scalable data mesh infrastructure, migrated legacy ETL pipelines to modern AWS architectures, and helped teams ship data products reliably — with a focus on performance, cost, and governance.
I thrive at the intersection of cloud engineering, data architecture, and automation — always looking for ways to cut complexity and ship faster.
Skills
Experience
Projects
Natural Language Processing project exploring text analysis and machine learning techniques.
Converts natural language or speech input into SQL queries — bridging human language and databases.
A Python-based memory card game — a fun personal project exploring game logic and UI.
Step-by-step implementation of a ChatGPT-like LLM in PyTorch — exploring the internals of large language models.
Certifications
Microsoft Azure Fundamentals
Google Cloud Digital Leader
Agile Software Development
Service Delivery Award — 2025
Contact
I'm open to Data Engineering roles, cloud architecture projects, and interesting collaborations. Feel free to reach out — I typically respond within a day.
Send an email →