1. What is a Data Engineer at Persistent?
A Data Engineer at Persistent plays a critical role in driving digital transformation, cloud modernization, and data-driven decision-making for a global clientele. Persistent is known for its deep expertise in software product engineering and cloud technologies, making the data engineering function a cornerstone of its enterprise solutions. In this role, you will design, build, and optimize highly scalable data pipelines that ingest, process, and store massive volumes of structured and unstructured data.
The impact of a Data Engineer at Persistent extends across multiple high-value domains, including healthcare, life sciences, and financial services. You will be responsible for migrating legacy data warehouses to modern lakehouse architectures, such as Microsoft Fabric and Azure Databricks, and ensuring that data is readily available for advanced analytics and machine learning. Furthermore, you will build and support data enablement pipelines that prepare training-ready datasets for Generative AI, Large Language Models (LLMs), and Defensive AI initiatives.
What makes this role exceptionally challenging and rewarding is the focus on data governance, compliance, and lifecycle management. At Persistent, you will not only move data but also execute sophisticated governance policies, such as automated archival and deletion procedures using tools like BigID. By ensuring referential integrity, security, and performance across multi-cloud environments, you will directly enable clients to unleash the full potential of their data while maintaining strict regulatory compliance.

