What is a Data Engineer at Sayari?
At Sayari, a Data Engineer is at the absolute center of our mission to map the global economy and provide unparalleled risk intelligence. Our platform equips public and private sector organizations with instant visibility into complex, hidden commercial relationships. Because we ingest, clean, and resolve corporate and trade data from over 250 jurisdictions worldwide, our data pipeline is one of the most massive and complex graph-building operations in the industry. As a Data Engineer, your work directly powers the risk resilience and mission-critical investigations of Fortune 500 companies, financial institutions, and global government agencies.
You will join a highly collaborative team where you are responsible for turning raw, unstructured global registry data into clean, structured, and connected entity profiles. This is not a standard data warehousing role; it is a highly specialized pipeline engineering position where you will work with cutting-edge technologies like Apache Spark, Airflow, Elasticsearch, and graph databases like Memgraph. Your primary objective will be to design and build scalable pipelines that can resolve millions of disparate data points into a single, cohesive global graph.
The impact of this role is immediate and profound. The pipelines you build and optimize will process billions of records, directly influencing the accuracy and latency of our risk intelligence products. Whether you are working on complex identity resolution algorithms or optimizing cloud infrastructure to handle massive data volumes, your engineering decisions will directly protect global financial systems from bad actors, illicit trade, and financial crime.