We are looking for a Staff Software Engineer to design and build the systems that power how our organization collects, processes, and manages large multimodal datasets.

These systems ingest multimodal data from highly diverse sources, including web crawling, APIs, external providers, large document corpora, and robotics or sensor streams and transform it into structured datasets used by our ML research and evaluation teams.

As a senior member of the team, you will help define the architecture of this platform as it evolves from prototype systems into production data infrastructure. The platform directly feeds into model capability development.

You will:

Our Stack

You'll be a great fit if...