We’re not just building AI—we’re redefining what’s possible with it. As our Senior Data Engineer, you’ll own the full data lifecycle: sourcing, structuring, and optimizing the multimodal datasets that power cutting-edge AI models.
What You’ll Do
- Build and scale robust data pipelines from scratch
- Source and structure diverse datasets (text, video, image, audio) for AI training
- Collaborate with ML teams to curate training data, support fine-tuning, and improve model performance
- Automate labeling, cleaning, and structuring processes at scale
- Unlock insights from internal product data and external sources
- Tackle complex video data challenges—segmentation, classification, and more
- Ensure data quality, privacy, and pipeline reliability in everything you touch
What Sets You Apart
- 5+ years of experience in data engineering, data pipelines, or ML data workflows
- Expertise in Python, SQL, and large-scale data tools (Spark, Airflow, dbt, etc.)
- Deep understanding of the data needs of AI models, especially multimodal ones
- Passion for automation, data quality, and high-impact infrastructure
- Experience sourcing data from APIs, web scraping, or third-party tools
- Bonus: Video data structuring, LLM training, or generative AI experience
We move fast, think big, and build the impossible. If you’re a builder at heart and data is your playground—let’s talk.