Software Data Engineer

New

Skills

Cloud Data Platforms LLMs Python Sql

Join our team as a Software Data Engineer and work on exciting document mining challenges in collaboration with ML and full-stack teams. Scale data pipelines efficiently and apply best practices for cloud-based data platforms. Architect data pipelines using LLMs for high-fidelity entity extraction and implement evaluation frameworks for monitoring accuracy and drift.

Key Responsibilities
  • Collaborate with ML and full-stack teams on document mining challenges
  • Scale data pipelines to move data quickly from research to platform
  • Define best practices for a cloud-based data platform
  • Architect data pipelines using LLMs for high-fidelity entity extraction
  • Implement evaluation frameworks to monitor accuracy, drift, and hallucination
Requirements & Qualifications
  • Degree in Computer Science/Engineering or related field
  • 3+ years experience as a software developer
  • Proficient with Python
  • Proficient with SQL
  • Experience using LLMs for structured data extraction
  • Experience with event-driven architecture with Pub/Sub

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: 12 Months

Share this job:

Similar Jobs

DBA at Wikimedia Foundation

Posted 372 days ago

The Wikimedia Foundation is seeking a Senior DBA. Our objective is to make the sum of all human knowledge available to everyone, and we persist most of this knowledge in MariaDB.

Implementation, maintenance and troubleshooting of relational database systems in production and staging environments.

SQL Database Optimization LAMP Administration Linux
overtime