New
Design a structured, configurable evaluation engine combining deterministic checks with LLM-as-judge verdicts. Build calibration workflows using expert-labeled examples, measure precision and recall accurately, handle delayed outcomes and low-confidence review flows, and store structured verdicts to power dashboards and analytics.
New
Lead AI automation across internal operations
Prototyping MVPs using LLM APIs and agents
Posted 5 days ago
Define and lead ISV ecosystem strategy
Expand AWS co-sell partnerships with key players
Posted 10 days ago
Set architectural direction for agentic systems
Identify and fix reliability and scalability gaps
Posted 29 days ago
Automate risk workflows using Go and AI tools.
Prototype and turn experiments into production.
Posted 304 days ago
Rapid prototyping and deployment of AI features
Building scalable applications using modern cloud stacks