Saga — AI Travel Extraction
Built an internal observability and evaluation system for an LLM-based travel data extraction pipeline, turning prompt iteration from guesswork into a measurable process.
+42.8% field accuracy on flights after one iteration
100+ real emails in the evaluation dataset
MVP in 4–5 days, deployed within a week
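As an illustration of what a field-accuracy figure like the one above might measure, here is a minimal sketch. It assumes exact-match scoring of each labeled field against a hand-labeled reference; the function names, field names, and sample values are hypothetical and not taken from the Saga codebase.

```python
from typing import Any, Dict, Iterable, Tuple

Record = Dict[str, Any]


def field_accuracy(expected: Record, extracted: Record) -> float:
    """Share of labeled fields whose extracted value matches exactly.

    Assumption: exact equality is the match rule; a real evaluator may
    normalize dates, casing, or airport codes before comparing.
    """
    if not expected:
        return 0.0
    correct = sum(1 for key, value in expected.items() if extracted.get(key) == value)
    return correct / len(expected)


def dataset_field_accuracy(pairs: Iterable[Tuple[Record, Record]]) -> float:
    """Micro-averaged field accuracy over (expected, extracted) pairs."""
    total = 0
    correct = 0
    for expected, extracted in pairs:
        total += len(expected)
        correct += sum(1 for key, value in expected.items() if extracted.get(key) == value)
    return correct / total if total else 0.0


# Hypothetical labeled flight and a model output with one wrong field.
expected_flight = {
    "airline": "LH",
    "flight_number": "LH400",
    "departure_airport": "FRA",
    "arrival_airport": "JFK",
}
extracted_flight = {
    "airline": "LH",
    "flight_number": "LH400",
    "departure_airport": "FRA",
    "arrival_airport": "EWR",  # mismatch
}

score = field_accuracy(expected_flight, extracted_flight)  # 3 of 4 fields match: 0.75
```

Running the same scoring over a fixed set of labeled emails before and after a prompt change gives two comparable numbers, which is the kind of measurement that replaces guesswork in prompt iteration.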