Testing the limits of AI — so real users don't have to. Specializing in GPT-style LLM evaluation, hallucination detection, and AI safety validation.
01 // about_me
I'm an AI-focused Quality Assurance Engineer with over 2 years of experience in manual and automation testing, with deep specialization in GPT-style LLM evaluation and AI safety validation.
My work sits at the intersection of AI quality and safety — I help ensure that conversational AI systems behave accurately, ethically, and reliably before they reach real users.
From hallucination detection to adversarial prompt design, I test the edges of AI systems so they're ready for the real world. I thrive in Agile environments and collaborate closely with cross-functional teams.
02 // skills
03 // work_experience
04 // education
05 // projects
A web-based AI tool that generates structured QA test cases (Positive, Negative, Edge Case) from a plain-English feature description — built end-to-end and deployed live. Demonstrates prompt engineering, LLM output validation, and QA-focused product thinking.
An AI-powered diagnostic tool that analyzes why a Tavus Phoenix-4 training video gets rejected — before the customer even submits it. Validates 7 official Phoenix-4 requirements and uses Gemini AI to write a plain-English fix plan with approval probability estimate.
Quality assurance for an AI-powered video platform that turns simple inputs into fully personalized, AI-generated videos.
Comprehensive validation of digital human and conversational AI interaction systems for production reliability.
06 // achievements
Improved AI response validation efficiency by implementing a structured prompt testing strategy across teams.
Strengthened AI safety compliance through systematic harmful-content detection workflows.
Recognized for analytical debugging skills and proactive communication during AI product releases.
Consistently ensured high model response accuracy and safety before every production deployment.
07 // contact
I'm open to new opportunities, collaborations, and conversations about AI quality and safety. Feel free to reach out any time.