Introducing EvalsOne - A New Way to Evaluate LLM prompts

February 13, 2024 · 2 min read

Yue Zhang

Founder of EvalsOne

After months of hard work, we are thrilled to announce that EvalsOne has officially entered its pre-launch stage.

For over two years, our team has been developing an AI-powered mental health chatbot. As early adopters of OpenAI's API, we integrated prompt engineering and fine-tuned models to provide users with a unique and valuable experience. However, the inherent unpredictability of large language models (LLMs) can sometimes lead to inconsistencies in the user experience – a significant concern when dealing with the sensitive topic of mental health.

Initially, we relied on manual prompt testing in playgrounds, but we quickly realized that improving prompt quality required a more efficient approach. This led us to explore prompt evaluation.

Prompt evaluation tools and products already existed, but they often lacked the comprehensiveness or ease of use we needed. Determined to optimize our workflow, we decided to build our own solution.

Our ideal product should be:

Easy to use: It should not demand deep technical expertise, so that every role on the team can use it.
Open and flexible: It should be able to evaluate a variety of models and allow evaluation metrics to be configured freely.
Systematic and comprehensive: It should cover the whole process, from sample preparation, model selection, and metric setting through to result feedback.
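
To make the last point concrete, here is a minimal Python sketch of the four stages (samples, model, metric, result). It is an illustration only and not EvalsOne's API: the names Sample, exact_match, evaluate and stub_model are hypothetical, and the stub stands in for a real LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Sample:
    """One test case: the user input and the answer we hope to get back."""
    user_input: str
    expected: str


def exact_match(output: str, expected: str) -> float:
    """Metric: 1.0 if the output equals the expected text (case-insensitive), else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def evaluate(
    prompt_template: str,
    model: Callable[[str], str],
    samples: List[Sample],
    metric: Callable[[str, str], float],
) -> float:
    """Run every sample through the model and return the mean metric score."""
    scores = []
    for sample in samples:
        prompt = prompt_template.format(input=sample.user_input)
        scores.append(metric(model(prompt), sample.expected))
    return sum(scores) / len(scores)


def stub_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call, so the sketch runs offline."""
    return "positive" if "love" in prompt else "negative"


# 1. Sample preparation
samples = [Sample("I love this", "positive"), Sample("This is awful", "negative")]
# 2. Model selection, 3. metric setting, 4. result feedback
score = evaluate("Classify the sentiment: {input}", stub_model, samples, exact_match)
print(f"Mean score: {score:.2f}")
```

Swapping stub_model for a function that calls a real model, or exact_match for a different metric, leaves the loop itself unchanged.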

From this vision, EvalsOne was born. Now an integral part of our internal workflows, EvalsOne has dramatically improved our efficiency while significantly boosting team satisfaction. By automating tedious tasks, we've gained the freedom to focus on innovation and creativity.

We're now inviting a limited number of seed users (over 200) to join our private beta and help us shape the future of LLM prompt evaluation. As a thank-you, you'll receive:

$50 in initial credit
3 months of Standard Plan for free

Join the EvalsOne community and experience our prompt engineering platform firsthand. Sign up for the private beta waitlist at https://evalsone.com and start building better AI apps!