TopicTracker
From Hacker News

Test yourself against the benchmark questions used for local open-source LLMs

A Streamlit app lets users test their own knowledge against benchmark questions used to evaluate local open-source LLMs, providing a direct comparison between human and model performance.
