1297 search results found
🚀 Projects
19 1

NKalavros/LLMBenchmark

Benchmarking LLM agents on DREAM challenges using DSPy + DeepEval, ideally in a headless way | Language: R | License: GNU General Public License v3.0
