I’m open sourcing my financial LLM evals The initial github repo is set up My vision: 1 • evaluate single LLM 2 • evaluate chain of LLMs 3 • evaluate agentic LLM flows All tasks will be on investment research.
Code:
2,67K