I check AI work for a living, including my own.
LLM evaluation, dataset quality assurance, and evaluation-harness review for AI quality-assurance platforms, the working anchor of the practice. Behind it sits a self-funded research program with its methods published in the open.
Enter the research wing