Ever wondered whether you could tell if a text was written by an LLM, or even which model wrote it?
Finally, a new standard for evaluating the factual question-answering capabilities of LLMs.
We all know: the more data, the better the training. But what if we don't have enough data to train our own model? The answer: synthetic datasets.