There are two techniques I’m using for evaluating whether a large language model (LLM) is maximally compliant—or helpful when answering questions that it’s not supposed to answer—that I haven't seen elsewhere:
Share this post
Is My Large Language Model Maximally…
Share this post
There are two techniques I’m using for evaluating whether a large language model (LLM) is maximally compliant—or helpful when answering questions that it’s not supposed to answer—that I haven't seen elsewhere: