User Frustration Template
How to Run
The following is an example code snippet for implementation:

| Eval | GPT-4 | GPT-4o | GPT-4 Turbo | Gemini Pro | GPT-3.5 | GPT-3.5 Turbo Instruct | Palm (Text Bison) | Claude V2 |
|---|---|---|---|---|---|---|---|---|
| Precision | 1 | 1 | 1 | 1 | 0.99 | 0.42 | 1 | 1 |
| Recall | 0.89 | 0.92 | 0.98 | 0.98 | 0.83 | 1 | 0.94 | 0.64 |
| F1 | 0.94 | 0.96 | 0.99 | 0.99 | 0.90 | 0.59 | 0.97 | 0.78 |
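
As a quick sanity check on the table above, the F1 row can be recomputed from the precision and recall rows, assuming F1 here is the usual harmonic mean of the two. The sketch below is illustrative only: `f1_score` and the `reported` dictionary are hypothetical names introduced for this example, and the numbers are copied from the table rather than produced by running the eval.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall) pairs copied from the benchmark table above.
reported = {
    "GPT-4": (1.0, 0.89),
    "GPT-4o": (1.0, 0.92),
    "GPT-4 Turbo": (1.0, 0.98),
    "Gemini Pro": (1.0, 0.98),
    "GPT-3.5": (0.99, 0.83),
    "GPT-3.5 Turbo Instruct": (0.42, 1.0),
    "Palm (Text Bison)": (1.0, 0.94),
    "Claude V2": (1.0, 0.64),
}

# Print each model's recomputed F1, rounded to two decimals like the table.
for model, (precision, recall) in reported.items():
    print(f"{model}: F1 = {f1_score(precision, recall):.2f}")
```

Under that assumption, the recomputed values agree with the F1 row to two decimal places. The pattern also shows why GPT-3.5 Turbo Instruct scores lowest on F1 despite perfect recall: its precision of 0.42 pulls the harmonic mean down.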