Large language model performance matrix
This table describes the performance of various large language models (LLMs) for different use cases in Elastic Security, based on our internal testing. To learn more about these use cases, refer to Attack discovery or AI Assistant.
Feature | Claude 3: Opus | Claude 3: Sonnet | Claude 3: Haiku | GPT-4o | GPT-4 Turbo | GPT-4 32K
---|---|---|---|---|---|---
Assistant - General | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
Assistant - ES\|QL Generation | Great | Great | Poor | Excellent | Poor | Excellent
Assistant - Alert Questions | Excellent | Excellent | Excellent | Excellent | Poor | Good (limited context)
Attack discovery | Excellent | Great | Poor | Poor | Good | Good (limited context)