Large language model performance matrix

Learn how different models perform on different tasks in Elastic Security.

This table describes the performance of various large language models (LLMs) for different use cases in Elastic Security, based on our internal testing. To learn more about these use cases, refer to Attack discovery or AI Assistant.

FeatureModel
Claude 3: Opus
Claude 3.5: Sonnet
Claude 3: Haiku
GPT-4o
GPT-4 Turbo
Gemini 1.5 Pro
Gemini 1.5 Flash
Assistant: general
Excellent
Excellent
Excellent
Excellent
Excellent
Excellent
Excellent
Assistant: ES|QL generation
Great
Great
Poor
Excellent
Poor
Good
Poor
Assistant: alert questions
Excellent
Excellent
Excellent
Excellent
Poor
Excellent
Good
Attack discovery
Excellent
Excellent
Poor
Poor
Good
Great
Poor

On this page