Large language model performance matrix

This table describes the performance of various large language models (LLMs) for different use cases in Elastic Security, based on our internal testing. To learn more about these use cases, refer to Attack discovery or AI Assistant.

Feature / Model               Claude 3: Opus  Claude 3: Sonnet  Claude 3: Haiku  GPT-4o     GPT-4 Turbo  GPT-4 32K
----------------------------  --------------  ----------------  ---------------  ---------  -----------  ----------------------
Assistant - General           Excellent       Excellent         Excellent        Excellent  Excellent    Excellent
Assistant - ES|QL Generation  Great           Great             Poor             Excellent  Poor         Excellent
Assistant - Alert Questions   Excellent       Excellent         Excellent        Excellent  Poor         Good (limited context)
Attack discovery              Excellent       Great             Poor             Poor       Good         Good (limited context)