Research

DEMIST-2: Darktrace Embedding Model for Investigation of Security Threats

DEMIST-2 is Darktrace’s latest language model tailored for the cybersecurity domain. With just 95 million parameters, it delivers powerful text analysis for threat detection and classification, while remaining lightweight enough for local deployment. Trained on both natural language and security-specific data, DEMIST-2 excels in embedded environments using custom LoRA adapters to specialize in diverse tasks with minimal memory and compute overhead. This report details its architecture, training process, and performance evaluations. It also highlights DEMIST-2’s improvements over earlier models and its role in enhancing the accuracy and speed of cybersecurity operations within Darktrace’s product ecosystem.

Download this research paper

In this report, we detail our latest research in developing language models specialized for the cyber security domain. We present our latest model, DEMIST-2, a sub-100 million parameter model that is capable of analyzing a vast range of cyber security-related text. We describe the training process including training data, model architecture and optimization choices before evaluating DEMIST-2 against comparable models. By using DEMIST-2 in combination with a custom LoRA swapping architecture, we can specialize DEMIST-2 into a range of tasks, whilst minimizing computational overhead and memory usage. We overview several unique tasks and evaluate our performance.

AI Research Centre

Backed in Research.

In existence since Darktrace’s inception in 2013, the Darktrace AI Research Centre is foundational to our continued innovation. Rather than a defined product roadmap, the Centre looks at how AI can be applied to real-world challenges, to find solutions that cannot be achieved by humans alone.