In this report, we detail our latest research in developing language models specialized for the cyber security domain. We present our latest model, DEMIST-2, a sub-100 million parameter model that is capable of analyzing a vast range of cyber security-related text. We describe the training process including training data, model architecture and optimization choices before evaluating DEMIST-2 against comparable models. By using DEMIST-2 in combination with a custom LoRA swapping architecture, we can specialize DEMIST-2 into a range of tasks, whilst minimizing computational overhead and memory usage. We overview several unique tasks and evaluate our performance.
In existence since Darktrace’s inception in 2013, the Darktrace AI Research Centre is foundational to our continued innovation. Rather than a defined product roadmap, the Centre looks at how AI can be applied to real-world challenges, to find solutions that cannot be achieved by humans alone.