Identifying Data Upload Services

Malicious actors sometimes exfiltrate data to a single endpoint, but more frequently upload data to multiple endpoints. They may do so deliberately, to obscure the exfiltration and avoid detection, or simply because the legitimate services they upload to, such as cloud storage, use multiple endpoints for scalability.

Consequently, checking for previous uploads to the same endpoint is not sufficient to determine whether an external data upload is unusual: an external service may have additional endpoints that have been observed but never associated with that service. The service itself therefore needs to be characterized.

We can characterize a service by finding data-upload connections with common properties, such as identical or similar hostnames, identical JA3 client hashes, or identical ASNs. We first identify the dominant transfer endpoints involved in a single upload event. We then successively restrict the remaining external connections made by the device, based on various properties of the connections in that dominant group, until a plausible characterization of an external service is reached.
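The restriction procedure described above can be sketched roughly as follows. This is a minimal illustration, not the method as actually implemented: the `Connection` record, the `base_domain` similarity key (last two hostname labels), the `dominant_share` threshold, and the order in which properties are applied are all assumptions made for the example, and the hostnames and ASNs are placeholder values.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Connection:
    """Hypothetical record of one outbound upload connection."""
    hostname: str
    ja3: str
    asn: int
    bytes_out: int


def base_domain(hostname: str) -> str:
    # Crude "similar hostname" key: the last two labels.
    # Assumption for illustration; multi-label public suffixes are ignored.
    return ".".join(hostname.lower().split(".")[-2:])


def characterize_service(connections, dominant_share=0.5):
    """Return (profile, members) for the service behind one upload event."""
    # Step 1: dominant endpoints are the top hostnames by uploaded bytes,
    # taken until they cover `dominant_share` of the event's total volume.
    totals = Counter()
    for c in connections:
        totals[c.hostname] += c.bytes_out
    total = sum(totals.values())
    dominant, covered = set(), 0
    for host, volume in totals.most_common():
        dominant.add(host)
        covered += volume
        if covered >= dominant_share * total:
            break
    seed = [c for c in connections if c.hostname in dominant]

    # Step 2: successively restrict all connections by each property
    # on which the dominant group agrees.
    properties = (
        ("asn", lambda c: c.asn),
        ("ja3", lambda c: c.ja3),
        ("domain", lambda c: base_domain(c.hostname)),
    )
    profile, members = {}, list(connections)
    for name, key in properties:
        values = {key(c) for c in seed}
        if len(values) == 1:  # skip properties the dominant group disagrees on
            value = next(iter(values))
            profile[name] = value
            members = [c for c in members if key(c) == value]
    return profile, members


event = [
    Connection("up1.examplecloud.test", "aaa", 64500, 900_000),
    Connection("up2.examplecloud.test", "aaa", 64500, 700_000),
    Connection("up3.examplecloud.test", "aaa", 64500, 50_000),
    Connection("cdn.other.test", "aaa", 64500, 1_000),
    Connection("news.example.org", "bbb", 64501, 2_000),
]
profile, members = characterize_service(event)
```

In this toy event, the single largest endpoint forms the dominant group, and the two smaller `examplecloud.test` endpoints are attributed to the same service because they share its ASN, JA3 hash, and base domain. The connection to `cdn.other.test` shares the ASN and JA3 hash but is excluded by the hostname restriction.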

We can then search for previous connections and uploads to the same service using these properties, rather than specific hostnames or endpoints, and associate observed endpoints with a service even if their hostnames differ.
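As a rough sketch of this lookup, the snippet below compares an endpoint-based check with a property-based check against a device's upload history. The profile keys (`asn`, `ja3`, `domain`), the dictionary layout of each connection, and the helper names are hypothetical and chosen only for illustration.

```python
def base_domain(hostname: str) -> str:
    # Assumed similarity key: the last two hostname labels.
    return ".".join(hostname.lower().split(".")[-2:])


def matches_profile(conn: dict, profile: dict) -> bool:
    """True if the connection agrees with every property in the profile."""
    if not profile:
        return False  # an empty profile would otherwise match everything
    observed = {
        "asn": conn["asn"],
        "ja3": conn["ja3"],
        "domain": base_domain(conn["hostname"]),
    }
    return all(observed.get(name) == value for name, value in profile.items())


def prior_uploads(history: list, profile: dict) -> list:
    """Earlier connections attributable to the same service."""
    return [c for c in history if matches_profile(c, profile)]


# Hypothetical service profile and device history (placeholder values).
profile = {"asn": 64500, "ja3": "aaa", "domain": "examplecloud.test"}
history = [
    {"hostname": "up1.examplecloud.test", "ja3": "aaa", "asn": 64500},
    {"hostname": "up2.examplecloud.test", "ja3": "aaa", "asn": 64500},
    {"hostname": "news.example.org", "ja3": "bbb", "asn": 64501},
]
new_upload = {"hostname": "up9.examplecloud.test", "ja3": "aaa", "asn": 64500}

# Endpoint-based check: this exact hostname has never been seen before.
seen_endpoint = any(c["hostname"] == new_upload["hostname"] for c in history)

# Service-based check: the device has uploaded to this service before.
seen_service = matches_profile(new_upload, profile) and bool(
    prior_uploads(history, profile)
)
```

Here the new hostname has never been observed, so an endpoint-based check would treat the upload as novel, while the property-based check finds two earlier uploads to the same service.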

This prevents legitimate uploads to commonly used services from being flagged as possible exfiltration, even when those uploads do not always go to the same endpoint, while also enabling precise characterization of malicious exfiltration patterns.

Researcher

Dr. Tim Bazalgette

Chief AI Officer, Darktrace

