Quick Links
Overview
This is the documentation of ECS version 1.11.0-7.
What is ECS?
The Elastic Common Schema (ECS) is an open source specification, developed with support from the Elastic user community. ECS defines a common set of fields to be used when storing event data in Elasticsearch, such as logs and metrics.
ECS specifies field names and Elasticsearch datatypes for each field, and provides descriptions and example usage. ECS also groups fields into ECS levels, which are used to signal how much a field is expected to be present. You can learn more about ECS levels in [ecs-guidelines]. Finally, ECS also provides a set of naming guidelines for adding custom fields.
The goal of ECS is to enable and encourage users of Elasticsearch to normalize their event data, so that they can better analyze, visualize, and correlate the data represented in their events. ECS has been scoped to accommodate a wide variety of events, spanning:
-
Event sources: whether the source of your event is an Elastic product, a third- party product, or a custom application built by your organization.
-
Ingestion architectures: whether the ingestion path for your events includes Beats processors, Logstash, Elasticsearch ingest node, all of the above, or none of the above.
-
Consumers: whether consumed by API, Kibana queries, dashboards, apps, or other means.
New to ECS?
If you’re new to ECS and looking for an introduction to its benefits and examples of the core concepts, [ecs-getting-started] is a great place to start.
My events don’t map with ECS
ECS is a permissive schema. If your events have additional data that cannot be mapped to ECS, you can simply add them to your events, using custom field names.
Maturity
ECS improvements are released following Semantic Versioning. Major ECS releases are planned to be aligned with major Elastic Stack releases.
Any feedback on the general structure, missing fields, or existing fields is appreciated. For contributions please read the Contribution Guidelines.
Using ECS
ECS fields follow a series of guidelines, to ensure a consistent and predictable feel, across various use cases.
If you’re new to ECS and would like an introduction on implementing and using the schema, check out the [ecs-getting-started] guide.
Whether you’re trying to recall a field name, implementing a solution that follows ECS, or proposing a change to the schema, the [ecs-guidelines] and [ecs-conventions] will help get you there.
If you’re wondering how to best capture event details that don’t map to existing ECS fields, head over to [ecs-custom-fields-in-ecs].
Mapping network events provides a detailed walk-through of how to best map and categorize an example network event to the schema.
Getting Started
ECS enables and encourages users to normalize event data in order to better analyze, visualize, and correlate their events. Collected events can be normalized at ingest time, consistently searched across indices, and visualized predictably.
Note that when adopting an Elastic solution, such as Observability or Security, all events will map to ECS out of the box. Elastic provides an extensive set of integrations to simplify ingesting your data sources.
If you rely on custom data pipelines and/or building content around specific needs, ECS can still help to alleviate the challenges of searching, analyzing, and visualizing across your data. Let’s see how using a common schema can simplify the search experience, and then take a look at how an event’s contents can be mapped to ECS field sets.
Simplified Search
With ECS defining a normalized schema across all of your data sources, querying against those sources is simplified. Consider searching for a particular source IP address prior to adopting ECS. All the various data sources and their field mappings would need to be considered in your query:
src:10.42.42.42 OR client_ip:10.42.42.42 OR apache.access.remote_ip:10.42.42.42 OR context.user.ip:10.42.42.42 OR src_ip:10.42.42.42
With all sources mapped to ECS, the query becomes much simpler:
source.ip:10.42.42.42
Not only does this simplify writing queries, but saved queries shared with other users become much more obvious. To gain familiarity with ECS fields, you can also take a look at the [ecs-field-reference] section.
Unified Visualizations
With normalized data from different data sources, building insightful visualizations across sources is simple. From a single, centralized dashboard, events from web servers, IDS/IPS devices, and firewalls can be aggregated and visualized, and enhanced with drill-downs, and pivoting for delving into deeper investigations. Centralized monitoring of diverse data sources is straightforward with normalized ECS data.
Translating Data Sources
To align events to ECS, some sort of parsing will usually be necessary to transform the contents of the original event into the relevant ECS fields. Depending on how you’ve designed your Elastic Stack data ingestion pipelines, the amount of work to parse your events will vary.
For example, an Apache web server log event:
10.42.42.42 - - [15/Jul/2020:20:48:32 +0000] "GET /content HTTP/1.1" 200 2571 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36"
In order to map this event to ECS, the contents of the event is associated with the appropriate ECS fields.
Field Name | Value |
---|---|
@timestamp |
|
event.original |
10.42.42.42 - - [15/Jul/2020:20:48:32 +0000] "GET /content HTTP/1.1" 200 2571 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36 |
http.request.method |
GET |
http.response.body.bytes |
2571 |
http.response.status_code |
200 |
http.version |
1.1 |
message |
GET /content HTTP/1.1" 200 2571 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36 |
source.address |
10.42.42.42 |
source.ip |
10.42.42.42 |
url.original |
|
user_agent.original |
|
Beyond extracting values that are present in the original event, we also populate other fields to provide additional context about the event itself.
-
ecs.version
: States which version of ECS the ingest pipeline was developed against. -
event.dataset
andevent.module
: Answers "where is this event from" and are expected to have a hardcoded value per pipeline, per source. -
event.kind
,event.category
,event.type
, andevent.outcome
: The [ecs-category-field-values-reference] should also be hardcoded using knowledge of each type of event the source emits. The contents of these fields are limited to the specifically allowed values detailed in the ECS documentation.
Field Name | Value |
---|---|
ecs.version |
1.5.0 |
event.module |
apache |
event.dataset |
apache.access |
event.kind |
event |
event.category |
[ "network", "web" ] |
event.type |
[ "access" ] |
event.outcome |
success |
Lastly, existing field values can be interpreted or enriched using a processor, with the results populating additional fields in the final event.
-
The
user_agent
processor extracts details from the original user agent string,user_agent.original
. -
IP fields like
source.ip
can provide enrichment using thegeoip
processor to add information about the location and autonomous system number (ASN) associated with an IP address. -
The
registered domain
processor reads a field containing a hostname and writes the registered domain to another field -
each event with metadata from the machine’s hosting provider (cloud) and/or from the host machine (host).
Here are some examples of additional fields processed by metadata or parser processors.
Field Name | Value | Processor |
---|---|---|
host.architecture |
x86_64 |
|
host.hostname |
mbp.example.com |
|
host.ip |
[ "192.168.1.100" ] |
|
host.os.family |
darwin |
|
host.os.kernel |
19.4.0 |
|
host.os.name |
Mac OS X |
|
host.os.version |
10.15.4 |
|
user_agent.name |
Chrome |
|
user_agent.os.full |
Mac OS X 10.15.4 |
|
user_agent.os.name |
Mac OS X |
|
user_agent.os.version |
10.15.4 |
|
user_agent.version |
83.0.4103.106 |
|
Field Mapping Reference Guides
We’ve covered at a high level how to map your events to ECS. Now if you’d like your events to render well in the Elastic solutions, check out the reference guides below to learn more about each:
Guidelines and Best Practices
The ECS schema serves best when you follow schema guidelines and best practices.
ECS Field Levels
ECS defines "Core" and "Extended" fields.
-
Core fields. Fields that are most common across all use cases are defined as core fields.
These generalized fields are used by analysis content (searches, visualizations, dashboards, alerts, machine learning jobs, reports) across use cases. Analysis content designed to operate on these fields should work properly on data from any relevant source.
Focus on populating these fields first.
-
Extended fields. Any field that is not a core field is defined as an extended field. Extended fields may apply to more narrow use cases, or may be more open to interpretation depending on the use case. Extended fields are more likely to change over time.
Each ECS field in a table is identified as core or extended.
General guidelines
-
The document MUST have the
@timestamp
field. -
defined for an ECS field.
-
Use the
ecs.version
field to define which version of ECS is used. -
Map as many fields as possible to ECS.
Guidelines for field names
-
Field names must be lower case
-
Combine words using underscore
-
No special characters except underscore
-
Use present tense unless field describes historical information.
-
Use singular and plural names properly to reflect the field content.
-
For example, use
requests_per_sec
rather thanrequest_per_sec
.
-
-
Use prefixes for all fields, except for the base fields.
-
For example, all
host
fields are prefixed withhost.
. Such a grouping is called a field set.
-
-
Nest fields inside a field set with dots
-
The document structure should be nested JSON objects. If you use Beats or Logstash, the nesting of JSON objects is done for you automatically. If you’re ingesting to Elasticsearch using the API, your fields must be nested objects, not strings containing dots.
-
See [dot-notation] for more details.
-
-
General to specific. Organise the nesting of field sets from general to specific, to allow grouping fields into objects with a prefix like
host.*
. -
Avoid repetition or stuttering of words
-
If part of the field name is already in the name of the field set, avoid repeating it. Example:
host.host_ip
should behost.ip
. -
Exceptions can be made, when changing the name of the field would break a strong convention in the community. Example:
host.hostname
is an exception to this rule.
-
-
Avoid abbreviations when possible
-
Exceptions can be made, when the name used for the concept is too strongly in favor of the abbreviation. Example:
ip
fields, or field sets such asos
,geo
.
-
Conventions
The implementation of ECS follows a few conventions. Understanding them will help you understand ECS better.
Datatype for integers
Unless otherwise noted, the datatype used for integer fields should be long
.
IDs and most codes are keywords, not integers
Despite the fact that IDs and codes (such as error codes) are often integers,
this is not always the case.
Since we want to make it possible to map as many systems and data sources
to ECS as possible, we default to using the keyword
type for IDs and codes.
Some specific kinds of codes are always integers, like HTTP status codes.
If those have a specific corresponding specific field (as HTTP status does),
its type can safely be an integer type.
But generic fields like error.code
cannot have this guarantee, and are therefore keyword
.
Text indexing and multi-fields
Elasticsearch can index text using datatypes:
-
text
Text indexing allows for full text search, or searching arbitrary words that are part of the field. -
keyword
Keyword indexing offers faster exact match filtering, prefix search (like autocomplete),
Default Elasticsearch convention for indexing text fields
Unless your index mapping or index template specifies otherwise
(as the ECS index template does),
Elasticsearch indexes a text field as text
at the canonical field name,
and indexes a second time as keyword
, nested in a multi-field.
Default Elasticsearch convention:
-
Canonical field:
myfield
istext
-
Multi-field:
myfield.keyword
iskeyword
ECS convention for indexing text fields
ECS flips the above convention around.
For monitoring use cases, keyword
indexing is needed almost exclusively, with
full text search needed on very few fields.
Moreover, indexing for full text search on lots of fields, where it’s not expected
to be used is wasteful of resources.
Given these two premises, ECS defaults
all text indexing to keyword
datatype (with very few exceptions).
Any use case that requires full text search indexing on additional fields
for full text search. Doing so does not conflict with ECS,
as the canonical field name will remain keyword
indexed.
So the ECS multi-field convention for text is:
-
Canonical field:
myfield
iskeyword
-
Multi-field:
myfield.text
istext
Exceptions
The only exceptions to this convention are fields message
and error.message
,
which are indexed for full text search only, with no multi-field.
These two fields don’t follow the new convention because they are deemed too big
of a breaking change with these two widely used fields in Beats.
Any future field that will be indexed for full text search in ECS will however
follow the multi-field convention where text
indexing is nested in the multi-field.
Custom Fields
ECS defines fields, their datatypes and their usage, and classifies them in "core" and "extended" levels.
However, ECS does not define anything about custom fields. By definition, they are additional fields, exactly as the user or the integration defines them, independently of ECS.
Users and integrations are welcome to capture additional information in their events, as custom fields. This flexibility is by design, and ensures that no one is ever blocked by something not being supported by ECS yet.
ECS is under active development, however. Adding custom fields carries a small risk of conflicting with a future ECS field. There are ways of modeling custom fields that will lead to lower chances of conflict with future versions of ECS. This section outlines a few of these strategies.
Modeling to Reduce Chances of Conflict
The labels
Field
Any time a data source has a few extra fields that can be modeled with the
keyword
data type, the simplest way to capture them is with the ECS field labels
.
Example:
{ "labels": { "foo_id": "beef42", "env": "production" }, "message": "...", "event": { ... } }
If labels
doesn’t work for your use case, here’s a few more tips to avoid conflicts.
Proper Names
ECS tries to model information by using the name of concepts, and avoids proper names such as tool names, project names or company names. By extension, nesting custom fields under a proper name is a relatively safe approach to adding custom fields. This is the approach taken by Filebeats modules, for example.
As an example, an HTTP log from HAProxy will contain typical HTTP information,
as well as proxy details and statistics. The standard HTTP information can be
captured in the ECS field sets http
and url
, and the extra details in a custom
haproxy
section:
{ "http": { "request": { "method": "get", ... }, "response": { "status_code": 200, ... } }, "url": { "original": "/favicon.ico", ... }, "haproxy": { "frontend_name": "myfrontend", "backend_name": "mybackend_prod", "backend_queue": 0, ... } }
Capitalization
ECS strives for a consistent feel by using nesting to group related concepts, and underscores to join words. Following these guidelines for custom fields ensures you preserve the same consistent experience throughout your schemas.
Note however, that breaking away from these guidelines for your custom fields can be used to your advantage. It can be a good way to differentiate between ECS fields and custom fields. Since ECS doesn’t use capitalization for field names, this approach virtually guarantees that custom fields will not conflict with future ECS fields.
Common proxy concepts could modelled via a capitalized, but generic concept name.
HAProxy example:
{ "http": { "request": { "method": "get", ... } }, "url": { "original": "/favicon.ico", ... }, "Proxy": { "FrontendName": "myfrontend", "BackendName": "mybackend_prod" }, "event": { "module": "haproxy" } }
NGINX example:
{ "http": { "request": { "method": "get", ... } }, "url": { "original": "/favicon.ico", ... }, "Proxy": { "FrontendName": "another_frontend", "BackendName": "another_backend_prod" }, "event": { "module": "nginx" } }
The above demonstrates that using a common concept name in custom fields
can still be beneficial to correlate among multiple sources that populate them.
Using capitalization ensures a future version of ECS that defines a proxy
field set will not conflict.
Here’s a sample event, during a migration from the custom field, to using a new equivalent ECS field set:
{ "http": { "request": { "method": "get", ... } }, "Proxy": { "FrontendName": "myfrontend", "BackendName": "mybackend_prod" }, "proxy": { "frontend_name": "myfrontend", "backend_name": "mybackend_prod" } }
The above will look strange during the migration. However the ability to start populating ECS fields while custom fields are still present in your events makes it possible to decouple the upgrade to a new version of ECS from the time you adjust your pipelines and analysis content.
Mapping Network Events
Network events capture the details of one device communicating with another. The initiator is referred to as the source, and the recipient as the destination. Depending on the data source, a network event can contain details of addresses, protocols, headers, and device roles.
This guide describes the different field sets available for network-related events in ECS and provides direction on the ECS best practices for mapping to them.
Source and destination baseline
When an event contains details about the sending and receiving hosts, the baseline for capturing these values will be the source and destination fields.
Some events may also indicate each host’s role in the exchange: client or server. When this information is available, the client and server fields should be used in addition to the source
and destination
fields. The fields and values mapped under source
/destination
should be copied under client
/server
.
Network event mapping example
Below is a DNS network event. The source device (192.168.86.222
) makes a DNS query, acting as the client and the DNS server is the destination (192.168.86.1
).
Note this event contains additional details that would populate additional fields (such as the [ecs-dns]) if this was a complete mapping example. These additional fields are omitted here to focus on the network details.
{ "ts":1599775747.53056, "uid":"CYqFPH3nOAa0kPxA0d", "id.orig_h":"192.168.86.222", "id.orig_p":54162, "id.resp_h":"192.168.86.1", "id.resp_p":53, "proto":"udp", "trans_id":28899, "rtt":0.02272200584411621, "query":"example.com", "qclass":1, "qclass_name":"C_INTERNET", "qtype":1, "qtype_name":"A", "rcode":0, "rcode_name":"NOERROR", "AA":false, "TC":false, "RD":true, "RA":true, "Z":0, "answers":["93.184.216.34"], "TTLs":[21209.0], "rejected":false }
Source and destination fields
First, the source.*
and destination.*
field sets are populated:
"source": { "ip": "192.168.86.222", "port": 54162 }
"destination": { "ip": "192.168.86.1", "port": 53 }
Client and server fields
Looking back at the original event, it shows the source device is the DNS client and the destination device is the DNS server. The values mapped under source
and destination
are copied and mapped under client
and server
, respectively:
"client": { "ip": "192.168.86.222", "port": 54162 }
"server": { "ip": "192.168.86.1", "port": 53 }
Mapping both pairs of field sets gives query visibility of the same network transaction in two ways.
-
source.ip:192.168.86.222
returns all events sourced from192.168.86.222
, regardless its role in a transaction -
client.ip:192.168.86.222
returns all events with host192.168.86.222
acting as a client
The same applies for the destination
and server
fields:
-
destination.ip:192.168.86.1
returns all events destined to192.168.86.1
-
server.ip:192.168.86.1
returns all events with192.168.86.1
acting as the server
It’s important to note that while the values for the source
and destination
fields may reverse between events in a single network transaction, the values for client
and server
typically will not. The following two tables demonstrate how two DNS transactions involving two clients and one server would map to source.ip
/destination.ip
vs. client.ip
/server.ip
:
source.ip | destination.ip | event |
---|---|---|
192.168.86.222 |
192.168.86.1 |
DNS query request 1 |
192.168.86.1 |
192.168.86.222 |
DNS answer response 1 |
192.168.86.42 |
192.168.86.1 |
DNS answer request 2 |
192.168.86.1 |
192.168.86.42 |
DNS answer request 2 |
client.ip | server.ip | event |
---|---|---|
192.168.86.222 |
192.168.86.1 |
DNS query request 1 |
192.168.86.222 |
192.168.86.1 |
DNS answer response 1 |
192.168.86.42 |
192.168.86.1 |
DNS query request 2 |
192.168.86.42 |
192.168.86.1 |
DNS answer response 2 |
Related fields
The related.ip
field captures all the IPs present in the event in a single array:
"related": { "ip": [ "192.168.86.222", "192.168.86.1", "93.184.216.34" ] }
The related fields are meant to facilitate pivoting. Since these IP addresses can appear in many different fields (source.ip
, destination.ip
, client.ip
, server.ip
, etc.), you can search for the IP trivially no matter what field it appears using a single query, e.g. related.ip:192.168.86.222
.
Network events are not only limited to using related.ip
. If hostnames or other host identifiers were present in the event, related.hosts
should be populated too.
Categorization using event fields
When considering the event categorization fields, the category
and type
fields are populated using their respective allowed values which best classify the source network event.
"event": { "category": [ "network" ], "type": [ "connection", "protocol" ], "kind": "event" }
Most event.category/event.type ECS pairings are complete on their own. However, the pairing of event.category:network
and event.type:protocol
is an exception. When these two fields/value pairs both used to categorize an event, the network.protocol
field should also be populated:
"network": { "protocol": "dns", "type": "ipv4", "transport": "udp" }
Result
Putting everything together covered so far, we have a final ECS-mapped event:
{ "event": { "category": [ "network" ], "type": [ "connection", "protocol" ], "kind": "event" }, "network": { "protocol": "dns", "type": "ipv4", "transport": "udp" }, "source": { "ip": "192.168.86.222", "port": 54162 }, "destination": { "ip": "192.168.86.1", "port": 53 }, "client": { "ip": "192.168.86.222", "port": 64734 }, "server": { "ip": "192.168.86.1", "port": 53 }, "related": { "ip": [ "192.168.86.222", "192.168.86.1", "93.184.216.34" ] }, "dns": { ... }, <= Again, not diving into the DNS fields here but included for completeness. "zeek": { "ts":1599775747.53056, ... } <= Original fields can optionally be kept around as custom fields. }
ECS Field Reference
This is the documentation of ECS version 1.11.0-7.
ECS defines multiple groups of related fields. They are called "field sets". The Base field set is the only one whose fields are defined at the root of the event.
all fields are defined.
For a single page representation of all fields, please see the /generated/csv/fields.csv[generated CSV of fields].
Field Sets
Field Set | Description |
---|---|
All fields defined directly at the root of the events. |
|
Fields describing an Autonomous System (Internet routing prefix). |
|
Fields about the client side of a network connection, used with server. |
|
Fields describing the container that generated this event. |
|
Fields about the destination side of a network connection, used with source. |
|
Meta-information specific to ECS. |
|
Fields about errors of any kind. |
|
Fields breaking down the event details. |
|
Fields describing a location. |
|
Fields related to git repositories |
|
Hashes, usually file hashes. |
|
Fields describing the relevant computing instance. |
|
Fields describing an HTTP request. |
|
Fields to describe observer interface information. |
|
Details about the event’s logging mechanism. |
|
Fields for metrics generated from logs. |
|
Fields describing the communication path over which the event happened. |
|
Fields generated and normalized by the logging pipeline. |
|
Fields describing an entity observing the event from outside the host. |
|
Fields relevant to container orchestrators. |
|
OS fields contain information about the operating system. |
|
These fields contain information about an installed software package. |
|
These fields contain information about a process. |
|
Fields to capture details about rules used to generate alerts or other notable events. |
|
Fields about the server side of a network connection, used with client. |
|
Fields describing the service for or from which the data was collected. |
|
Fields about the source side of a network connection, used with destination. |
|
Fields describing a TLS connection. |
|
Fields related to distributed tracing. |
|
Fields that let you store URLs in various forms. |
|
Fields to describe the user relevant to the event. |
|
Fields to describe a browser user_agent string. |
Base Fields
The base
field set contains all fields which are at the root of the events. These fields are common across all types of events.
Field | Description | Level |
---|---|---|
Date/time when the event originated. This is the date/time extracted from the event, typically representing when the event was generated by the source. If the event source has no original timestamp, this value is typically populated by the first time the event was received by the pipeline. Required field for all events. type: date example: |
core |
|
Custom key/value pairs. Can be used to add meta information to events. Should not contain nested objects. All values are stored as keyword. Example: type: object example: |
core |
|
For log events the message field contains the log message, optimized for viewing in a log viewer. For structured logs without an original message field, other fields can be concatenated to form a human-readable summary of the event. If multiple messages exist, they can be combined into one message. type: text example: |
core |
|
List of keywords used to tag each event. type: keyword Note: this field should contain an array of values. example: |
core |
Autonomous System Fields
An autonomous system (AS) is a collection of connected Internet Protocol (IP) routing prefixes under the control of one or more network operators on behalf of a single administrative entity or domain that presents a common, clearly defined routing policy to the internet.
Field | Description | Level |
---|---|---|
Unique number allocated to the autonomous system. The autonomous system number (ASN) uniquely identifies each network on the Internet. type: long example: |
extended |
|
Organization name. type: keyword Multi-fields: * as.organization.name.text (type: text) example: |
extended |
The as
fields are expected to be nested at:
-
client.as
-
destination.as
-
server.as
-
source.as
-
threat.enrichments.indicator.as
-
threat.indicator.as
Note also that the as
fields are not expected to be used directly at the root of the events.
Client Fields
A client is defined as the initiator of a network connection for events regarding sessions, connections, or bidirectional flow records.
For TCP events, the client is the initiator of the TCP connection that sends the SYN packet(s). For other protocols, the client is generally the initiator or requestor in the network transaction. Some systems use the term "originator" to refer the client in TCP connections. The client fields describe details about the system acting as the client in the network event. Client fields are usually populated in conjunction with server fields. Client fields are generally not populated for packet-level events.
Client / server representations can add semantic context to an exchange, which is helpful to visualize the data in certain situations. If your context falls in that category, you should still ensure that source and destination are filled appropriately.
Field | Description | Level |
---|---|---|
Some event client addresses are defined ambiguously. The event will sometimes list an IP, a domain or a unix socket. You should always store the raw address in the Then it should be duplicated to type: keyword |
extended |
|
Bytes sent from the client to the server. type: long example: |
core |
|
Client domain. type: keyword |
core |
|
IP address of the client (IPv4 or IPv6). type: ip |
core |
|
MAC address of the client. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword example: |
core |
|
Translated IP of source based NAT sessions (e.g. internal client to internet). Typically connections traversing load balancers, firewalls, or routers. type: ip |
extended |
|
Translated port of source based NAT sessions (e.g. internal client to internet). Typically connections traversing load balancers, firewalls, or routers. type: long |
extended |
|
Packets sent from the client to the server. type: long example: |
core |
|
Port of the client. type: long |
core |
|
The highest registered client domain, stripped of the subdomain. For example, the registered domain for "foo.example.com" is "example.com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last two labels will not work well for TLDs such as "co.uk". type: keyword example: |
extended |
|
The subdomain portion of a fully qualified domain name includes all of the names except the host name under the registered_domain. In a partially qualified domain, or if the the qualification level of the full name cannot be determined, subdomain contains all of the names below the registered domain. For example the subdomain portion of "www.east.mydomain.co.uk" is "east". If the domain has multiple levels of subdomain, such as "sub2.sub1.example.com", the subdomain field should contain "sub2.sub1", with no trailing period. type: keyword example: |
extended |
|
The effective top level domain (eTLD), also known as the domain suffix, is the last part of the domain name. For example, the top level domain for example.com is "com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last label will not work well for effective TLDs such as "co.uk". type: keyword example: |
extended |
Container Fields
Container fields are used for meta information about the specific container that is the source of information.
These fields help correlate data based containers from any runtime.
Field | Description | Level |
---|---|---|
Unique container id. type: keyword |
core |
|
Name of the image the container was built on. type: keyword |
extended |
|
Container image tags. type: keyword Note: this field should contain an array of values. |
extended |
|
Image labels. type: object |
extended |
|
Container name. type: keyword |
extended |
|
Runtime managing this container. type: keyword example: |
extended |
Destination Fields
Destination fields capture details about the receiver of a network exchange/packet. These fields are populated from a network event, packet, or other event containing details of a network transaction.
Destination fields are usually populated in conjunction with source fields. The source and destination fields are considered the baseline and should always be filled if an event contains source and destination details from a network transaction. If the event also contains identification of the client and server roles, then the client and server fields should also be populated.
Field | Description | Level |
---|---|---|
Some event destination addresses are defined ambiguously. The event will sometimes list an IP, a domain or a unix socket. You should always store the raw address in the Then it should be duplicated to type: keyword |
extended |
|
Bytes sent from the destination to the source. type: long example: |
core |
|
Destination domain. type: keyword |
core |
|
IP address of the destination (IPv4 or IPv6). type: ip |
core |
|
MAC address of the destination. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword example: |
core |
|
Translated ip of destination based NAT sessions (e.g. internet to private DMZ) Typically used with load balancers, firewalls, or routers. type: ip |
extended |
|
Port the source session is translated to by NAT Device. Typically used with load balancers, firewalls, or routers. type: long |
extended |
|
Packets sent from the destination to the source. type: long example: |
core |
|
Port of the destination. type: long |
core |
|
The highest registered destination domain, stripped of the subdomain. For example, the registered domain for "foo.example.com" is "example.com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last two labels will not work well for TLDs such as "co.uk". type: keyword example: |
extended |
|
The subdomain portion of a fully qualified domain name includes all of the names except the host name under the registered_domain. In a partially qualified domain, or if the the qualification level of the full name cannot be determined, subdomain contains all of the names below the registered domain. For example the subdomain portion of "www.east.mydomain.co.uk" is "east". If the domain has multiple levels of subdomain, such as "sub2.sub1.example.com", the subdomain field should contain "sub2.sub1", with no trailing period. type: keyword example: |
extended |
|
The effective top level domain (eTLD), also known as the domain suffix, is the last part of the domain name. For example, the top level domain for example.com is "com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last label will not work well for effective TLDs such as "co.uk". type: keyword example: |
extended |
ECS Fields
Meta-information specific to ECS.
Field | Description | Level |
---|---|---|
ECS version this event conforms to. Must match When querying across multiple indices — which may conform to slightly different ECS versions — this field lets integrations adjust to the schema version of the events. type: keyword example: |
core |
Error Fields
These fields can represent errors of any kind.
Use them for errors that happen while fetching events or in cases where the event itself contains an error.
Field | Description | Level |
---|---|---|
Error code describing the error. type: keyword |
core |
|
Unique identifier for the error. type: keyword |
core |
|
Error message. type: text |
core |
|
The first line of the stack trace useful for aggregation. This is an attribute of the type: keyword |
custom |
|
The previous stack trace. type: keyword Multi-fields: * error.stack.previous_trace.text (type: text) |
custom |
|
The stack trace of this error in plain text. type: keyword Multi-fields: * error.stack_trace.text (type: text) |
extended |
|
The type of the error, for example the class name of the exception. type: keyword example: |
extended |
Event Fields
The event fields are used for context information about the log or metric event itself.
A log is defined as an event containing details of something that happened. Log events must include the time at which the thing happened. Examples of log events include a process starting on a host, a network packet being sent from a source to a destination, or a network connection between a client and a server being initiated or closed. A metric is defined as an event containing one or more numerical measurements and the time at which the measurement was taken. Examples of metric events include memory pressure measured on a host and device temperature. See the event.kind
definition in this section for additional details about metric and state events.
Field | Description | Level |
---|---|---|
The action captured by the event. This describes the information in the event. It is more specific than type: keyword example: |
core |
|
Agents are normally responsible for populating the For example if the agent’s connection is authenticated with mTLS and the client cert contains the ID of the agent to which the cert was issued then the If no validation is performed then the field should be omitted. The allowed values are:
type: keyword example: |
extended |
|
This is one of four ECS Categorization Fields, and indicates the second level in the ECS category hierarchy.
This field is an array. This will allow proper categorization of some events that fall in multiple categories. type: keyword Note: this field should contain an array of values. Important: The field value must be one of the following: authentication, configuration, database, driver, file, host, iam, intrusion_detection, malware, network, package, process, registry, session, threat, web To learn more about when to use which value, visit the page allowed values for event.category |
core |
|
Identification code for this event, if one exists. Some event sources use event codes to identify messages unambiguously, regardless of message language or wording adjustments over time. An example of this is the Windows Event ID. type: keyword example: |
extended |
|
event.created contains the date/time when the event was first read by an agent, or by your pipeline. This field is distinct from @timestamp in that @timestamp typically contain the time extracted from the original event. In most situations, these two timestamps will be slightly different. The difference can be used to calculate the delay between your source generating an event, and the time when your agent first processed it. This can be used to monitor your agent’s or pipeline’s ability to keep up with your event source. In case the two timestamps are identical, @timestamp should be used. type: date example: |
core |
|
Name of the dataset. If an event source publishes more than one type of log or events (e.g. access log, error log), the dataset is used to specify which one the event comes from. It’s recommended but not required to start the dataset name with the module name, followed by a dot, then the dataset name. type: keyword example: |
core |
|
Duration of the event in nanoseconds. If event.start and event.end are known this value should be the difference between the end and start time. type: long |
core |
|
event.end contains the date when the event ended or when the activity was last observed. type: date |
extended |
|
Hash (perhaps logstash fingerprint) of raw field to be able to demonstrate log integrity. type: keyword example: |
extended |
|
Unique ID to describe the event. type: keyword example: |
core |
|
Timestamp when an event arrived in the central data store. This is different from In normal conditions, assuming no tampering, the timestamps should chronologically look like this: type: date example: |
core |
|
This is one of four ECS Categorization Fields, and indicates the highest level in the ECS category hierarchy.
The value of this field can be used to inform how these kinds of events should be handled. They may warrant different retention, different access control, it may also help understand whether the data coming in at a regular interval or not. type: keyword Important: The field value must be one of the following: alert, enrichment, event, metric, state, pipeline_error, signal To learn more about when to use which value, visit the page allowed values for event.kind |
core |
|
Name of the module this data is coming from. If your monitoring agent supports the concept of modules or plugins to process events of a given source (e.g. Apache logs), type: keyword example: |
core |
|
Raw text message of entire event. Used to demonstrate log integrity or where the full log message (before splitting it up in multiple parts) may be required, e.g. for reindex. This field is not indexed and doc_values are disabled. It cannot be searched, but it can be retrieved from type: keyword example: |
core |
|
This is one of four ECS Categorization Fields, and indicates the lowest level in the ECS category hierarchy.
Note that when a single transaction is described in multiple events, each event may populate different values of Also note that in the case of a compound event (a single event that contains multiple logical events), this field should be populated with the value that best captures the overall success or failure from the perspective of the event producer. Further note that not all events will have an associated outcome. For example, this field is generally not populated for metric events, events with type: keyword Important: The field value must be one of the following: failure, success, unknown To learn more about when to use which value, visit the page allowed values for event.outcome |
core |
|
Source of the event. Event transports such as Syslog or the Windows Event Log typically mention the source of an event. It can be the name of the software that generated the event (e.g. Sysmon, httpd), or of a subsystem of the operating system (kernel, Microsoft-Windows-Security-Auditing). type: keyword example: |
extended |
|
Reason why this event happened, according to the source. This describes the why of a particular action or outcome captured in the event. Where type: keyword example: |
extended |
|
Reference URL linking to additional information about this event. This URL links to a static definition of this event. Alert events, indicated by type: keyword example: |
extended |
|
Risk score or priority of the event (e.g. security solutions). Use your system’s original value here. type: float |
core |
|
Normalized risk score or priority of the event, on a scale of 0 to 100. This is mainly useful if you use more than one system that assigns risk scores, and you want to see a normalized value across all systems. type: float |
extended |
|
Sequence number of the event. The sequence number is a value published by some event sources, to make the exact ordering of events unambiguous, regardless of the timestamp precision. type: long |
extended |
|
The numeric severity of the event according to your event source. What the different severity values mean can be different between sources and use cases. It’s up to the implementer to make sure severities are consistent across events from the same source. The Syslog severity belongs in type: long example: |
core |
|
event.start contains the date when the event started or when the activity was first observed. type: date |
extended |
|
This field should be populated when the event’s timestamp does not include timezone information already (e.g. default Syslog timestamps). It’s optional otherwise. Acceptable timezone formats are: a canonical ID (e.g. "Europe/Amsterdam"), abbreviated (e.g. "EST") or an HH:mm differential (e.g. "-05:00"). type: keyword |
extended |
|
This is one of four ECS Categorization Fields, and indicates the third level in the ECS category hierarchy.
This field is an array. This will allow proper categorization of some events that fall in multiple event types. type: keyword Note: this field should contain an array of values. Important: The field value must be one of the following: access, admin, allowed, change, connection, creation, deletion, denied, end, error, group, indicator, info, installation, protocol, start, user To learn more about when to use which value, visit the page allowed values for event.type |
core |
|
URL linking to an external system to continue investigation of this event. This URL links to another system where in-depth investigation of the specific occurrence of this event can take place. Alert events, indicated by type: keyword example: |
extended |
Geo Fields
Geo fields can carry data about a specific location related to an event.
This geolocation information can be derived from techniques such as Geo IP, or be user-supplied.
Field | Description | Level |
---|---|---|
City name. type: keyword example: |
core |
|
Two-letter code representing continent’s name. type: keyword example: |
core |
|
Name of the continent. type: keyword example: |
core |
|
Country ISO code. type: keyword example: |
core |
|
Country name. type: keyword example: |
core |
|
Longitude and latitude. type: geo_point example: |
core |
|
User-defined description of a location, at the level of granularity they care about. Could be the name of their data centers, the floor number, if this describes a local physical entity, city names. Not typically used in automated geolocation. type: keyword example: |
extended |
|
Postal code associated with the location. Values appropriate for this field may also be known as a postcode or ZIP code and will vary widely from country to country. type: keyword example: |
core |
|
Region ISO code. type: keyword example: |
core |
|
Region name. type: keyword example: |
core |
|
The time zone of the location, such as IANA time zone name. type: keyword example: |
core |
The geo
fields are expected to be nested at:
-
client.geo
-
destination.geo
-
host.geo
-
observer.geo
-
server.geo
-
source.geo
-
threat.enrichments.indicator.geo
-
threat.indicator.geo
Note also that the geo
fields are not expected to be used directly at the root of the events.
Git Fields
Fields related to git repositories
Field | Description | Level |
---|---|---|
The author of the git commit type: keyword |
custom |
|
The full sha1 hash of a git commit type: keyword |
custom |
|
The short sha1 hash of a git commit type: keyword |
custom |
|
The name of the git repo type: keyword |
custom |
|
Domain of the url type: keyword |
custom |
|
Unmodified original url as seen in the event source. type: keyword |
custom |
|
Unmodified original url as seen in the event source. type: keyword |
custom |
|
Scheme of the request, such as "https" or "ssh" type: keyword |
custom |
Hash Fields
The hash fields represent different bitwise hash algorithms and their values.
Field names for common hashes (e.g. MD5, SHA1) are predefined. Add fields for other hashes by lowercasing the hash algorithm name and using underscore separators as appropriate (snake case, e.g. sha3_512).
Note that this fieldset is used for common hashes that may be computed over a range of generic bytes. Entity-specific hashes such as ja3 or imphash are placed in the fieldsets to which they relate (tls and pe, respectively).
Field | Description | Level |
---|---|---|
MD5 hash. type: keyword |
extended |
|
SHA1 hash. type: keyword |
extended |
|
SHA256 hash. type: keyword |
extended |
|
SHA512 hash. type: keyword |
extended |
|
SSDEEP hash. type: keyword |
extended |
The hash
fields are expected to be nested at:
-
dll.hash
-
file.hash
-
process.hash
-
threat.enrichments.indicator.hash
-
threat.indicator.hash
Note also that the hash
fields are not expected to be used directly at the root of the events.
Host Fields
A host is defined as a general computing instance.
ECS host.* fields should be populated with details about the host on which the event happened, or from which the measurement was taken. Host types include hardware, virtual machines, Docker containers, and Kubernetes nodes.
Field | Description | Level |
---|---|---|
Operating system architecture. type: keyword example: |
core |
|
Percent CPU used which is normalized by the number of CPU cores and it ranges from 0 to 1. Scaling factor: 1000. For example: For a two core host, this value should be the average of the two cores, between 0 and 1. type: scaled_float |
extended |
|
The total number of bytes (gauge) read successfully (aggregated from all disks) since the last metric collection. type: long |
extended |
|
The total number of bytes (gauge) written successfully (aggregated from all disks) since the last metric collection. type: long |
extended |
|
Name of the domain of which the host is a member. For example, on Windows this could be the host’s Active Directory domain or NetBIOS domain name. For Linux this could be the domain of the host’s LDAP provider. type: keyword example: |
extended |
|
Hostname of the host. It normally contains what the type: keyword |
core |
|
Unique host id. As hostname is not always unique, use values that are meaningful in your environment. Example: The current usage of type: keyword |
core |
|
Host ip addresses. type: ip Note: this field should contain an array of values. |
core |
|
Host MAC addresses. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword Note: this field should contain an array of values. example: |
core |
|
Name of the host. It can contain what type: keyword |
core |
|
The number of bytes (gauge) sent out on all network interfaces by the host since the last metric collection. type: long |
extended |
|
The number of packets (gauge) sent out on all network interfaces by the host since the last metric collection. type: long |
extended |
|
The number of bytes received (gauge) on all network interfaces by the host since the last metric collection. type: long |
extended |
|
The number of packets (gauge) received on all network interfaces by the host since the last metric collection. type: long |
extended |
|
Type of host. For Cloud providers this can be the machine type like type: keyword |
core |
|
Seconds the host has been up. type: long example: |
extended |
HTTP Fields
Fields related to HTTP activity. Use the url
field set to store the url of the request.
Field | Description | Level |
---|---|---|
Size in bytes of the request body. type: long example: |
extended |
|
The full HTTP request body. type: keyword Multi-fields: * http.request.body.content.text (type: text) example: |
extended |
|
Total size in bytes of the request (body and headers). type: long example: |
extended |
|
Request headers in key/value pairs. Can be used add request header information to events. Should not contain nested objects. All values are stored as keyword. type: object example: |
custom |
|
A unique identifier for each HTTP request to correlate logs between clients and servers in transactions. The id may be contained in a non-standard HTTP header, such as type: keyword example: |
extended |
|
HTTP request method. Prior to ECS 1.6.0 the following guidance was provided: "The field value must be normalized to lowercase for querying." As of ECS 1.6.0, the guidance is deprecated because the original case of the method may be useful in anomaly detection. Original case will be mandated in ECS 2.0.0 type: keyword example: |
extended |
|
Mime type of the body of the request. This value must only be populated based on the content of the request body, not on the type: keyword example: |
extended |
|
Referrer for this HTTP request. type: keyword example: |
extended |
|
Size in bytes of the response body. type: long example: |
extended |
|
The full HTTP response body. type: keyword Multi-fields: * http.response.body.content.text (type: text) example: |
extended |
|
Total size in bytes of the response (body and headers). type: long example: |
extended |
|
Response headers in key/value pairs. Can be used add response header information to events. Should not contain nested objects. All values are stored as keyword. type: object example: |
custom |
|
Mime type of the body of the response. This value must only be populated based on the content of the response body, not on the type: keyword example: |
extended |
|
HTTP response status code. type: long example: |
extended |
|
HTTP version. type: keyword example: |
extended |
Interface Fields
The interface fields are used to record ingress and egress interface information when reported by an observer (e.g. firewall, router, load balancer) in the context of the observer handling a network connection. In the case of a single observer interface (e.g. network sensor on a span port) only the observer.ingress information should be populated.
Field | Description | Level |
---|---|---|
Interface alias as reported by the system, typically used in firewall implementations for e.g. inside, outside, or dmz logical interface naming. type: keyword example: |
extended |
|
Interface ID as reported by an observer (typically SNMP interface ID). type: keyword example: |
extended |
|
Interface name as reported by the system. type: keyword example: |
extended |
The interface
fields are expected to be nested at:
-
observer.egress.interface
-
observer.ingress.interface
Note also that the interface
fields are not expected to be used directly at the root of the events.
Log Fields
Details about the event’s logging mechanism or logging transport.
The log.* fields are typically populated with details about the logging mechanism used to create and/or transport the event. For example, syslog details belong under log.syslog.*
.
The details specific to your event source are typically not logged under log.*
, but rather in event.*
or in other ECS fields.
Field | Description | Level |
---|---|---|
Full path to the log file this event came from, including the file name. It should include the drive letter, when appropriate. If the event wasn’t read from a log file, do not populate this field. type: keyword example: |
extended |
|
Original log level of the log event. If the source of the event provides a log level or textual severity, this is the one that goes in Some examples are type: keyword example: |
core |
|
The name of the logger inside an application. This is usually the name of the class which initialized the logger, or can be a custom name. type: keyword example: |
core |
|
The line number of the file containing the source code which originated the log event. type: integer example: |
extended |
|
The name of the file containing the source code which originated the log event. Note that this field is not meant to capture the log file. The correct field to capture the log file is type: keyword example: |
extended |
|
The name of the function or method which originated the log event. type: keyword example: |
extended |
|
Deprecated for removal in next major version release. This field is superseded by This is the original log message and contains the full log message before splitting it up in multiple parts. In contrast to the This field is not indexed and doc_values are disabled so it can’t be queried but the value can be retrieved from type: keyword example: |
core |
|
The Syslog metadata of the event, if the event was transmitted via Syslog. Please see RFCs 5424 or 3164. type: object |
extended |
|
The Syslog numeric facility of the log event, if available. According to RFCs 5424 and 3164, this value should be an integer between 0 and 23. type: long example: |
extended |
|
The Syslog text-based facility of the log event, if available. type: keyword example: |
extended |
|
Syslog numeric priority of the event, if available. According to RFCs 5424 and 3164, the priority is 8 * facility + severity. This number is therefore expected to contain a value between 0 and 191. type: long example: |
extended |
|
The Syslog numeric severity of the log event, if available. If the event source publishing via Syslog provides a different numeric severity value (e.g. firewall, IDS), your source’s numeric severity should go to type: long example: |
extended |
|
The Syslog numeric severity of the log event, if available. If the event source publishing via Syslog provides a different severity value (e.g. firewall, IDS), your source’s text severity should go to type: keyword example: |
extended |
Metrics Fields
Fields for metrics generated from logs.
Subkeys of this field should generally follow Prometheus naming convention, paying particular attention to the suffix.
"A metric name should have a suffix indicating the unit, in plural form." e.g. ttfb.seconds, heap.bytes, size.megabytes etc.
Field | Description | Level |
---|---|---|
total number of changes performed in a puppet run type: long |
custom |
|
number of resources puppet changed type: long |
custom |
|
number of resources puppet corrective changes made type: long |
custom |
|
number of resources puppet failed to manage type: long |
custom |
|
number of resources puppet failed to restart type: long |
custom |
|
number of resources puppet found to be out_of_sync type: long |
custom |
|
number of resources puppet restarted type: long |
custom |
|
number of resources puppet scheduled type: long |
custom |
|
number of resources puppet skipped type: long |
custom |
|
Total number of resources affected type: long |
custom |
|
total seconds spent executing a specific resource type: object example: |
custom |
|
Time to first byte in seconds. type: half_float |
custom |
Network Fields
The network is defined as the communication path over which a host or network event happens.
The network.* fields should be populated with details about the network activity associated with an event.
Field | Description | Level |
---|---|---|
A name given to an application level protocol. This can be arbitrarily assigned for things like microservices, but also apply to things like skype, icq, facebook, twitter. This would be used in situations where the vendor or service can be decoded such as from the source/dest IP owners, ports, or wire format. The field value must be normalized to lowercase for querying. See the documentation section "Implementing ECS". type: keyword example: |
extended |
|
Total bytes transferred in both directions. If type: long example: |
core |
|
A hash of source and destination IPs and ports, as well as the protocol used in a communication. This is a tool-agnostic standard to identify flows. Learn more at https://github.com/corelight/community-id-spec. type: keyword example: |
extended |
|
Direction of the network traffic. Recommended values are: * ingress * egress * inbound * outbound * internal * external * unknown When mapping events from a host-based monitoring context, populate this field from the host’s point of view, using the values "ingress" or "egress". When mapping events from a network or perimeter-based monitoring context, populate this field from the point of view of the network perimeter, using the values "inbound", "outbound", "internal" or "external". Note that "internal" is not crossing perimeter boundaries, and is meant to describe communication between two hosts within the perimeter. Note also that "external" is meant to describe traffic between two hosts that are external to the perimeter. This could for example be useful for ISPs or VPN service providers. type: keyword example: |
core |
|
Host IP address when the source IP address is the proxy. type: ip example: |
core |
|
IANA Protocol Number (https://www.iana.org/assignments/protocol-numbers/protocol-numbers.xhtml). Standardized list of protocols. This aligns well with NetFlow and sFlow related logs which use the IANA Protocol Number. type: keyword example: |
extended |
|
Network.inner fields are added in addition to network.vlan fields to describe the innermost VLAN when q-in-q VLAN tagging is present. Allowed fields include vlan.id and vlan.name. Inner vlan fields are typically used when sending traffic with multiple 802.1q encapsulations to a network sensor (e.g. Zeek, Wireshark.) type: object |
extended |
|
Name given by operators to sections of their network. type: keyword example: |
extended |
|
Total packets transferred in both directions. If type: long example: |
core |
|
L7 Network protocol name. ex. http, lumberjack, transport protocol. The field value must be normalized to lowercase for querying. See the documentation section "Implementing ECS". type: keyword example: |
core |
|
Array of TCP Flags found in the packet. The field value must be normalized to lowercase for querying. type: keyword Note: this field should contain an array of values. example: |
custom |
|
Same as network.iana_number, but instead using the Keyword name of the transport layer (udp, tcp, ipv6-icmp, etc.) The field value must be normalized to lowercase for querying. See the documentation section "Implementing ECS". type: keyword example: |
core |
|
In the OSI Model this would be the Network Layer. ipv4, ipv6, ipsec, pim, etc The field value must be normalized to lowercase for querying. See the documentation section "Implementing ECS". type: keyword example: |
core |
Normalized Fields
Fields generated and normalized by the logging pipeline, but normally hidden from view in the Kibana Discover app and saved search results in the Dashboard app.
This field is intended to contain fields that generally duplicate other fields and clutter the view.
This field is hidden superficially by an entry in the index pattern source filters.
Field | Description | Level |
---|---|---|
Array of dropped field names due to a type conflict. Values in this field are the equivalent of those in error type type: keyword |
custom |
|
Array of dropped field names due to no available field definition. Values in this field would normally be dropped silently by ElasticSearch. type: keyword |
custom |
|
The message field downcased, truncated to 256 characters, and indexed in an aggregatable form. The downcasing and truncation is handled by the Logstash pipeline. Ideally, this should be a multi-field of the message field, but lacking a truncation feature, this behavior is currently impossible. For more information, see: https://github.com/elastic/elasticsearch/issues/60329 type: keyword example: |
custom |
Observer Fields
An observer is defined as a special network, security, or application device used to detect, observe, or create network, security, or application-related events and metrics.
This could be a custom hardware appliance or a server that has been configured to run special network, security, or application software. Examples include firewalls, web proxies, intrusion detection/prevention systems, network monitoring sensors, web application firewalls, data loss prevention systems, and APM servers. The observer.* fields shall be populated with details of the system, if any, that detects, observes and/or creates a network, security, or application event or metric. Message queues and ETL components used in processing events or metrics are not considered observers in ECS.
Field | Description | Level |
---|---|---|
Observer.egress holds information like interface number and name, vlan, and zone information to classify egress traffic. Single armed monitoring such as a network sensor on a span port should only use observer.ingress to categorize traffic. type: object |
extended |
|
Network zone of outbound traffic as reported by the observer to categorize the destination area of egress traffic, e.g. Internal, External, DMZ, HR, Legal, etc. type: keyword example: |
extended |
|
Hostname of the observer. type: keyword |
core |
|
Observer.ingress holds information like interface number and name, vlan, and zone information to classify ingress traffic. Single armed monitoring such as a network sensor on a span port should only use observer.ingress to categorize traffic. type: object |
extended |
|
Network zone of incoming traffic as reported by the observer to categorize the source area of ingress traffic. e.g. internal, External, DMZ, HR, Legal, etc. type: keyword example: |
extended |
|
IP addresses of the observer. type: ip Note: this field should contain an array of values. |
core |
|
MAC addresses of the observer. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword Note: this field should contain an array of values. example: |
core |
|
Custom name of the observer. This is a name that can be given to an observer. This can be helpful for example if multiple firewalls of the same model are used in an organization. If no custom name is needed, the field can be left empty. type: keyword example: |
extended |
|
The product name of the observer. type: keyword example: |
extended |
|
Observer serial number. type: keyword |
extended |
|
The type of the observer the data is coming from. There is no predefined list of observer types. Some examples are type: keyword example: |
core |
|
Vendor name of the observer. type: keyword example: |
core |
|
Observer version. type: keyword |
core |
Location | Field Set | Description |
---|---|---|
|
Fields to describe observer interface information. |
|
|
Fields to describe observed VLAN information. |
|
|
Fields describing a location. |
|
|
Fields to describe observer interface information. |
|
|
Fields to describe observed VLAN information. |
|
|
OS fields contain information about the operating system. |
Orchestrator Fields
Fields that describe the resources which container orchestrators manage or act upon.
Field | Description | Level |
---|---|---|
API version being used to carry out the action type: keyword example: |
extended |
|
Name of the cluster. type: keyword |
extended |
|
URL of the API used to manage the cluster. type: keyword |
extended |
|
The version of the cluster. type: keyword |
extended |
|
Namespace in which the action is taking place. type: keyword example: |
extended |
|
Organization affected by the event (for multi-tenant orchestrator setups). type: keyword example: |
extended |
|
The list of annotations added to the resource. type: keyword Note: this field should contain an array of values. example: |
extended |
|
Unique ID of the resource being acted upon. type: keyword |
extended |
|
The list of labels added to the resource. type: keyword Note: this field should contain an array of values. example: |
extended |
|
Name of the resource being acted upon. type: keyword example: |
extended |
|
Type of resource being acted upon. type: keyword example: |
extended |
|
Orchestrator cluster type (e.g. kubernetes, nomad or cloudfoundry). type: keyword example: |
extended |
Operating System Fields
The OS fields contain information about the operating system.
Field | Description | Level |
---|---|---|
OS family (such as redhat, debian, freebsd, windows). type: keyword example: |
extended |
|
Operating system name, including the version or code name. type: keyword Multi-fields: * os.full.text (type: text) example: |
extended |
|
Operating system kernel version as a raw string. type: keyword example: |
extended |
|
Operating system name, without the version. type: keyword Multi-fields: * os.name.text (type: text) example: |
extended |
|
Operating system platform (such centos, ubuntu, windows). type: keyword example: |
extended |
|
Use the One of these following values should be used (lowercase): linux, macos, unix, windows. If the OS you’re dealing with is not in the list, the field should not be populated. Please let us know by opening an issue with ECS, to propose its addition. type: keyword example: |
extended |
|
Operating system version as a raw string. type: keyword example: |
extended |
The os
fields are expected to be nested at:
-
host.os
-
observer.os
-
user_agent.os
Note also that the os
fields are not expected to be used directly at the root of the events.
Package Fields
These fields contain information about an installed software package. It contains general information about a package, such as name, version or size. It also contains installation details, such as time or location.
Field | Description | Level |
---|---|---|
Package architecture. type: keyword example: |
extended |
|
Additional information about the build version of the installed package. For example use the commit SHA of a non-released package. type: keyword example: |
extended |
|
Checksum of the installed package for verification. type: keyword example: |
extended |
|
Description of the package. type: keyword example: |
extended |
|
Indicating how the package was installed, e.g. user-local, global. type: keyword example: |
extended |
|
Time when package was installed. type: date |
extended |
|
License under which the package was released. Use a short name, e.g. the license identifier from SPDX License List where possible (https://spdx.org/licenses/). type: keyword example: |
extended |
|
Package name type: keyword example: |
extended |
|
Path where the package is installed. type: keyword example: |
extended |
|
Home page or reference URL of the software in this package, if available. type: keyword example: |
extended |
|
Package size in bytes. type: long example: |
extended |
|
Type of package. This should contain the package file type, rather than the package manager name. Examples: rpm, dpkg, brew, npm, gem, nupkg, jar. type: keyword example: |
extended |
|
Package version type: keyword example: |
extended |
Process Fields
These fields contain information about a process.
These fields can help you correlate metrics information with a process id/name from a log message. The process.pid
often stays in the metric itself and is copied to the global field for correlation.
Field | Description | Level |
---|---|---|
Array of process arguments, starting with the absolute path to the executable. May be filtered to protect sensitive information. type: keyword Note: this field should contain an array of values. example: |
extended |
|
Length of the process.args array. This field can be useful for querying or performing bucket analysis on how many arguments were provided to start a process. More arguments may be an indication of suspicious activity. type: long example: |
extended |
|
Full command line that started the process, including the absolute path to the executable, and all arguments. Some arguments may be filtered to protect sensitive information. type: keyword Multi-fields: * process.command_line.text (type: text) example: |
extended |
|
Unique identifier for the process. The implementation of this is specified by the data source, but some examples of what could be used here are a process-generated UUID, Sysmon Process GUIDs, or a hash of some uniquely identifying components of a process. Constructing a globally unique identifier is a common practice to mitigate PID reuse as well as to identify a specific process over time, across multiple monitored hosts. type: keyword example: |
extended |
|
Absolute path to the process executable. type: keyword Multi-fields: * process.executable.text (type: text) example: |
extended |
|
The exit code of the process, if this is a termination event. The field should be absent if there is no exit code for the event (e.g. process start). type: long example: |
extended |
|
Process name. Sometimes called program name or similar. type: keyword Multi-fields: * process.name.text (type: text) example: |
extended |
|
Identifier of the group of processes the process belongs to. type: long |
extended |
|
Process id. type: long example: |
core |
|
Parent process' pid. type: long example: |
extended |
|
The time the process started. type: date example: |
extended |
|
Thread ID. type: long example: |
extended |
|
Thread name. type: keyword example: |
extended |
|
Process title. The proctitle, some times the same as process name. Can also be different: for example a browser setting its title to the web page currently opened. type: keyword Multi-fields: * process.title.text (type: text) |
extended |
|
Seconds the process has been up. type: long example: |
extended |
|
The working directory of the process. type: keyword Multi-fields: * process.working_directory.text (type: text) example: |
extended |
The process
fields are expected to be nested at:
-
process.parent
Note also that the process
fields may be used directly at the root of the events.
Location | Field Set | Description |
---|---|---|
|
These fields contain information about binary code signatures. |
|
|
beta:[ This field reuse is beta and subject to change.] These fields contain Linux Executable Linkable Format (ELF) metadata. |
|
|
Hashes, usually file hashes. |
|
|
Information about the parent process. |
|
|
These fields contain Windows Portable Executable (PE) metadata. |
Rule Fields
Rule fields are used to capture the specifics of any observer or agent rules that generate alerts or other notable events.
Examples of data sources that would populate the rule fields include: network admission control platforms, network or host IDS/IPS, network firewalls, web application firewalls, url filters, endpoint detection and response (EDR) systems, etc.
Field | Description | Level |
---|---|---|
Name, organization, or pseudonym of the author or authors who created the rule used to generate this event. type: keyword Note: this field should contain an array of values. example: |
extended |
|
A categorization value keyword used by the entity using the rule for detection of this event. type: keyword example: |
extended |
|
The description of the rule generating the event. type: keyword example: |
extended |
|
A rule ID that is unique within the scope of an agent, observer, or other entity using the rule for detection of this event. type: keyword example: |
extended |
|
Name of the license under which the rule used to generate this event is made available. type: keyword example: |
extended |
|
The name of the rule or signature generating the event. type: keyword example: |
extended |
|
Reference URL to additional information about the rule used to generate this event. The URL can point to the vendor’s documentation about the rule. If that’s not available, it can also be a link to a more general page describing this type of alert. type: keyword example: |
extended |
|
Name of the ruleset, policy, group, or parent category in which the rule used to generate this event is a member. type: keyword example: |
extended |
|
A rule ID that is unique within the scope of a set or group of agents, observers, or other entities using the rule for detection of this event. type: keyword example: |
extended |
|
The version / revision of the rule being used for analysis. type: keyword example: |
extended |
Server Fields
A Server is defined as the responder in a network connection for events regarding sessions, connections, or bidirectional flow records.
For TCP events, the server is the receiver of the initial SYN packet(s) of the TCP connection. For other protocols, the server is generally the responder in the network transaction. Some systems actually use the term "responder" to refer the server in TCP connections. The server fields describe details about the system acting as the server in the network event. Server fields are usually populated in conjunction with client fields. Server fields are generally not populated for packet-level events.
Client / server representations can add semantic context to an exchange, which is helpful to visualize the data in certain situations. If your context falls in that category, you should still ensure that source and destination are filled appropriately.
Field | Description | Level |
---|---|---|
Some event server addresses are defined ambiguously. The event will sometimes list an IP, a domain or a unix socket. You should always store the raw address in the Then it should be duplicated to type: keyword |
extended |
|
Bytes sent from the server to the client. type: long example: |
core |
|
Server domain. type: keyword |
core |
|
IP address of the server (IPv4 or IPv6). type: ip |
core |
|
MAC address of the server. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword example: |
core |
|
Translated ip of destination based NAT sessions (e.g. internet to private DMZ) Typically used with load balancers, firewalls, or routers. type: ip |
extended |
|
Translated port of destination based NAT sessions (e.g. internet to private DMZ) Typically used with load balancers, firewalls, or routers. type: long |
extended |
|
Packets sent from the server to the client. type: long example: |
core |
|
Port of the server. type: long |
core |
|
The highest registered server domain, stripped of the subdomain. For example, the registered domain for "foo.example.com" is "example.com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last two labels will not work well for TLDs such as "co.uk". type: keyword example: |
extended |
|
The subdomain portion of a fully qualified domain name includes all of the names except the host name under the registered_domain. In a partially qualified domain, or if the the qualification level of the full name cannot be determined, subdomain contains all of the names below the registered domain. For example the subdomain portion of "www.east.mydomain.co.uk" is "east". If the domain has multiple levels of subdomain, such as "sub2.sub1.example.com", the subdomain field should contain "sub2.sub1", with no trailing period. type: keyword example: |
extended |
|
The effective top level domain (eTLD), also known as the domain suffix, is the last part of the domain name. For example, the top level domain for example.com is "com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last label will not work well for effective TLDs such as "co.uk". type: keyword example: |
extended |
Service Fields
The service fields describe the service for or from which the data was collected.
These fields help you find and correlate logs for a specific service and version.
Field | Description | Level |
---|---|---|
Ephemeral identifier of this service (if one exists). This id normally changes across restarts, but type: keyword example: |
extended |
|
Unique identifier of the running service. If the service is comprised of many nodes, the This id should uniquely identify the service. This makes it possible to correlate logs and metrics for one specific service, no matter which particular node emitted the event. Note that if you need to see the events from one specific host of the service, you should filter on that type: keyword example: |
core |
|
Name of the service data is collected from. The name of the service is normally user given. This allows for distributed services that run on multiple hosts to correlate the related instances based on the name. In the case of Elasticsearch the type: keyword example: |
core |
|
Name of a service node. This allows for two nodes of the same service running on the same host to be differentiated. Therefore, In the case of Elasticsearch, the type: keyword example: |
extended |
|
Current state of the service. type: keyword |
core |
|
The type of the service data is collected from. The type can be used to group and correlate logs and metrics from one service type. Example: If logs or metrics are collected from Elasticsearch, type: keyword example: |
core |
|
Version of the service the data was collected from. This allows to look at a data set only for a specific version of a service. type: keyword example: |
core |
Source Fields
Source fields capture details about the sender of a network exchange/packet. These fields are populated from a network event, packet, or other event containing details of a network transaction.
Source fields are usually populated in conjunction with destination fields. The source and destination fields are considered the baseline and should always be filled if an event contains source and destination details from a network transaction. If the event also contains identification of the client and server roles, then the client and server fields should also be populated.
Field | Description | Level |
---|---|---|
Some event source addresses are defined ambiguously. The event will sometimes list an IP, a domain or a unix socket. You should always store the raw address in the Then it should be duplicated to type: keyword |
extended |
|
Bytes sent from the source to the destination. type: long example: |
core |
|
Source domain. type: keyword |
core |
|
IP address of the source (IPv4 or IPv6). type: ip |
core |
|
MAC address of the source. The notation format from RFC 7042 is suggested: Each octet (that is, 8-bit byte) is represented by two [uppercase] hexadecimal digits giving the value of the octet as an unsigned integer. Successive octets are separated by a hyphen. type: keyword example: |
core |
|
Translated ip of source based NAT sessions (e.g. internal client to internet) Typically connections traversing load balancers, firewalls, or routers. type: ip |
extended |
|
Translated port of source based NAT sessions. (e.g. internal client to internet) Typically used with load balancers, firewalls, or routers. type: long |
extended |
|
Packets sent from the source to the destination. type: long example: |
core |
|
Port of the source. type: long |
core |
|
The highest registered source domain, stripped of the subdomain. For example, the registered domain for "foo.example.com" is "example.com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last two labels will not work well for TLDs such as "co.uk". type: keyword example: |
extended |
|
The subdomain portion of a fully qualified domain name includes all of the names except the host name under the registered_domain. In a partially qualified domain, or if the the qualification level of the full name cannot be determined, subdomain contains all of the names below the registered domain. For example the subdomain portion of "www.east.mydomain.co.uk" is "east". If the domain has multiple levels of subdomain, such as "sub2.sub1.example.com", the subdomain field should contain "sub2.sub1", with no trailing period. type: keyword example: |
extended |
|
The effective top level domain (eTLD), also known as the domain suffix, is the last part of the domain name. For example, the top level domain for example.com is "com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last label will not work well for effective TLDs such as "co.uk". type: keyword example: |
extended |
TLS Fields
Fields related to a TLS connection. These fields focus on the TLS protocol itself and intentionally avoids in-depth analysis of the related x.509 certificate files.
Field | Description | Level |
---|---|---|
String indicating the cipher used during the current connection. type: keyword example: |
extended |
|
PEM-encoded stand-alone certificate offered by the client. This is usually mutually-exclusive of type: keyword example: |
extended |
|
Array of PEM-encoded certificates that make up the certificate chain offered by the client. This is usually mutually-exclusive of type: keyword Note: this field should contain an array of values. example: |
extended |
|
Certificate fingerprint using the MD5 digest of DER-encoded version of certificate offered by the client. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Certificate fingerprint using the SHA1 digest of DER-encoded version of certificate offered by the client. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Certificate fingerprint using the SHA256 digest of DER-encoded version of certificate offered by the client. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Distinguished name of subject of the issuer of the x.509 certificate presented by the client. type: keyword example: |
extended |
|
A hash that identifies clients based on how they perform an SSL/TLS handshake. type: keyword example: |
extended |
|
Date/Time indicating when client certificate is no longer considered valid. type: date example: |
extended |
|
Date/Time indicating when client certificate is first considered valid. type: date example: |
extended |
|
Also called an SNI, this tells the server which hostname to which the client is attempting to connect to. When this value is available, it should get copied to type: keyword example: |
extended |
|
Distinguished name of subject of the x.509 certificate presented by the client. type: keyword example: |
extended |
|
Array of ciphers offered by the client during the client hello. type: keyword Note: this field should contain an array of values. example: |
extended |
|
String indicating the curve used for the given cipher, when applicable. type: keyword example: |
extended |
|
Boolean flag indicating if the TLS negotiation was successful and transitioned to an encrypted tunnel. type: boolean |
extended |
|
String indicating the protocol being tunneled. Per the values in the IANA registry (https://www.iana.org/assignments/tls-extensiontype-values/tls-extensiontype-values.xhtml#alpn-protocol-ids), this string should be lower case. type: keyword example: |
extended |
|
Boolean flag indicating if this TLS connection was resumed from an existing TLS negotiation. type: boolean |
extended |
|
PEM-encoded stand-alone certificate offered by the server. This is usually mutually-exclusive of type: keyword example: |
extended |
|
Array of PEM-encoded certificates that make up the certificate chain offered by the server. This is usually mutually-exclusive of type: keyword Note: this field should contain an array of values. example: |
extended |
|
Certificate fingerprint using the MD5 digest of DER-encoded version of certificate offered by the server. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Certificate fingerprint using the SHA1 digest of DER-encoded version of certificate offered by the server. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Certificate fingerprint using the SHA256 digest of DER-encoded version of certificate offered by the server. For consistency with other hash values, this value should be formatted as an uppercase hash. type: keyword example: |
extended |
|
Subject of the issuer of the x.509 certificate presented by the server. type: keyword example: |
extended |
|
A hash that identifies servers based on how they perform an SSL/TLS handshake. type: keyword example: |
extended |
|
Timestamp indicating when server certificate is no longer considered valid. type: date example: |
extended |
|
Timestamp indicating when server certificate is first considered valid. type: date example: |
extended |
|
Subject of the x.509 certificate presented by the server. type: keyword example: |
extended |
|
Numeric part of the version parsed from the original string. type: keyword example: |
extended |
|
Normalized lowercase protocol name parsed from original string. type: keyword example: |
extended |
Tracing Fields
Distributed tracing makes it possible to analyze performance throughout a microservice architecture all in one view. This is accomplished by tracing all of the requests - from the initial web request in the front-end service - to queries made through multiple back-end services.
Unlike most field sets in ECS, the tracing fields are not nested under the field set name. In other words, the correct field name is trace.id
, not tracing.trace.id
, and so on.
Field | Description | Level |
---|---|---|
Unique identifier of the span within the scope of its trace. A span represents an operation within a transaction, such as a request to another service, or a database query. type: keyword example: |
extended |
|
Unique identifier of the trace. A trace groups multiple events like transactions that belong together. For example, a user request handled by multiple inter-connected services. type: keyword example: |
extended |
|
Unique identifier of the transaction within the scope of its trace. A transaction is the highest level of work measured within a service, such as a request to a server. type: keyword example: |
extended |
URL Fields
URL fields provide support for complete or partial URLs, and supports the breaking down into scheme, domain, path, and so on.
Field | Description | Level |
---|---|---|
Domain of the url, such as "www.elastic.co". In some cases a URL may refer to an IP and/or port directly, without a domain name. In this case, the IP address would go to the If the URL contains a literal IPv6 address enclosed by type: keyword example: |
extended |
|
The field contains the file extension from the original request url, excluding the leading dot. The file extension is only set if it exists, as not every url has a file extension. The leading period must not be included. For example, the value must be "png", not ".png". Note that when the file name has multiple extensions (example.tar.gz), only the last one should be captured ("gz", not "tar.gz"). type: keyword example: |
extended |
|
Portion of the url after the The type: keyword |
extended |
|
If full URLs are important to your use case, they should be stored in type: keyword Multi-fields: * url.full.text (type: text) example: |
extended |
|
Unmodified original url as seen in the event source. Note that in network monitoring, the observed URL may be a full URL, whereas in access logs, the URL is often just represented as a path. This field is meant to represent the URL as it was observed, complete or not. type: keyword Multi-fields: * url.original.text (type: text) example: |
extended |
|
Password of the request. type: keyword |
extended |
|
Path of the request, such as "/search". type: keyword |
extended |
|
Port of the request, such as 443. type: long example: |
extended |
|
The query field describes the query string of the request, such as "q=elasticsearch". The type: keyword |
extended |
|
The highest registered url domain, stripped of the subdomain. For example, the registered domain for "foo.example.com" is "example.com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last two labels will not work well for TLDs such as "co.uk". type: keyword example: |
extended |
|
Scheme of the request, such as "https". Note: The type: keyword example: |
extended |
|
The subdomain portion of a fully qualified domain name includes all of the names except the host name under the registered_domain. In a partially qualified domain, or if the the qualification level of the full name cannot be determined, subdomain contains all of the names below the registered domain. For example the subdomain portion of "www.east.mydomain.co.uk" is "east". If the domain has multiple levels of subdomain, such as "sub2.sub1.example.com", the subdomain field should contain "sub2.sub1", with no trailing period. type: keyword example: |
extended |
|
The effective top level domain (eTLD), also known as the domain suffix, is the last part of the domain name. For example, the top level domain for example.com is "com". This value can be determined precisely with a list like the public suffix list (http://publicsuffix.org). Trying to approximate this by simply taking the last label will not work well for effective TLDs such as "co.uk". type: keyword example: |
extended |
|
Username of the request. type: keyword |
extended |
The url
fields are expected to be nested at:
-
threat.enrichments.indicator.url
-
threat.indicator.url
Note also that the url
fields may be used directly at the root of the events.
User Fields
The user fields describe information about the user that is relevant to the event.
Fields can have one entry or multiple entries. If a user has more than one id, provide an array that includes all of them.
Field | Description | Level |
---|---|---|
Name of the directory the user is a member of. For example, an LDAP or Active Directory domain name. type: keyword |
extended |
|
User email address. type: keyword |
extended |
|
Additional information in key/value pairs. For additional information provided by the authenticator to events. Note: Not searchable or filterable. type: object example: |
custom |
|
User’s full name, if available. type: keyword Multi-fields: * user.full_name.text (type: text) example: |
extended |
|
Unique user hash to correlate information for a user in anonymized form. Useful if type: keyword |
extended |
|
Unique identifier of the user. type: keyword |
core |
|
Short name or login of the user. type: keyword Multi-fields: * user.name.text (type: text) example: |
core |
|
Array of user roles at the time of the event. type: keyword Note: this field should contain an array of values. example: |
extended |
The user
fields are expected to be nested at:
-
client.user
-
destination.user
-
host.user
-
server.user
-
source.user
-
user.changes
-
user.effective
-
user.target
Note also that the user
fields may be used directly at the root of the events.
Location | Field Set | Description |
---|---|---|
|
Captures changes made to a user. |
|
|
User whose privileges were assumed. |
|
|
User’s group relevant to the event. |
|
|
Targeted user of action taken. |
For usage and examples of the user fields, please see the User Fields Usage and Examples section.
User Fields Usage and Examples
Here are the subjects covered in this page.
-
[ecs-user-usage-field-reuse], or all places user fields can appear
User fields can be present in any kind of event, without affecting the event’s categorization.
However when the event is about IAM (Identity and Account Management),
it should be categorized as follows. In this section we’ll cover specifically
event.category
and event.type
with regards to IAM activity. Make sure to read
the Categorization section to see all allowed
values, and read more about event.kind
and event.outcome
.
Note
|
IAM activity is a bit particular in that events are expected to be assigned 2 event types. One of them indicates the type of activity (creation, deletion, change, etc.), and the other indicates whether a user or a group is the target of the management activity. |
Many sections of the examples below are elided, in order to focus on the categorization of the events.
Creation of group "test-group":
```JSON { "event": { "kind": "event", "category": ["iam"], <1> "type": ["group", "creation"], <2> "outcome": "success" }, "group": { "name": "test-group", … }, "user": { … }, "related": { "user": [ … ] } } ``` <1> Category "iam" <2> Both relevant event types to a group creation
Adding "test-user" to "test-group":
```JSON
{
"event": {
"kind": "event",
"category": ["iam"], <1>
"type": ["user", "change"], <2>
"action": "user added to group", <3>
"outcome": "success"
},
"user": {
…
"target": { <4>
"name": "test-user",
"group": { "name": "test-group" }
}
},
"related": { "user": [ … ] }
}
```
<1> Category "iam"
<2> Both relevant event types to a user modification
<3> event.action
is not a categorization field, and has no mandated value. It can be populated based on source event details or by a pipeline, to ensure the event captures all subtleties of what’s happening.
<4> How to use all possible user fields is detailed below.
Different systems use different values for user identifiers. Here are a few pointers to help normalize some simple cases.
-
When a system provides a composite value for the user name (e.g. DOMAINNAME\username), capture the domain name in
user.domain
and the user name (without the domain) inuser.name
. -
When a system uses an email address as the main identifier, populate both
user.id
anduser.email
with it.
The user fields can be reused (or appear) in many places across ECS. This makes it possible to capture many users relevant to a single event.
Here’s the full list of places where the user fields can appear:
-
user.*
-
user.effective.*
-
user.target.*
-
user.changes.*
-
source.user.*
-
destination.user.*
-
client.user.*
-
server.user.*
-
host.user.*
(deprecated)
Let’s go over the meaning of each.
The examples below will only populate user.name
and sometimes user.id
inside
the various user
nestings, for readability.
However in implementations, unless otherwise noted, all user
fields that can
reasonably be populated in each location should be populated.
====== User fields at the Root of an Event
The user fields at the root of an event are used to capture the user
performing the main action described by the event. This is especially important
when there’s more than one user present on the event. user.*
fields at the root
of the event represent the user performing the action.
In many cases, events that only mention one user should populate the user fields at the root of the event, even if the user is not the one performing the action.
In cases where a purpose-specific user field such as url.username
is populated,
user.name
should also be populated with the same user name.
{ "url": { "username": "alice" }, <1> "user": { "name": "alice" }, <2> "related": { "user": ["alice"] } }
-
Purpose-specific username field
-
Username copied to
user.name
to establishuser.name
as a reliable baseline.
====== Remote Logons
When users are crossing host boundaries, the users are captured at
source.user
and destination.user
.
Examples of data sources where this is applicable:
-
Remote logons via ssh, kerberos
-
Firewalls observing network traffic
In order to align with ECS' design of having user
at the root of the event as the
user performing the action, all source.user
fields should be copied to user
at the root.
Here’s an example where user "alice" logs on to another host as user "deus":
{ "user": { "name": "alice" }, "source": { "user": { "name": "alice" }, "ip": "10.42.42.42" }, "destination": { "user": { "name": "deus" }, "ip": "10.42.42.43" }, "related": { "user": ["alice", "deus"] } }
Whenever an event source populates the client
and server
fields in addition
to source
and destination
, the user fields should be copied accordingly as well.
You can review [ecs-mapping-network-events] to learn more about
mapping network events.
====== Privilege Changes
The user.effective
fields are relevant when there’s a privilege escalation or demotion
and it’s possible to determine the user requesting/performing the escalation.
Use the user
fields at the root to capture who is requesting the privilege change,
and user.effective
to capture the requested privilege level, whether or not the
privilege change was successful.
Here are examples where this is applicable:
-
A user changing identity on a host.
-
Examples: sudo, su, Run as.
-
-
Running a program as a different user. Examples:
-
A trusted user runs a specific admin command as root via a mechanism such as the Posix setuid/setgid.
-
A service manager with administrator privileges starts child processes as limited users, for security purposes (e.g. root runs Apache HTTPD as user "apache")
-
In cases where the event source only gives information about the effective user
and not who requested different privileges, the user
fields at the root of the
event should be used instead of user.effective
.
Here’s an example of user "alice" running a command as root via sudo:
{ "user": { "name": "alice", "id": "1001", "effective": { "name": "root", "id": "1" } }, "related": { "user": ["alice", "root"] } }
When it’s not possible (or it’s prohibitive) to determine which user is requesting different privilege levels, it’s acceptable to capture the effective user at the root of the event. Typically a privilege change event will already have happened, for example: bob "su" as root; and subsequent events will show the root user performing the actions.
====== Identity and Access Management
Whenever a user is performing an action that affects another user — typically
in IAM scenarios — the user affected by the action is captured at
user.target
. The user performing the IAM activity is captured at the root
of the event.
Examples of IAM activity include:
-
user-a creates or deletes user-b
-
user-a modifies user-b
In the create/delete scenarios, there’s either no prior state (user creation)
or no post state (user deletion). In these cases, only user
at the root and
user.target
must be populated.
Example where "root" creates user "bob":
{ "user": { "name": "root", "id": "1", "target": { "name": "bob", "id": "1002", ... } } "related": { "user": ["bob", "root"] } }
When there’s a change of state to an existing user, user.target
must be used
to capture the prior state of the user, and user.changes
should list only
the changes that were performed.
Example where "root" renames user "bob" to "bob.barker":
{ "user": { "name": "root", "id": "1", "target": { "name": "bob", "id": "1002" }, "changes": { "name": "bob.barker" } }, "related": { "user": ["bob", "bob.barker", "root"] } }
You’ll note in the example above that unmodified attributes like the user ID are
not repeated under user.changes.*
, since they didn’t change.
====== Combining IAM and Privilege Change
We’ve covered above how user.target
and user.changes
can be used at the same time.
If privilege escalation is also present in the same IAM event, user.effective
should of course be used as well.
Here’s the "rename" example from the IAM section above. In the following example, we know "alice" is escalating privileges to "root", in order to modify user "bob":
{ "user": { "name": "alice", "id": "1001", "effective": { "name": "root", "id": "1" }, "target": { "name": "bob", "id": "1002" }, "changes": { "name": "bob.barker" } }, "related": { "user": ["alice", "bob", "bob.barker", "root"] } }
====== Subtleties around field reuse
Most cases of field reuse in ECS are reusing a field set inside a different field set. Two examples of this are:
-
reusing
group
inuser
, resulting in theuser.group.*
fields, or -
reusing
user
indestination
, resulting in thedestination.user.*
fields, which also includedestination.user.group.*
.
The user
fields can also be reused within user
as different names,
representing the role of each relevant user. Examples are the user.target.*
or user.effective.*
fields.
However, it’s important to note that user
fields reused within
user
are not carried around anywhere else.
Let’s illustrate the various permutations of what’s valid and what is not.
Field | Validity | Notes |
---|---|---|
|
Valid |
Normal reuse. |
|
Valid |
The |
|
Valid |
The |
|
Invalid |
The |
In all events in this page, we’ve populated the related.user
fields.
Any event that has users in it should always populate the array field related.user
with all usernames seen in the event; including event names that appear in custom fields.
Note that this field is not a nesting of all user fields,
it’s a flat array meant to contain user identifiers.
Taking the example from user.changes
again, we can see that no matter the role
of the each user (before/after privilege escalation, affected user, username after rename), they are all present in related.user
:
{ "user": { "name": "alice", "id": "1001", "effective": { "name": "root", "id": "1" }, "target": { "name": "bob", "id": "1002" }, "changes": { "name": "bob.barker" } }, "related": { "user": ["alice", "root", "bob", "bob.barker"] } }
Like the other fields in the related field set, related.user
is meant to facilitate
pivoting. For example, if you have a suspicion about user "bob.barker", searching
for this name in related.user
will give you all events related to this user, whether
it’s the creation / rename of the user, or events where this user was active in a system.
For examples of mapping events from various sources, you can look at RFC 0007 in section Source Data.
As of ECS 1.8, host.user.*
fields are deprecated and will be removed at the next
major version of ECS.
User agent Fields
The user_agent fields normally come from a browser request.
They often show up in web service logs coming from the parsed user agent string.
Field | Description | Level |
---|---|---|
Name of the device. type: keyword example: |
extended |
|
Name of the user agent. type: keyword example: |
extended |
|
Unparsed user_agent string. type: keyword Multi-fields: * user_agent.original.text (type: text) example: |
extended |
|
Version of the user agent. type: keyword example: |
extended |
Location | Field Set | Description |
---|---|---|
|
OS fields contain information about the operating system. |
ECS Categorization Fields
At a high level, ECS provides fields to classify events in two different ways:
"Where it’s from" (e.g., event.module
, event.dataset
, agent.type
, observer.type
, etc.),
and "What it is." The categorization fields hold the "What it is" information,
independent of the source of the events.
ECS defines four categorization fields for this purpose, each of which falls under the event.*
field set.
Categorization Fields
Note
|
If your events don’t match any of these categorization values, you should leave the fields empty. This will ensure you can start populating the fields once the appropriate categorization values are published, in a later release. |
Categorization Usage
Using the categorization fields contains examples combining the categorization fields to classify different types of events.
ECS Categorization Field: event.kind
This is one of four ECS Categorization Fields, and indicates the highest level in the ECS category hierarchy.
event.kind
gives high-level information about what type of information the event contains, without being specific to the contents of the event. For example, values of this field distinguish alert events from metric events.
The value of this field can be used to inform how these kinds of events should be handled. They may warrant different retention, different access control, it may also help understand whether the data coming in at a regular interval or not.
Allowed Values
alert
This value indicates an event such as an alert or notable event, triggered by a detection rule executing externally to the Elastic Stack.
event.kind:alert
is often populated for events coming from firewalls, intrusion detection systems, endpoint detection and response systems, and so on.
This value is not used by Elastic solutions for alert documents that are created by rules executing within the Kibana alerting framework.
enrichment
The enrichment
value indicates an event collected to provide additional context, often to other events.
An example is collecting indicators of compromise (IOCs) from a threat intelligence provider with the intent to use those values to enrich other events. The IOC events from the intelligence provider should be categorized as event.kind:enrichment
.
event
This value is the most general and most common value for this field. It is used to represent events that indicate that something happened.
metric
This value is used to indicate that this event describes a numeric measurement taken at given point in time.
Examples include CPU utilization, memory usage, or device temperature.
Metric events are often collected on a predictable frequency, such as once every few seconds, or once a minute, but can also be used to describe ad-hoc numeric metric queries.
state
The state value is similar to metric, indicating that this event describes a measurement taken at given point in time, except that the measurement does not result in a numeric value, but rather one of a fixed set of categorical values that represent conditions or states.
Examples include periodic events reporting Elasticsearch cluster state (green/yellow/red), the state of a TCP connection (open, closed, fin_wait, etc.), the state of a host with respect to a software vulnerability (vulnerable, not vulnerable), and the state of a system regarding compliance with a regulatory standard (compliant, not compliant).
Note that an event that describes a change of state would not use event.kind:state
, but instead would use event.kind:event since a state change fits the more general event definition of something that happened.
State events are often collected on a predictable frequency, such as once every few seconds, once a minute, once an hour, or once a day, but can also be used to describe ad-hoc state queries.
pipeline_error
This value indicates that an error occurred during the ingestion of this event, and that event data may be missing, inconsistent, or incorrect. event.kind:pipeline_error
is often associated with parsing errors.
signal
This value is used by Elastic solutions (e.g., Security, Observability) for alert documents that are created by rules executing within the Kibana alerting framework.
Usage of this value is reserved, and data ingestion pipelines must not populate event.kind
with the value "signal".
ECS Categorization Field: event.category
This is one of four ECS Categorization Fields, and indicates the second level in the ECS category hierarchy.
event.category
represents the "big buckets" of ECS categories. For example, filtering on event.category:process
yields all events relating to process activity. This field is closely related to event.type
, which is used as a subcategory.
This field is an array. This will allow proper categorization of some events that fall in multiple categories.
Allowed Values
authentication
Events in this category are related to the challenge and response process in which credentials are supplied and verified to allow the creation of a session. Common sources for these logs are Windows event logs and ssh logs. Visualize and analyze events in this category to look for failed logins, and other authentication-related activity.
Expected event types for category authentication:
start, end, info
configuration
Events in the configuration category have to deal with creating, modifying, or deleting the settings or parameters of an application, process, or system.
Example sources include security policy change logs, configuration auditing logging, and system integrity monitoring.
Expected event types for category configuration:
access, change, creation, deletion, info
database
The database category denotes events and metrics relating to a data storage and retrieval system. Note that use of this category is not limited to relational database systems. Examples include event logs from MS SQL, MySQL, Elasticsearch, MongoDB, etc. Use this category to visualize and analyze database activity such as accesses and changes.
Expected event types for category database:
access, change, info, error
driver
Events in the driver category have to do with operating system device drivers and similar software entities such as Windows drivers, kernel extensions, kernel modules, etc.
Use events and metrics in this category to visualize and analyze driver-related activity and status on hosts.
Expected event types for category driver:
change, end, info, start
file
Relating to a set of information that has been created on, or has existed on a filesystem. Use this category of events to visualize and analyze the creation, access, and deletions of files. Events in this category can come from both host-based and network-based sources. An example source of a network-based detection of a file transfer would be the Zeek file.log.
Expected event types for category file:
change, creation, deletion, info
host
Use this category to visualize and analyze information such as host inventory or host lifecycle events.
Most of the events in this category can usually be observed from the outside, such as from a hypervisor or a control plane’s point of view. Some can also be seen from within, such as "start" or "end".
Note that this category is for information about hosts themselves; it is not meant to capture activity "happening on a host".
Expected event types for category host:
access, change, end, info, start
iam
Identity and access management (IAM) events relating to users, groups, and administration. Use this category to visualize and analyze IAM-related logs and data from active directory, LDAP, Okta, Duo, and other IAM systems.
Expected event types for category iam:
admin, change, creation, deletion, group, info, user
intrusion_detection
Relating to intrusion detections from IDS/IPS systems and functions, both network and host-based. Use this category to visualize and analyze intrusion detection alerts from systems such as Snort, Suricata, and Palo Alto threat detections.
Expected event types for category intrusion_detection:
allowed, denied, info
malware
Malware detection events and alerts. Use this category to visualize and analyze malware detections from EDR/EPP systems such as Elastic Endpoint Security, Symantec Endpoint Protection, Crowdstrike, and network IDS/IPS systems such as Suricata, or other sources of malware-related events such as Palo Alto Networks threat logs and Wildfire logs.
Expected event types for category malware:
info
network
Relating to all network activity, including network connection lifecycle, network traffic, and essentially any event that includes an IP address. Many events containing decoded network protocol transactions fit into this category. Use events in this category to visualize or analyze counts of network ports, protocols, addresses, geolocation information, etc.
Expected event types for category network:
access, allowed, connection, denied, end, info, protocol, start
package
Relating to software packages installed on hosts. Use this category to visualize and analyze inventory of software installed on various hosts, or to determine host vulnerability in the absence of vulnerability scan data.
Expected event types for category package:
access, change, deletion, info, installation, start
process
Use this category of events to visualize and analyze process-specific information such as lifecycle events or process ancestry.
Expected event types for category process:
access, change, end, info, start
registry
Having to do with settings and assets stored in the Windows registry. Use this category to visualize and analyze activity such as registry access and modifications.
Expected event types for category registry:
access, change, creation, deletion
session
The session category is applied to events and metrics regarding logical persistent connections to hosts and services. Use this category to visualize and analyze interactive or automated persistent connections between assets. Data for this category may come from Windows Event logs, SSH logs, or stateless sessions such as HTTP cookie-based sessions, etc.
Expected event types for category session:
start, end, info
threat
Use this category to visualize and analyze events describing threat actors' targets, motives, or behaviors.
Expected event types for category threat:
indicator
web
Relating to web server access. Use this category to create a dashboard of web server/proxy activity from apache, IIS, nginx web servers, etc. Note: events from network observers such as Zeek http log may also be included in this category.
Expected event types for category web:
access, error, info
ECS Categorization Field: event.type
This is one of four ECS Categorization Fields, and indicates the third level in the ECS category hierarchy.
event.type
represents a categorization "sub-bucket" that, when used along with the event.category
field values, enables filtering events down to a level appropriate for single visualization.
This field is an array. This will allow proper categorization of some events that fall in multiple event types.
Allowed Values
access
The access event type is used for the subset of events within a category that indicate that something was accessed. Common examples include event.category:database AND event.type:access
, or event.category:file AND event.type:access
. Note for file access, both directory listings and file opens should be included in this subcategory. You can further distinguish access operations using the ECS event.action
field.
admin
The admin event type is used for the subset of events within a category that are related to admin objects. For example, administrative changes within an IAM framework that do not specifically affect a user or group (e.g., adding new applications to a federation solution or connecting discrete forests in Active Directory) would fall into this subcategory. Common example: event.category:iam AND event.type:change AND event.type:admin
. You can further distinguish admin operations using the ECS event.action
field.
allowed
The allowed event type is used for the subset of events within a category that indicate that something was allowed. Common examples include event.category:network AND event.type:connection AND event.type:allowed
(to indicate a network firewall event for which the firewall disposition was to allow the connection to complete) and event.category:intrusion_detection AND event.type:allowed
(to indicate a network intrusion prevention system event for which the IPS disposition was to allow the connection to complete). You can further distinguish allowed operations using the ECS event.action
field, populating with values of your choosing, such as "allow", "detect", or "pass".
change
The change event type is used for the subset of events within a category that indicate that something has changed. If semantics best describe an event as modified, then include them in this subcategory. Common examples include event.category:process AND event.type:change
, and event.category:file AND event.type:change
. You can further distinguish change operations using the ECS event.action
field.
connection
Used primarily with event.category:network
this value is used for the subset of network traffic that includes sufficient information for the event to be included in flow or connection analysis. Events in this subcategory will contain at least source and destination IP addresses, source and destination TCP/UDP ports, and will usually contain counts of bytes and/or packets transferred. Events in this subcategory may contain unidirectional or bidirectional information, including summary information. Use this subcategory to visualize and analyze network connections. Flow analysis, including Netflow, IPFIX, and other flow-related events fit in this subcategory. Note that firewall events from many Next-Generation Firewall (NGFW) devices will also fit into this subcategory. A common filter for flow/connection information would be event.category:network AND event.type:connection AND event.type:end
(to view or analyze all completed network connections, ignoring mid-flow reports). You can further distinguish connection events using the ECS event.action
field, populating with values of your choosing, such as "timeout", or "reset".
creation
The "creation" event type is used for the subset of events within a category that indicate that something was created. A common example is event.category:file AND event.type:creation
.
deletion
The deletion event type is used for the subset of events within a category that indicate that something was deleted. A common example is event.category:file AND event.type:deletion
to indicate that a file has been deleted.
denied
The denied event type is used for the subset of events within a category that indicate that something was denied. Common examples include event.category:network AND event.type:denied
(to indicate a network firewall event for which the firewall disposition was to deny the connection) and event.category:intrusion_detection AND event.type:denied
(to indicate a network intrusion prevention system event for which the IPS disposition was to deny the connection to complete). You can further distinguish denied operations using the ECS event.action
field, populating with values of your choosing, such as "blocked", "dropped", or "quarantined".
end
The end event type is used for the subset of events within a category that indicate something has ended. A common example is event.category:process AND event.type:end
.
error
The error event type is used for the subset of events within a category that indicate or describe an error. A common example is event.category:database AND event.type:error
. Note that pipeline errors that occur during the event ingestion process should not use this event.type
value. Instead, they should use event.kind:pipeline_error
.
group
The group event type is used for the subset of events within a category that are related to group objects. Common example: event.category:iam AND event.type:creation AND event.type:group
. You can further distinguish group operations using the ECS event.action
field.
indicator
The indicator event type is used for the subset of events within a category that contain details about indicators of compromise (IOCs).
A common example is event.category:threat AND event.type:indicator
.
info
The info event type is used for the subset of events within a category that indicate that they are purely informational, and don’t report a state change, or any type of action. For example, an initial run of a file integrity monitoring system (FIM), where an agent reports all files under management, would fall into the "info" subcategory. Similarly, an event containing a dump of all currently running processes (as opposed to reporting that a process started/ended) would fall into the "info" subcategory. An additional common examples is event.category:intrusion_detection AND event.type:info
.
installation
The installation event type is used for the subset of events within a category that indicate that something was installed. A common example is event.category:package
AND event.type:installation
.
protocol
The protocol event type is used for the subset of events within a category that indicate that they contain protocol details or analysis, beyond simply identifying the protocol. Generally, network events that contain specific protocol details will fall into this subcategory. A common example is event.category:network AND event.type:protocol AND event.type:connection AND event.type:end
(to indicate that the event is a network connection event sent at the end of a connection that also includes a protocol detail breakdown). Note that events that only indicate the name or id of the protocol should not use the protocol value. Further note that when the protocol subcategory is used, the identified protocol is populated in the ECS network.protocol
field.
start
The start event type is used for the subset of events within a category that indicate something has started. A common example is event.category:process AND event.type:start
.
user
The user event type is used for the subset of events within a category that are related to user objects. Common example: event.category:iam AND event.type:deletion AND event.type:user
. You can further distinguish user operations using the ECS event.action
field.
ECS Categorization Field: event.outcome
This is one of four ECS Categorization Fields, and indicates the lowest level in the ECS category hierarchy.
event.outcome
simply denotes whether the event represents a success or a failure from the perspective of the entity that produced the event.
Note that when a single transaction is described in multiple events, each event may populate different values of event.outcome
, according to their perspective.
Also note that in the case of a compound event (a single event that contains multiple logical events), this field should be populated with the value that best captures the overall success or failure from the perspective of the event producer.
Further note that not all events will have an associated outcome. For example, this field is generally not populated for metric events, events with event.type:info
, or any events for which an outcome does not make logical sense.
Allowed Values
failure
Indicates that this event describes a failed result. A common example is event.category:file AND event.type:access AND event.outcome:failure
to indicate that a file access was attempted, but was not successful.
success
Indicates that this event describes a successful result. A common example is event.category:file AND event.type:create AND event.outcome:success
to indicate that a file was successfully created.
unknown
Indicates that this event describes only an attempt for which the result is unknown from the perspective of the event producer. For example, if the event contains information only about the request side of a transaction that results in a response, populating event.outcome:unknown
in the request event is appropriate. The unknown value should not be used when an outcome doesn’t make logical sense for the event. In such cases event.outcome
should not be populated.
Using the Categorization Fields
The event categorization fields work together to identify and group similar events from multiple data sources.
These general principles can help guide the categorization process:
-
Events from multiple data sources that are similar enough to be viewed or analyzed together, should fall into the same
event.category
field. -
Both
event.category
andevent.type
are arrays and may be populated with multiple allowed values, if the event can be reasonably classified into more than one category and/or type. -
event.kind
,event.category
,event.type
andevent.outcome
all have allowed values. This is to normalize these fields. Values that aren’t in the list of allowed values should not be used. -
Values of
event.outcome
are a very limited set to indicate success or failure. Domain-specific actions, such as deny and allow, that could be considered outcomes are not captured in theevent.outcome
field, but rather in theevent.type
and/orevent.action
fields. -
Values of
event.category
,event.type
, andevent.outcome
are consistent across all values ofevent.kind
. -
When a specific event doesn’t fit into any of the defined allowed categorization values, the field should be left empty.
The following examples detail populating the categorization fields and provides some context for the classification decisions.
Firewall blocking a network connection
This event from a firewall describes a successfully blocked network connection:
... { "source": { "address": "10.42.42.42", "ip": "10.42.42.42", "port": 38842 }, "destination": { "address": "10.42.42.1", "ip": "10.42.42.1", "port": 443 }, "rule": { "name": "wan-lan", "id": "default" }, ... "event": { "kind": "event", <1> "category": [ <2> "network" ], "type": [ <3> "connection", "denied" ], "outcome": "success", <4> "action": "dropped" <5> } } ...
-
Classifying as an
event
. -
event.category
categorizes this event asnetwork
activity. -
The event was both an attempted network
connection
and wasdenied
. -
The blocking of this connection is expected. The outcome is a
success
from the perspective of the firewall emitting the event. -
The firewall classifies this denied connection as
dropped
, and this value is captured inevent.action
.
A "denied" network connection could fall under different action values: "blocked", "dropped", "quarantined", etc. The event.action
field captures the action taken as described by the source, and populating event.type:denied
provides an independent, normalized value.
A single query will return all denied network connections which have been normalized with the same categorization values:
event.category:network AND event.type:denied
Failed attempt to create a user account
User alice
attempts to add a user account, bob
, into a directory service, but the action fails:
{ "user": { "name": "alice", "target": { "name": "bob" } }, "event": { "kind": "event", <1> "category": [ <2> "iam" ], "type": [ <3> "user", "creation" ], "outcome": "failure" <4> } }
-
Again classifying as an
event
. -
Categorized using
iam
for an event user account activity. -
Both
user
andcreation
-
The creation of a user account was attempted, but it was not successful.
Informational listing of a file
A utility, such as a file integrity monitoring (FIM) application, takes inventory of a file but does not access or modify the file:
{ "file": { "name": "example.png", "owner": "alice", "path": "/home/alice/example.png", "type": "file" }, "event": { "kind": "event", <1> "category": [ <2> "file" ], "type": [ <3> "info" ] } }
-
Classifying as
event
. -
The event is reporting on a
file
. -
The
info
type categorizes purely informational events. The target file here was not accessed or modified.
The source data didn’t include any context around the event’s outcome, so event.outcome
should not be populated.
Security application failed to block a network connection
An intrusion detection system (IDS) attempts to block a connection but fails. The event emitted by the IDS is considered an alert:
{ "source": { "address": "10.42.42.42", "ip": "10.42.42.42", "port": 38842 }, "destination": { "address": "10.42.42.1", "ip": "10.42.42.1", "port": 443 }, ... "event": { "kind": "alert", <1> "category": [ <2> "intrusion_detection", "network" ], "type": [ <3> "connection", "denied" ], "outcome": "failure" <4> } }
-
The IDS emitted this event when a detection rule generated an alert. The
event.kind
is set toalert
. -
With the event emitted from a network IDS device, the event is categorized both as
network
andintrusion_detection
. -
The alert event is a ‘connection` that was
denied
by the IDS’ configuration. -
The IDS experienced an issue when attempting to deny the connection. Since the action taken by the IDS failed, the outcome is set as
failure
.
Migrating to ECS
There are multiple ways to reap the benefit of ECS. The simplest is to use [ecs-products-solutions].
If you have a custom pipeline or application you would like to convert to ECS, please have a look at [ecs-converting].
Products and Solutions that Support ECS
The following Elastic products support ECS out of the box, as of version 7.0:
-
-
-
-
-
-
Log formatters that support ECS out of the box for various languages can be found here.
-
-
Converting a Custom Implementation
A common schema helps you correlate and use data from various sources.
Fields for most Elastic modules and solutions (version 7.0 and later) are mapped to the Elastic Common Schema. You may want to map data from other implementations to ECS to help you correlate data across all of your products and solutions.
Before you start a conversion, be sure that you understand the basics below.
Core and extended levels
Make sure you understand the distinction between Core and Extended fields, as explained in the [ecs-guidelines].
Core and Extended fields are documented in the [ecs-field-reference] or, for a single page representation of all fields, please see the /generated/csv/fields.csv[generated CSV of fields].
An approach to mapping an existing implementation
Here’s the recommended approach for converting an existing implementation to ECS.
-
Review each field in your original event and map it to the relevant ECS field.
-
Start by mapping your field to the relevant ECS Core field.
-
If a relevant ECS Core field does not exist, map your field to the relevant ECS Extended field.
-
If no relevant ECS Extended field exists, consider keeping your field with its original details, or possibly renaming it using ECS naming guidelines and attempt to map one or more of your original event fields to it.
-
-
Review each ECS Core field, and attempt to populate it.
-
Review your original event data again
-
Consider populating the field based on additional meta-data such as static information (e.g. add
event.category:authentication
even if your auth events don’t mention the word "authentication") -
Consider capturing additional environment meta-data, such as information about the host, container or cloud instance.
-
-
Review other extended fields from any field set you are already using, and attempt to populate it as well.
-
Set
ecs.version
to the version of the schema you are conforming to. This will allow you to upgrade your sources, pipelines and content (like dashboards) smoothly in the future.
Using a spreadsheet to plan your migration
Using a spreadsheet to plan the migration from pre-existing source fields to ECS is a common practice. It’s a good way to address each of your fields methodically among colleagues.
If the data source is either a structured log, or if you already have a pipeline producing events with these non-ECS field names, the tool ECS Mapper may help you get started in performing all of these field renames.
After exporting your mapping spreadsheet to CSV, ECS Mapper will convert your field mapping to equivalent pipelines for Beats, Elasticsearch, and Logstash. Learn more at ECS Mapper.
Additional Information
Questions and Answers
What are the benefits of using ECS?
The benefits to a user adopting these fields and names in their clusters are:
-
Data correlation. Ability to easily correlate data from the same or different sources, including:
-
data from metrics, logs, and application performance management (apm) tools
-
data from the same machines/hosts
-
data from the same service
-
-
Ease of recall. Improved ability to remember commonly used field names (because there is a single set, not a set per data source)
-
Ease of deduction. Improved ability to deduce field names (because the field naming follows a small number of rules with few exceptions)
-
Reuse. Ability to re-use analysis content (searches, visualizations, dashboards, alerts, reports, and machine learning jobs) across multiple data sources
-
Future proofing. Ability to use any future Elastic-provided analysis content in your environment without modifications
What if I have fields that conflict with ECS?
The
processor] can help you resolve field conflicts. For example, imagine that you
already have a field called "user," but ECS employs user
as an object. You can
use the rename processor on ingest time to rename your field to the matching ECS
field. If your field does not match ECS, you can rename your field to
user.value
instead.
What if my events have additional fields?
Events may contain fields in addition to ECS fields. These fields can follow the ECS naming and writing rules, but this is not a requirement.
Why does ECS use a dot notation instead of an underline notation?
There are two common key formats for ingesting data into Elasticsearch:
-
Dot notation:
user.firstname: Nicolas
,user.lastname: Ruflin
-
Underline notation:
user_firstname: Nicolas
,user_lastname: Ruflin
ECS uses the dot notation to represent nested objects.
What is the difference between the two notations?
Ingesting user.firstname: Nicolas
and user.lastname: Ruflin
is identical to ingesting the following JSON:
"user": { "firstname": "Nicolas", "lastname": "Ruflin" }
datatype]. In the case of the underline notation, both are just
Advantages of dot notation
With dot notation, each prefix in Elasticsearch is an object. Each object can have that control how fields inside the object are treated. In the context of ECS, for example, these parameters would allow you to disable dynamic property creation for certain prefixes.
Individual objects give you more flexibility on both the ingest and the event sides. In Elasticsearch, for example, you can use the remove processor to drop complete objects instead of selecting each key inside. You don’t have to know ahead of time which keys will be in an object.
In Beats, you can simplify the creation of events. For example, you can treat each object as an object (or struct in Golang), which makes constructing and modifying each part of the final event easier.
Disadvantage of dot notation
In Elasticsearch, each key can have only one type. For example, if user
is an
object
, you can’t use it as a keyword
type in the same index, like {"user":
"nicolas ruflin"}
. This restriction can be an issue in certain datasets. For
the ECS data itself, this is not an issue because all fields are predefined.
What if I already use the underline notation?
As long as there are no conflicts, underline notation and ECS dot notation can coexist in the same document.
Glossary of ECS Terms
- ECS
-
Elastic Common Schema. The Elastic Common Schema (ECS) is a document schema for Elasticsearch, for use cases such as logging and metrics. ECS defines a common set of fields, their datatype, and gives guidance on their correct usage. ECS is used to improve uniformity of event data coming from different sources.
Contributing to ECS
All information related to ECS is versioned in the elastic/ecs repository. All changes to ECS happen through Pull Requests submitted through Git.
See the Contribution Guidelines.
Generated Artifacts
ECS maintains a collection of artifacts which are generated based on the schema. Examples include Elasticsearch index templates, CSV, and Beats field mappings. The maintained artifacts can be found in the /generated#artifacts-generated-from-ecs[ECS Github repo].
Users can generate custom versions of these artifacts using the ECS project’s tooling. See the tooling /USAGE.md[usage documentation] for more detail.