further .text and keyword discussion
See original GitHub issuefor security use cases, large error messages, etc… .text
is an important aspect in searching.
pro(s):
- faster searches
- severely reduces impact on the backend (and ultimately customer/user experience)
con(s):
- increases storage
temporary proposal to meet in the middle (for now) instead of adding .text
into the dynamic template for all strings fields VS having no .text
fields at all
ECS and thus corresponding beats index templates have .text
field for high value/impacting fields. List of fields off the top of my head would be:
process.executable.text
process.args.text
url.original
user.email
user_agent.original
error.stack_trace
file.path
host.name
- any
.user.name
fields - any
.domain
fields - any
.as.organization.name
fields http.request.body.content
http.response.body.content
os.name.full
existing ECS reference: https://github.com/elastic/ecs/issues/340 https://github.com/elastic/ecs/issues/104
Issue Analytics
- State:
- Created 4 years ago
- Comments:11 (6 by maintainers)
Top Results From Across the Web
When to use the keyword type vs text datatype in Elasticsearch
In this article, we'll look at some important differences between these types and discuss when to use a keyword vs a text datatype...
Read more >Elasticsearch: Text vs. Keyword - Code Curated
The crucial difference between them is that Elasticsearch will analyze the Text while it won't analyze Keyword type.
Read more >Text type family | Elasticsearch Guide [master] | Elastic
Most users who want to do more with text fields use multi-field mappings by having both a text field for full text searches,...
Read more >Keyword Density in Your Body Text: SEO for Beginners Tutorial
Why you should organically use keywords in your body text. ... I want to discuss using keywords throughout your body or the body...
Read more >Keyword Reading Strategies | How to Find Keywords in a Text
Here are my favorite strategies for finding keywords when studying, to increase reading comprehension, and maximize learning.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
So I reviewed all of our current rule sets and there are a few more ECS fields that are used by hunting searches. This list is not ranked but the process fields are clearly the most heavily used and process.args / process.command_line may be the most frequently used. These are the fields not listed above:
process.command_line process.name process.parent.args process.parent.executable process.pe.original_file_name process.pe.signer_name process.working_directory
There are also 13 non-ECS fields that are used by hunting searches or by ML job datafeed queries in addition to 12 ECS fields for a total of 25. The complete breakdown is in here: https://docs.google.com/spreadsheets/d/1xOicLbct4Vk10hj4_Xaa1ViYjU9W_unFZ_Rv_mTe-ks/edit#gid=0
I would add
os.name.full
to the list. See why in https://github.com/elastic/ecs/issues/576#issuecomment-540748059