ELK Logstash configuration syntax (24th)

彩媛 posted on 2023-9-24 00:00:00

<h3>Data types</h3>
Logstash supports the following data types:
<ul>
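<li>array 
An array can hold a single string value or several. <code>path => [ "/var/log/messages", "/var/log/*.log" ]</code> 
<code>path => "/data/mysql/mysql.log"</code> If the same setting is specified more than once, the values are appended to the array. In this example the path array contains three string elements.</li>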
<li>boolean 
A boolean must be either true or false. The true and false keywords are written without quotes. <code>ssl_enable => true</code>
</li>
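<li>bytes 
A string that represents a size in bytes. Both SI units (k M G T P E Z Y) and binary units (Ki Mi Gi Ti Pi Ei Zi Yi) are supported. Binary units are base 1024 and SI units are base 1000. The value is case-insensitive, and whitespace between the number and the unit is ignored. Without a unit, the number is taken as bytes. <code>my_bytes => "1113" # 1113 bytes</code> 
<code>my_bytes => "10MiB" # 10485760 bytes</code> 
<code>my_bytes => "100kib" # 102400 bytes</code> 
<code>my_bytes => "180 mb" # 180000000 bytes</code>
</li>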
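<li>codec 
The name of the Logstash codec used to represent the data. Codecs can be used in both input and output sections. With a suitable codec on the input and output, the data is decoded and encoded there, so no separate filter is needed for that step. <code>codec => "json"</code>
</li>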
<li>hash 
A collection of key-value pairs. Note that multiple entries are separated by spaces, not commas. <code>match => { "field1" => "value1" "field2" => "value2" ... }</code>
</li>
<li>number 
Must be a valid numeric value, either floating point or integer. <code>port => 33</code>
</li>
<li>password 
A single string value that is not logged or printed. <code>my_password => "password"</code>
</li>
<li>path 
A string that represents a valid operating system path. <code>my_path => "/tmp/logstash"</code>
</li>
<li>string 
A single character sequence, enclosed in either double or single quotes. <code>name => "Hello world"</code> 
<code>name => 'It\'s a beautiful day'</code>
</li>
</ul>
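As a rough sketch of how several of these types look side by side, here is a single file input block. The option names are ones the file input and the common input options provide; the paths, the interval, and the added field values are made up for illustration.<pre class="brush:bash;toolbar:false">input {
  file {
    # array: two patterns, both are watched
    path => [ "/var/log/messages", "/var/log/*.log" ]
    # string: read existing files from the start
    start_position => "beginning"
    # number: seconds between checks for new data
    stat_interval => 1
    # hash: entries separated by spaces, not commas (values are hypothetical)
    add_field => { "env" => "test" "team" => "ops" }
    # codec: how each line is decoded
    codec => "plain"
  }
}</pre>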
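<h3>Field references</h3>
Logstash has its own field reference syntax. To use the value of a field in a Logstash configuration, write the field name inside square brackets [], for example <code>[request]</code>; this is called a field reference. Pay attention to the field hierarchy. A top-level field can be referenced by its name alone, without the brackets. A nested field needs the full path, in the form <code>[top-level field][nested field]</code>.
The event below has five top-level fields (agent, ip, request, response, ua) and three nested fields (status, bytes, os).<pre class="brush:bash;toolbar:false">{
"agent": "Mozilla/5.0 (compatible; MSIE 9.0)",
"ip": "192.168.24.44",
"request": "/index.html",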
"response": {
"status": 200,
"bytes": 52353
},
"ua": {
"os": "Windows 7"
}
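}</pre>To reference the os field, specify <code>[ua][os]</code>. To reference a top-level field such as request, the field name <code>request</code> alone is enough.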
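A minimal sketch of nested references in a filter, assuming events shaped like the one above. It uses the mutate filter's rename option and the common remove_field option; the target field name client_os is hypothetical.<pre class="brush:bash;toolbar:false">filter {
  mutate {
    # Move the nested os field to a new top-level field (name is hypothetical)
    rename => { "[ua][os]" => "client_os" }
    # Drop one nested field by its full path; [response][status] is untouched
    remove_field => [ "[response][bytes]" ]
  }
}</pre>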
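<h3>sprintf format</h3>
The field reference format is also used in what Logstash calls the sprintf format, which lets you refer to field values from within other strings. For example:<pre class="brush:bash;toolbar:false">output {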
statsd {
increment => "apache.%{[response][status]}"
}
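}</pre>The same syntax can format the event timestamp: a <code>%{+FORMAT}</code> reference is replaced by the event's @timestamp in the given format. For example:<pre class="brush:bash;toolbar:false">output {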
file {
path => "/var/log/%{type}.%{+yyyy.MM.dd.HH}"
}
}</pre>
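Another common place for the sprintf format is an index name. The sketch below assumes an Elasticsearch node at localhost:9200 and events that carry a lowercase type field; both are assumptions, not something the examples above set up. If a referenced field is missing from an event, the literal <code>%{type}</code> text is left in the string.<pre class="brush:bash;toolbar:false">output {
  elasticsearch {
    # Assumed address of a local node
    hosts => [ "localhost:9200" ]
    # One index per type and per day, built from a field and the event timestamp
    index => "logs-%{type}-%{+yyyy.MM.dd}"
  }
}</pre>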
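<h3>Conditionals</h3>
Use conditionals to have a filter or output process only the events that meet certain criteria.
Logstash conditionals look and behave like those in programming languages. They support if, else if, and else statements, and they can be nested.
The syntax is:<pre class="brush:bash;toolbar:false">if EXPRESSION {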
...
} else if EXPRESSION {
...
} else {
...
}</pre>Comparison operators:
<ul class="itemizedlist" type="disc">
<li class="listitem">Equality and ordering: <code class="literal">==</code>, <code class="literal">!=</code>, <code class="literal">&lt;</code>, <code class="literal">&gt;</code>, <code class="literal">&lt;=</code>, <code class="literal">&gt;=</code>
</li>
<li class="listitem">Regular expressions: <code class="literal">=~</code> (matches the pattern), <code class="literal">!~</code> (does not match the pattern)
</li>
<li class="listitem">Inclusion: <code class="literal">in</code> (contains), <code class="literal">not in</code> (does not contain)
</li>
</ul>
Boolean operators:
<ul class="itemizedlist" type="disc">
<li class="listitem">
<code class="literal">and</code>, <code class="literal">or</code>, <code class="literal">nand</code> (not and), <code class="literal">xor</code> (exclusive or)
</li>
</ul>
Unary operators:
<ul class="itemizedlist" type="disc">
<li class="listitem"><code class="literal">!</code> (negation)</li>
<li class="listitem">
<code>()</code> (groups a compound expression), <code>!()</code> (negates the result of a compound expression)</li>
</ul>
For example, this mutate filter removes the secret field from events whose action field is login:<pre class="brush:bash;toolbar:false">filter {
if [action] == "login" {
mutate { remove_field => "secret" }
}
}</pre>Several expressions can be combined in a single condition:<pre class="brush:bash;toolbar:false">output {
# Send production errors to pagerduty
if [loglevel] == "ERROR" and [deployment] == "production" {
pagerduty {
...
}
}
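}</pre>The in operator tests whether a value is contained in a field, a string, or a list:<pre class="brush:bash;toolbar:false">filter {
# [foo], [foobar], [greeting], [missing] and [alsomissing] are example field names
if [foo] in [foobar] {
mutate { add_tag => "field in field" }
}
if [foo] in "foo" {
mutate { add_tag => "field in string" }
}
if "hello" in [greeting] {
mutate { add_tag => "string in field" }
}
if [foo] in ["hello", "world", "foo"] {
mutate { add_tag => "field in list" }
}
if [missing] in [alsomissing] {
mutate { add_tag => "shouldnotexist" }
}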
if !("foo" in ["hello", "world"]) {
mutate { add_tag => "shouldexist" }
}
}</pre><pre class="brush:bash;toolbar:false">output {
if "_grokparsefailure" not in [tags] {
elasticsearch { ... }
}
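}</pre>Field references, the sprintf format, and conditionals depend on events and their fields, so they work only in filter and output blocks, not in input blocks.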
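The examples above do not use the regular expression operator or an else branch, so here is a small sketch that combines them. It assumes events with a request field and a numeric [response][status] field, as in the field reference example; the tag names are hypothetical.<pre class="brush:bash;toolbar:false">filter {
  # Regex match against a top-level field
  if [request] =~ /\.html$/ {
    mutate { add_tag => [ "page" ] }
  # Numeric comparison against a nested field (assumed to hold a number)
  } else if [response][status] >= 500 {
    mutate { add_tag => [ "server_error" ] }
  } else {
    mutate { add_tag => [ "other" ] }
  }
}</pre>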
<h3>The @metadata field</h3>
Starting with Logstash 1.5, there is a special field called @metadata. Its contents are not part of the event at output time.<pre class="brush:bash;toolbar:false">input { stdin { } }

filter {
mutate { add_field => { "show" => "This data will be in the output" } }
mutate { add_field => { "[@metadata][test]" => "Hello" } }
mutate { add_field => { "[@metadata][no_show]" => "This data will not be in the output" } }
}

output {
if [@metadata][test] == "Hello" {
stdout { codec => rubydebug }
}
}</pre>The output:<pre class="brush:bash;toolbar:false">$ bin/logstash -f ../test.conf
Logstash startup completed
asdf
{
 "message" => "asdf",
 "@version" => "1",
"@timestamp" => "2015-03-18T23:09:29.595Z",
 "host" => "www.ttlsa.com",
 "show" => "This data will be in the output"
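}</pre>The typed line "asdf" became the content of the message field. The condition on the test field nested inside @metadata evaluated to true, yet the output shows neither the @metadata field nor its contents.
However, if you set metadata => true, the rubydebug codec displays the contents of the @metadata field.<pre class="brush:bash;toolbar:false">stdout { codec => rubydebug { metadata => true } }</pre>The output then looks like this:<pre class="brush:bash;toolbar:false">$ bin/logstash -f ../test.conf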
Logstash startup completed
asdf
{
 "message" => "asdf",
 "@version" => "1",
"@timestamp" => "2015-03-18T23:10:19.859Z",
 "host" => "www.ttlsa.com",
 "show" => "This data will be in the output",
"@metadata" => {
 "test" => "Hello",
 "no_show" => "This data will not be in the output"
}
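}</pre>Now the @metadata field and its subfields are visible.
Note: only the rubydebug codec can display the contents of the @metadata field.
Use the @metadata field for values that are needed only temporarily and should not appear in the final output. The most common case is the date filter, which needs a temporary timestamp field to parse. For example:<pre class="brush:bash;toolbar:false">input { stdin { } }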

filter {
grok { match => [ "message", "%{HTTPDATE:[@metadata][timestamp]}" ] }
date { match => [ "[@metadata][timestamp]", "dd/MMM/yyyy:HH:mm:ss Z" ] }
}

output {
stdout { codec => rubydebug }
}</pre>The output:<pre class="brush:bash;toolbar:false">$ bin/logstash -f ../test.conf
Logstash startup completed
02/Mar/2014:15:36:43 +0100
{
 "message" => "02/Mar/2014:15:36:43 +0100",
 "@version" => "1",
"@timestamp" => "2014-03-02T14:36:43.000Z",
 "host" => "example.com"
}</pre>
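Another use that follows from the same idea is routing: keep a value in @metadata during filtering and reference it in an output, so it steers the output without being stored in the event. This is only a sketch; the type value, the index prefixes, and the Elasticsearch address are assumptions.<pre class="brush:bash;toolbar:false">filter {
  # Pick an index prefix per event and keep it out of the stored document
  if [type] == "apache" {
    mutate { add_field => { "[@metadata][index]" => "web" } }
  } else {
    mutate { add_field => { "[@metadata][index]" => "misc" } }
  }
}

output {
  elasticsearch {
    # Assumed address of a local node
    hosts => [ "localhost:9200" ]
    # The prefix comes from @metadata, the date from the event timestamp
    index => "%{[@metadata][index]}-%{+yyyy.MM.dd}"
  }
}</pre>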
