Nginx Unique Tracing ID

作者: 四火流年 | 来源:发表于2017-05-07 23:43 被阅读1193次

    背景

    我们想要从Nginx接受请求开始,生成一个Unique Tracing ID,不仅记录在Nginx的日志中,也要贯穿到整个后台的服务,从而利用这个ID方便问题的排查。

    方案一

    利用Nginx丰富的内置变量,拼接出一个“unique enough id”。这里使用了五个变量:

    • $pid: Nginx worker process id
    • $msec: timestamp in millisecond
    • $remote_addr: client address
    • $connection: TCP connection serial number
    • $connection_requests: current number of requests made through a connection

    实现步骤

    1.在nginx.conf的location模块里:

    location / {
        proxy_pass http://upstream;
        set $req_id $pid.$msec.$remote_addr.$connection.$connection_requests;
        proxy_set_header X-Request-Id $req_id;
    }
    

    2.在http模块的 log_format 里加上 $req_id,至此Nginx的日志中将包含这个ID

    log_format trace '... $req_id';
    

    3.在后台服务中可以通过下面的方式获取$req_id

    class MainHandler(tornado.web.RequestHandler):
        def get(self):
            self.write(self.request.headers["X-Request-Id"])
    

    4.重启Nginx

    nginx -s reload
    

    问题

    格式混乱,信息冗余,生成的效果如下:

    97372.1493211301.686.127.0.0.1.471.32
    

    方案二

    使用Nginx内置的变量 $request_id
    这是最直接的办法,使用Nginx自带的一个$request_id,一个16位比特的随机数,用32位的16进制数表示。

    proxy_set_header X-Request-Id $request_id;
    

    问题

    这Nginx 1.11.0 版本新增加的feature,使用Nginx旧版本,或者依赖某些二次开发的Nginx版本,例如 Tengine 继承的是Nginx 1.8.1 版本,都面临着升级Nginx的问题。

    方案三

    使用 Lua 生成一个uuid.
    利用Lua轻量小巧的特性,嵌入到Nginx的配置文件当中,然后生成一个uuid.

    实现步骤

    1.在 http 模块里加入:

        map $host $uuid {
            default '';
        }
        lua_package_path '/path/to/uuid4.lua';
        init_by_lua '
            uuid4 = require "uuid4"
            math = require "math"
        ';
    

    2.在server模块里加入:

        set_by_lua $uuid '
            return uuid4.getUUID()
        ';
    

    3.在location模块里加入:

        proxy_set_header X-Request-Id $uuid;
    

    4.uuid4.lua
    引用自 第三方库

    --[[
    The MIT License (MIT)
    Copyright (c) 2012 Toby Jennings
    Permission is hereby granted, free of charge, to any person obtaining a copy of this software and
    associated documentation files (the "Software"), to deal in the Software without restriction,
    including without limitation the rights to use, copy, modify, merge, publish, distribute,
    sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is
    furnished to do so, subject to the following conditions:
    The above copyright notice and this permission notice shall be included in all copies or substantial
    portions of the Software.
    THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT
    NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
    NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM,
    DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
    OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
    --]]
    
    local M = {}
    -----
    math.randomseed( os.time() )
    math.random()
    -----
    local function num2bs(num)
        local _mod = math.fmod or math.mod
        local _floor = math.floor
        --
        local result = ""
        if(num == 0) then return "0" end
        while(num  > 0) do
             result = _mod(num,2) .. result
             num = _floor(num*0.5)
        end
        return result
    end
    --
    local function bs2num(num)
        local _sub = string.sub
        local index, result = 0, 0
        if(num == "0") then return 0; end
        for p=#num,1,-1 do
            local this_val = _sub( num, p,p )
            if this_val == "1" then
                result = result + ( 2^index )
            end
            index=index+1
        end
        return result
    end
    --
    local function padbits(num,bits)
        if #num == bits then return num end
        if #num > bits then print("too many bits") end
        local pad = bits - #num
        for i=1,pad do
            num = "0" .. num
        end
        return num
    end
    --
    local function getUUID()
        local _rnd = math.random
        local _fmt = string.format
        --
        _rnd()
        --
        local time_low_a = _rnd(0, 65535)
        local time_low_b = _rnd(0, 65535)
        --
        local time_mid = _rnd(0, 65535)
        --
        local time_hi = _rnd(0, 4095 )
        time_hi = padbits( num2bs(time_hi), 12 )
        local time_hi_and_version = bs2num( "0100" .. time_hi )
        --
        local clock_seq_hi_res = _rnd(0,63)
        clock_seq_hi_res = padbits( num2bs(clock_seq_hi_res), 6 )
        clock_seq_hi_res = "10" .. clock_seq_hi_res
        --
        local clock_seq_low = _rnd(0,255)
        clock_seq_low = padbits( num2bs(clock_seq_low), 8 )
        --
        local clock_seq = bs2num(clock_seq_hi_res .. clock_seq_low)
        --
        local node = {}
        for i=1,6 do
            node[i] = _rnd(0,255)
        end
        --
        local guid = ""
        guid = guid .. padbits(_fmt("%X",time_low_a), 4)
        guid = guid .. padbits(_fmt("%X",time_low_b), 4)
        guid = guid .. padbits(_fmt("%X",time_mid), 4)
        guid = guid .. padbits(_fmt("%X",time_hi_and_version), 4)
        guid = guid .. padbits(_fmt("%X",clock_seq), 4)
        --
        for i=1,6 do
            guid = guid .. padbits(_fmt("%X",node[i]), 2)
        end
        --
        return guid
    end
    --
    M.getUUID = getUUID
    return M
    

    问题

    Lua的这个模块太长,担心性能问题,需要进行性能评估。

    方案四

    还是利用Lua脚本,使用时间戳加随机数的方式
    关键步骤:

        set_by_lua $rdm_number '
            return os.time() .. os.clock()*100 .. math.random(1000000000, os.time())
        ';
    

    问题

    os.time()的精确度在1秒,os.clock()的精确度在0.01秒,这样处理之后,总的精度在10毫秒,没有达到要求。
    Lua有一个 Luasocket 模块,可以达到毫秒级别的精度,但是需要安装。

    方案五

    结合Nginx的 $msec 变量和 Lua 的随机数
    关键配置

    server {
        ...
        set_by_lua $rdm_number '
            return math.random(1000000000, os.time())
        ';
        location / {
            ...
            set $req_id $msec$rdm_number;
            proxy_set_header X-Request-Id $req_id;
        }
    }
    

    终记

    最终确定方案五,简单,方便,影响最小。
    在方案选择、测试过程中,还遇到了环境搭建相关的问题,将记录在下篇文章中,敬请期待!


    参考

    1.http://stackoverflow.com/questions/17748735/setting-a-trace-id-in-nginx-load-balancer
    2.https://blog.ryandlane.com/2014/12/11/using-lua-in-nginx-for-unique-request-ids-and-millisecond-times-in-logs/
    3.http://www.jb51.net/article/82167.htm
    4.http://nginx.org/en/docs/http/ngx_http_core_module.html#.24args
    5.http://nginx.org/en/docs/http/ngx_http_core_module.html#var_request_id

    相关文章

      网友评论

        本文标题:Nginx Unique Tracing ID

        本文链接:https://www.haomeiwen.com/subject/pztptxtx.html