redis哨兵集群搭载操作手册

作者: 柯家四少 | 来源:发表于2019-01-23 15:05 被阅读0次

redis哨兵集群搭载操作手册
缓存-Redis 哨兵集群实现高可用
深入浅出Redis-redis哨兵集群
NoSQL Redis 集群架构（Cluster）--Day2
实现Nginx负载均衡、Session共享、Redis集群（二）
Java-分布式框架-redis-3
8.Redis复制、哨兵、集群
.netcore高级工程师面试之Redis的哨兵架构
11.Redis集群入门
突破Java面试(23-7) - Redis的哨兵架构

1.Redis 主从分离

首先，我们默认大家都已经安装了redis，然后我们将redis.conf 拷贝多份，并且创建多个目录，用于区分多个redis 服务：

这里面，每个目录中都有自己的redis.conf 配置文件，接下来，我们先设置主服务器的配置文件。

一、配置Master

1、修改端口

# Accept connections on the specified port,defaultis6379(IANA #815344).

# If port 0is specified Redis will not listen on a TCP socket.

port 6380

redis 的默认端口是6379，这里我们把主服务器的端口设置为6380。

2.修改pidfile

# If a pid file is specified, Redis writes it where specified at startup

# and removes it at exit.

# When the server runs non daemonized, no pid file is created if none is

# specified in the configuration. When the server is daemonized, the pid file

# is used even if not specified, defaulting to "/var/run/redis.pid".

# Creating a pid file is best effort: if Redis is not able to create it

# nothing bad happens, the server will start and run normally.

pidfile /var/run/redis_6380.pid

pidfile 是我们启动redis 的时候，linux 为我们分配的一个pid 进程号，如果这里不作修改，会影响后面redis服务的启动

3、启动 redis

启动redis，我们可以看到，redis已经占领了6380 端口

进入客户端

redis-cli -p 6380

127.0.0.1:6380> info

...

# Replication

role:master

connected_slaves:0

master_repl_offset:0

repl_backlog_active:0

repl_backlog_size:1048576

repl_backlog_first_byte_offset:0

repl_backlog_histlen:0

我们可以看到，redis 现在的角色是一个master 启动的服务。

二、配置Slave

　　和上面配置 master一样，我们需要修改端口号和pid 文件，在修改完之后，我们有两种方法配置从服务

1、在配置文件中配置从服务

################################# REPLICATION ###################

# Master-Slave replication. Use slaveof to make a Redis instance a copy of

# another Redis server. A few things to understand ASAP about Redis replication.

# 1) Redis replication is asynchronous, but you can configure a master to

# stop accepting writes if it appears to be not connected with at least

# a given number of slaves.

# 2) Redis slaves are able to perform a partial resynchronization with the

# master if the replication link is lost for a relatively small amount of

# time. You may want to configure the replication backlog size (see the next

# sections of this file) with a sensible value depending on your needs.

# 3) Replication is automatic and does not need user intervention. After a

# network partition slaves automatically try to reconnect to masters

# and resynchronize with them.

# slaveof <masterip> <masterport>

slaveof 127.0.0.1 6380

我们可以在配置文件中直接修改slaveof属性，我们直接配置主服务器的ip 地址，和端口号，如果这里主服务器有配置密码

可以通过配置masterauth来设置链接密码

# If the masterispasswordprotected(usingthe"requirepass" configuration

# directive below) it is possible to tell the slave to authenticate before

# starting the replication synchronization process, otherwise the master will

# refuse the slave request.

# masterauth

启动redis 服务：

我们可以看到，现在有两个现在在运行，我们进入6381的客户端，看一下他的状态，

# Replication

role:slave

master_host:127.0.0.1

master_port:6380

master_link_status:up

master_last_io_seconds_ago:1

master_sync_in_progress:0

slave_repl_offset:71

slave_priority:100

slave_read_only:1

connected_slaves:0

master_repl_offset:0

repl_backlog_active:0

repl_backlog_size:1048576

repl_backlog_first_byte_offset:0

repl_backlog_histlen:0

我们可以看到，现在的redis 是一个从服务的角色，连接着6380的服务。

2、在服务启动后设置

我们修改6382端口的服务器配置文件之后，启动服务

127.0.0.1:6382> slaveof 127.0.0.1 6380

//修改后状态

# Replication

role:slave

master_host:127.0.0.1

master_port:6380

master_link_status:up

master_last_io_seconds_ago:1

master_sync_in_progress:0

slave_repl_offset:617

slave_priority:100

slave_read_only:1

connected_slaves:0

master_repl_offset:0

repl_backlog_active:0

repl_backlog_size:1048576

repl_backlog_first_byte_offset:0

repl_backlog_histlen:0

3、总结

　　　我们先看一下目前master 的状态：

# Replication

role:master

connected_slaves:2

slave0:ip=127.0.0.1,port=6381,state=online,offset=785,lag=0

slave1:ip=127.0.0.1,port=6382,state=online,offset=785,lag=0

master_repl_offset:785

repl_backlog_active:1

repl_backlog_size:1048576

repl_backlog_first_byte_offset:2

repl_backlog_histlen:784

我们可以可以看到，两个从服务已经在连着主服务器，上面两种配置的区别在于，当salve 断线重连之后，

如果我们是修改类配置文件，重连之后会自己链接上去master，并且同步master 上面的数据，

如果我们是手动连接上去的主服务器，重连之后，从服务器会读取自己本地的 rdb 回复数据，而不会去自动链接主服务

我们如果需要设置读写分离，只需要在主服务器中设置：

# Note: read only slaves are not designed to be exposed to untrusted clients

# on the internet. It's just a protection layer against misuse of the instance.

# Still a read only slave exports by default all the administrative commands

# such as CONFIG, DEBUG, and so forth. To a limited extent you can improve

# security of read only slaves using 'rename-command' to shadow all the

# administrative / dangerous commands.

slave-read-only yes

2、Sentinel 哨兵

1、配置端口

在sentinel.conf 配置文件中，我们可以找到port 属性，这里是用来设置sentinel 的端口，一般情况下，至少会需要三个哨兵对redis 进行监控，我们可以通过修改端口启动多个sentinel 服务。

# port # The port that this sentinel instance will run on

port 26379

2、配置主服务器的ip 和端口

我们把监听的端口修改成6380，并且加上权值为2，这里的权值，是用来计算我们需要将哪一台服务器升级升主服务器

# sentinel monitor <master-name> <ip> <redis-port> <quorum>

# Tells Sentinel to monitor this master, and to consider it in O_DOWN

# (Objectively Down) state only if at least <quorum> sentinels agree.

# Note that whatever is the ODOWN quorum, a Sentinel will require to

# be elected by the majority of the known Sentinels in order to

# start a failover, so no failover can be performed in minority.

# Slaves are auto-discovered, so you don't need to specify slaves in

# any way. Sentinel itself will rewrite this configuration file adding

# the slaves using additional configuration options.

# Also note that the configuration file is rewritten when a

# slave is promoted to master.

# Note: master name should not include special characters or spaces.

# The valid charset is A-z 0-9 and the three characters ".-_".

sentinel monitor mymaster 127.0.0.1 6380 2

3、启动Sentinel

./redis-sentinel sentinel.conf --sentinel &

sentinel 启动之后，就会监视到现在有一个主服务器，两个从服务器

　　当我们把其中一个从服务器器关闭之后，我们可以看到日志：

10894:X30Dec16:27:03.670# +sdown slave127.0.0.1:6381127.0.0.16381@ mymaster127.0.0.16380

　　日志表示，6381这个从服务器已经从主服务器中脱离了出来，我们重新把6381 接回去。

10894:X30Dec16:28:43.288* +reboot slave127.0.0.1:6381127.0.0.16381@ mymaster127.0.0.1638010894:X30Dec16:28:43.365# -sdown slave127.0.0.1:6381127.0.0.16381@ mymaster127.0.0.16380

4、关闭Master

我们手动关闭Master 之后，sentinel 在监听master 确实是断线了之后，将会开始计算权值，然后重新分配主服务器

我们可以看到，6380主服务器断了之后，sentinel 帮我们选了6382作为新的主服务器

我们进到6382的客户端，查看他的状态：

# Replication

role:master

connected_slaves:1

slave0:ip=127.0.0.1,port=6381,state=online,offset=13751,lag=0

master_repl_offset:13751

repl_backlog_active:1

repl_backlog_size:1048576

repl_backlog_first_byte_offset:2

repl_backlog_histlen:13750

我们可以看到 6382，重slave 荣升为master

127.0.0.1:6382>set name jaycekon

原本的没有权限写，也得到了相应的权限

5、重连Master

　　　　大家可能会好奇，如果master 重连之后，会不会抢回属于他的位置，答案是否定的，就比如你被一个小弟抢了你老大的位置，他肯给回你这个位置吗。因此当master 回来之后，他也只能当个小弟

3、Sentinel 总结

一、Sentinel的作用：

A、Master 状态监测

B、如果Master 异常，则会进行Master-slave 转换，将其中一个Slave作为Master，将之前的Master作为Slave

C、Master-Slave切换后，master_redis.conf、slave_redis.conf和sentinel.conf的内容都会发生改变，即master_redis.conf中会多一行slaveof的配置，sentinel.conf的监控目标会随之调换

二、Sentinel的工作方式:

1)：每个Sentinel以每秒钟一次的频率向它所知的Master，Slave以及其他 Sentinel 实例发送一个 PING 命令

2)：如果一个实例（instance）距离最后一次有效回复 PING 命令的时间超过 down-after-milliseconds 选项所指定的值，则这个实例会被 Sentinel 标记为主观下线。

3)：如果一个Master被标记为主观下线，则正在监视这个Master的所有 Sentinel 要以每秒一次的频率确认Master的确进入了主观下线状态。

4)：当有足够数量的 Sentinel（大于等于配置文件指定的值）在指定的时间范围内确认Master的确进入了主观下线状态，则Master会被标记为客观下线

5)：在一般情况下，每个 Sentinel 会以每 10 秒一次的频率向它已知的所有Master，Slave发送 INFO 命令

6)：当Master被 Sentinel 标记为客观下线时，Sentinel 向下线的 Master 的所有 Slave 发送 INFO 命令的频率会从 10 秒一次改为每秒一次

7)：若没有足够数量的 Sentinel 同意 Master 已经下线， Master 的客观下线状态就会被移除。

若 Master 重新向 Sentinel 的 PING 命令返回有效回复， Master 的主观下线状态就会被移除。

4.集群

集群至少需要3主3从，且每个实例使用不同的配置文件，主从不用配置，集群会自己选。

修改每个实例的配置文件：

cluster-enabled yes --开启集群

cluster-config-file nodes-6382.conf --集群配置文件名，每个实例配置的要不同，redis会根据文件名自动新建

用集群工具创建集群：

我们可以用集群工具进行集群，该工具是redis源码包中，用ruby编写，所以需要先安装ruby。

1、安装rubygems

yum install ruby（这一步建议去官网下载ruby包2.2版本以上的，官网地址：http://www.ruby-lang.org/en/downloads/）

yum install rubygems

gem install redis（ruby版本必须大于等于2.2）

2、把6个redis实例都起来，每个实例的集群都打开。

3、redis安装目录的src执行./redis-trib.rb create --replicas 1 127.0.0.1:6380

127.0.0.1:6381 127.0.0.1:6382 127.0.0.1:6383 127.0.0.1:6384

127.0.0.1:6385

提示信息如下：

Connecting to node 127.0.0.1:6380: OK

Connecting to node 127.0.0.1:6381: OK

Connecting to node 127.0.0.1:6382: OK

Connecting to node 127.0.0.1:6383: OK

Connecting to node 127.0.0.1:6384: OK

Connecting to node 127.0.0.1:6385: OK

>>> Performing hash slots allocation on 6 nodes...

Using 3 masters:

127.0.0.1:6380

127.0.0.1:6381

127.0.0.1:6382

Adding replica 127.0.0.1:6383 to 127.0.0.1:6380

Adding replica 127.0.0.1:6384 to 127.0.0.1:6381

Adding replica 127.0.0.1:6385 to 127.0.0.1:6382

M: d4f906940d68714db787a60837f57fa496de5d12 127.0.0.1:6380 slots:0-5460 (5461 slots) master

M: b547d05c9d0e188993befec4ae5ccb430343fb4b 127.0.0.1:6381 slots:5461-10922 (5462 slots) master

M: 887fe91bf218f203194403807e0aee941e985286 127.0.0.1:6382 slots:10923-16383 (5461 slots) master

S: e0f6559be7a121498fae80d44bf18027619d9995 127.0.0.1:6383 replicates d4f906940d68714db787a60837f57fa496de5d12

S: a61dbf654c9d9a4d45efd425350ebf720a6660fc 127.0.0.1:6384 replicates b547d05c9d0e188993befec4ae5ccb430343fb4b

S: 551e5094789035affc489db267c8519c3a29f35d 127.0.0.1:6385 replicates 887fe91bf218f203194403807e0aee941e985286

Can I set the above configuration? (type 'yes' to accept):

输入yes，这样集群就建立了。

登录任一台redis，执行 info cluster，提示cluster_enabled:1

集群过程：

首先redis-trib.rb会以客户端的形式尝试连接所有的节点，并发送PING命令以确定节点能够正常服务。如果有任何节点无法连接，则创建失败。同时发送 INFO 命令获取每个节点的运行ID以及是否开启了集群功能（即cluster_enabled为1）。准备就绪后集群会向每个节点发送 CLUSTER MEET命令，格式为 CLUSTER MEET ip port，这个命令用来告诉当前节点指定ip和port上在运行的节点也是集群的一部分，从而使得6个节点最终可以归入一个集群。

然后redis-trib.rb会分配主从数据库节点，分配的原则是尽量保证每个主数据库运行在不同的IP地址上，同时每个从数据库和主数据库均不运行在同一IP地址上，以保证系统的容灾能力

3主3从，当1个主故障，大家会给对应的从投票，把从立为主，若没有从数据库可以恢复则redis集群就down了。

客户端连接：

使用redis-cli -c -p 任意一个端口

redis哨兵集群搭载操作手册
1.Redis 主从分离首先，我们默认大家都已经安装了redis，然后我们将redis.conf 拷贝多份，并且...
缓存-Redis 哨兵集群实现高可用
Redis 哨兵集群实现高可用哨兵的介绍 sentinel，中文名是哨兵。哨兵是 redis 集群机构中非常重要...
深入浅出Redis-redis哨兵集群
深入浅出Redis-redis哨兵集群 1、Sentinel 哨兵 Sentinel（哨兵）是Redis 的高可用...
NoSQL Redis 集群架构（Cluster）--Day2
redis集群一、集群redis cluster 1、哨兵的不足 2、redis cluster 3、集群搭建 ...
实现Nginx负载均衡、Session共享、Redis集群（二）
感谢博主：spring整合redis(哨兵模式)Redis高可用集群-哨兵模式（Redis-Sentinel）搭建...
Java-分布式框架-redis-3
一、Redis哨兵集群弊端 redis3.0之前比较可靠的集群就是哨兵集群，主从节点，主节点拥有写与读的权限，从节...
8.Redis复制、哨兵、集群
Redis复制、哨兵、集群 Redis实现高可用相关的技术，它们包括：持久化、复制、哨兵和集群，其主要作用和解决的...
.netcore高级工程师面试之Redis的哨兵架构
简介：Github 1 Redis Sentinal机制 sentinal，中文名哨兵哨兵是redis集群架构中...
11.Redis集群入门
本主题，介绍了Redis的集群配置与编程入门，其实Redis的集群还包含主从集群模式，哨兵集群模式（Redis 5...
突破Java面试(23-7) - Redis的哨兵架构
Github 1 Redis Sentinal机制 sentinal，中文名哨兵哨兵是redis集群架构中非常重...