Database Administrators
Q&A for database professionals who wish to improve their database skills
Latest Questions
1
votes
1
answers
1582
views
Innodb not started?
I installed MariaDB_Galera_server10.0 , but when i check the error log i see this: 170118 14:49:09 [Note] InnoDB: 128 rollback segment(s) are active. 170118 14:49:09 [Note] InnoDB: Waiting for purge to start 170118 14:49:09 [Note] InnoDB: Percona XtraDB (http://www.percona.com) 5.6.34-79.1 sta rted;...
I installed MariaDB_Galera_server10.0 , but when i check the error log i see this:
170118 14:49:09 [Note] InnoDB: 128 rollback segment(s) are active.
170118 14:49:09 [Note] InnoDB: Waiting for purge to start
170118 14:49:09 [Note] InnoDB: Percona XtraDB (http://www.percona.com) 5.6.34-79.1 sta
rted; log sequence number 1627308
170118 14:49:09 [Note] Plugin ‘FEEDBACK’ is disabled.
170118 14:49:09 [Note] WSREP: Service disconnected.
170118 14:49:10 [Note] WSREP: Some threads may fail to exit.
170118 14:49:10 [Note] InnoDB: FTS optimize thread exiting.
170118 14:49:10 [Note] InnoDB: Starting shutdown...
170118 14:49:10 [Note] InnoDB: Waiting for page_cleaner to finish flushing of buffer po
170118 14:49:11 [Note] InnoDB: Shutdown completed; log sequence number 1627318
this is the cluster configuration '/etc/mysql/conf.d/cluster.cnf' :
[mysqld]
query_cache_size=0
binlog_format=ROW
default_storage_engine=innodb
innodb_autoinc_lock_mode=2
query_cache_type=0
bind-address=0.0.0.0
wsrep_on=ON
wsrep_provider=/usr/Lib64/libgalera_smm.so
wsrep_provider_options="gcache.size=32G"
wsrep_cluster_name="test_cluster"
wsrep_cluster_address=gcomm://192.168.10.231, 192.168.10.233
wsrep_sst_method= rsync
wsrep_sst_auth = wsrep_sst_user:wsrep_sst_pass
wsrep_node_address='192.168.10.231'
wsrep_node_name="yasoo"
and my.cnf :
# MariaDB database server configuration file.
# You can copy this file to one of:
- “/etc/mysql/my.cnf’ to set global options,
# - “—/.my.cnf” to set user-specific options.
# One can use all, long options that the program supports.
# Run program with - -help to get a list of available options and with
# --print-defaults to see which it would actually understand and use.
# For explanations see
# http://dev .mysql .com/doc/mysql/en/server-system-variables.html
# This will be passed to all mysql clients
# It has been reported that passwords should be enclosed with ticks/quotes
# escpecially if they contain “#‘ chars...
# Remember to edit /etc/mysql/debian.cnf when changing the socket location.
[client]
port = 3306
socket = /var/run/mysqld/mysqld.sock
# Here is entries for some specific programs
# This was formally known as [safe_mysqid] . Both versions are currently parsed.
[mysqld_safe]
log-bin=/var/log/mysql-bin.log
log=/var/tog/mysql.log
#1og-error= /var/log/mysqld.error.log
socket = /var/run/mysqld/mysqld.sock
nice =0
[mysqid]
#* Basic Settings
user = mysql
pid - file = /var/run/mysqld/mysqld.pid
socket = /var/run/mysqld/mysqld.sock
port = 3306
basedir = /usr
datadir = /var/1ib/mysql
tmpdir = ltmp
lc_messages_dir = /usr/share/mysql
lc_messages = en_US
skip-external-locking
# Instead of skip-networking the default is now to listen only on
# localhost which is more compatible and is not less secure.
#bind-address = 127.0.0.1
#* Fine Tuning
max_connections = 100
connect_timeout = 5
wait_timeout = 600
max-allowed_packet = 16M
thread_cache_size = 128
sort_buffer_size = 4M
bulk_insert_buffer_size = 16M
tmp_table_size = 32M
max_heap_table_size = 32M
#* MyISAM
# This replaces the startup script and checks MyISAM tables if needed
# the first time they are touched. On error, make copy and try a repair.
myisam_recover_options = BACKUP
key_buffer_size = 128M
#open-files-limit = 2000
table_open_cache = 400
myisam_sort_buffer_size = 512M
concurrent_insert = 2
read buffer size = 2M
read md buffer size = 1M
#* Query Cache Configuration
# Cache only tiny result sets, so we can fit more in the query cache.
query_cache_limit = 128K
# for more write intensive setups, set to DEMAND or OFF
#query_cache_type = DEMAND
* Logging and Replication
# Both location gets rotated by the cronjob.
# Be aware that this log type is a performance killer.
# As of 5.1 you can enable the log at runtime’
general_log_file = /var/log/mysqi/mysql.log
general_log = 1
# Error logging goes to syslog due to /etc/mysql/conf.d/mysqld_safe_syslog.cnf.
#1og-bin=/var/log/mysql-bin .log
#1og=/var/log/mysqi.1og
#]og-error= /var/log/mysqld.error.iog
# we do want to know about network errors and such
]og_warnings = 2
# Enable the slow query log to see queries with especially long duration
#slow_query_log[={O 1})
slow_query_log_file = /var/log/mysql/mariadb-slow.log
long_query_time = 10
log_slow_verbosity = query_plan
#log -queries -not -using -indexes
#log_slow_admin_statements
The following can be used as easy to replay backup logs or for replication.
# note: if you are setting up a replication sl.ave, see README.Debian about
other settings you may need to change.
server-id = 121
#report_host = masterl
#auto_inc rement_inc rement = 2
#auto_increment_offset = 1
log_bin = /var/log/mysql/mariadb-bin
log_bin_index = /var/log/mysql/ma riadb-bin.index
# not fab for performance, but safer
#sync_binlog = 1
expire_logs_days = 10
max_binlog_size = lOOM
# slaves
relay_log = /var/log/mysql/relay-bin
relay_log_index = /var/log/mysql/relay-bin.index
relay_log_info_file = /var/log/mysql/relay-bin.info
log_slave_updates
# If applications support it, this stricter sql_mode prevents some
# mistakes like inserting invalid dates etc.
#sql_mode = NO_ENGINE_SUBSTITUTION,TRADITIONAL
* InnoDB
# InnoDB is enabled by default with a 10MB datafile in /var/lib/mysql/
# Read the manual for more InnoDB related options. There are many!
default_storage_engine = InnoDB
# you can’t just change log file size, requires special procedure
nnodb_log_file_size = SOM
innodb_buffer_pool_size = 256M
innodb_log_buffer_size = 8M
innodb_file_per_table = 1
innodb_open_files = 400
innodb_io_capacity = 400
innodb flush method = 0_DIRECT
#* Security Features
# Read the manu&L, too, if you want chroot!
# chroot = /var/lJb/mysql/
# For generating SSL certificates I recommend the OpenSSL GUI “tinyca”.
# ssl. -ca=/etc/mysql/cace rt . pem
# ssl-cert=/etc/mysql/server-cert .pem
# ssl -key=/etc/mysql/server-key .pem
#*Galera-related settings
[galera]
#Mandatory settings
log-error=/var/log/mysql/mysql.err
log -bin=/var/log/mysql/mysql-replication.og
# Allow server to accept connections on all interfaces.
#bind-address=G.O.O.O
# Optional setting
wsrep_slave_threads=16
#innodb_flush_tog_at_trx_commit=0
[mysqidump]
quick
quote-names
max_allowed_packet = 16M
[mysql]
#no-auto-rehash # faster start of mysq1. but no tab completion
[isamchkJ
key_buffer = 16M
*# IMPORTANT: Additional settings that can override those from this file!
# The files must end with ‘.cnf’, otherwise they’ll be ignored.
!includedir /etc/mysql/conf.d/
Yaser Jawi
(11 rep)
Jan 19, 2017, 10:01 AM
• Last activity: May 24, 2025, 09:07 PM
0
votes
1
answers
470
views
MariaDB Galera Cluster (10.6.12) and Laravel WSREP issue
I'm administrating a galera cluster with 3 nodes, serving a Laravel application and HAProxy is the gateway to the cluster. We had an interesting issue, and I'm unable to figure out why was there an error message in the Laravel application. Here is what I know: At 13:25:36 there was a note on Node1,...
I'm administrating a galera cluster with 3 nodes, serving a Laravel application and HAProxy is the gateway to the cluster. We had an interesting issue, and I'm unable to figure out why was there an error message in the Laravel application. Here is what I know:
At 13:25:36 there was a note on Node1, that
... connection to peer f3e3XXXX-XXX with addr tcp://X.X.X.X:4567 timed out, no messages seen in PT3S, ...
This IP represents Node2.
Roughly the same time 13:25:37, on Node2, the following message appeared in the logs ... WSREP: (f3e3XXXX-XXX, 'tcp://X.X.X.X:4567') turning message relay requesting on, nonlive peers: tcp://X.X.X.X:4567 ...
This 2nd IP represent Node1. I'll post a more detailed log later in the post.
So my understanding is there could have been a network hickup between the clusters and Node2 (and 3) had to rejoin the cluster. Meanwhile, someone triggered the Larave application that made a query towards the cluster. But this request ended up in an error message saying: [2023-11-07 13:25:46] production.ERROR: SQLSTATE[08S01]: Communication link failure: 1047 WSREP has not yet prepared node for application use .....
If my understanding how a cluster should work is correct, I have no clue what happened. Isn't the galera cluster supposed to be able to manage if a node disappears from the cluster and use an existing node to handle the incoming query? Was this a HAProxy issue maybe and it could not detect that Node2 and 3 are unavailable and sent the query there eitherway?
This issue stayed for about 10 seconds and the cluster is working just fine ever since (I think... wsrep_cluster_size
is 3
ATM so it must be OK).
Now I don't want to "*spam*" this post with all the logs, so I'll insert the logs from Node1 and 2 into pastebin and URLs are provided.
!!! Can anyone explain me what happened here in this situation exactly? Maybe a way to avoid this in the future? !!!
Thank you in advance !
**MariaDB logs:** https://pastebin.com/SpDbe7Bb
**Laravel logs:**
`
[2023-11-07 13:25:46] production.ERROR: SQLSTATE[08S01]: Communication link failure: 1047 WSREP has not yet prepared node for application use (SQL: select * from .......) {"exception":"[object] (Illuminate\\Database\\QueryException(code: 08S01): SQLSTATE[08S01]: Communication link failure: 1047 WSREP has not yet prepared node for application use (SQL: select * from .......) at /var/www/html/myproject/releases/184/vendor/laravel/framework/src/Illuminate/Database/Connection.php:671)
`
Imre Bertalan
(105 rep)
Nov 8, 2023, 11:09 AM
• Last activity: Nov 8, 2023, 11:45 AM
1
votes
1
answers
2729
views
MariaDB Galera Cluster galera.cache file getting bigger than specified gcache.size
We have a 3 node Galera Cluster running on Kubernetes, behind 2 HAProxy PODs configured so, all queries are executed on the first POD/node of the cluster if available, and the other 2 nodes, provide HA (HA Proxy backend backup nodes). In the config file, gcache.size is configured to 5 GB, and when a...
We have a 3 node Galera Cluster running on Kubernetes, behind 2 HAProxy PODs configured so, all queries are executed on the first POD/node of the cluster if available, and the other 2 nodes, provide HA (HA Proxy backend backup nodes).
In the config file, gcache.size is configured to 5 GB, and when a new node is deployed, galera.cache file is 5.1GB so, it seems to get that configuration correctly. However, what we are seeing is galera.cache growing in size up to 80 GBs or more for that first node of the cluster.
As far as we know, this file should not increase in size. The problem is also reproduced when scaling the cluster down to one only node. It does not stop growing.
The version deployed is 10.3.22 (10.3.22-debian-10-r1 Bitnami Docker image) These are the wsrep provider options specified in my.cnf:
wsrep_on=ON
wsrep_provider=/opt/bitnami/mariadb/lib/libgalera_smm.so
wsrep_provider_options="gcache.size=5G"
wsrep_sst_method=mariabackup
wsrep_slave_threads=4
wsrep_cluster_address=gcomm://
wsrep_cluster_name=galera
wsrep_sst_auth="root:"
innodb-flush-log-at-trx-commit=2
# MYISAM REPLICATION SUPPORT #
wsrep_replicate_myisam=ON
We've been dealing with this situation for some time now, we can remove the first node(POD) and then the galera.cache is recreated so, we free disk space. The first node syncs through IST with any of the other 2 nodes and the HAProxy points to a backup node meanwhile, and then back to the first node when recovered, with no downtime.
However, we want to avoid to do this. We can't figure out why galera.cache file size increases, there is no documentation nor bug we could find talking about any similar issue.
Any help will be much appreciated!
Alex Núñez
(21 rep)
Jan 5, 2022, 04:19 PM
• Last activity: Jan 21, 2022, 12:56 PM
0
votes
2
answers
475
views
How to Prevent the Connections for MariaDB Galera-Cluster When There is Only One Active Node?
I was using "rsync" as the wsrep_sst_method in my galera.cnf files on 3-Node Galera Cluster. When it was so, the system served when there was at least 2 active node. So when any two node were gone, the active one did not serve. It was good enough but when I upgraded the system and assigned the wsre_...
I was using "rsync" as the wsrep_sst_method in my galera.cnf files on 3-Node Galera Cluster. When it was so, the system served when there was at least 2 active node. So when any two node were gone, the active one did not serve. It was good enough but when I upgraded the system and assigned the wsre_sst_method = mariabackup, it is serving if there is any active node. So when any of two nodes are gone, it is not preventing the working of the last active node. I dont want system works when there is only one active node. I want the system work like before upgrading without changing the wsrep_sst_method.
So the problem is simple. How can I prevent the system serving when there is just one active node?
TarabydaVllasıCafcaflıAtArabsı
(117 rep)
Oct 13, 2021, 06:41 AM
• Last activity: Oct 31, 2021, 04:08 PM
2
votes
0
answers
456
views
Why setting up new galera cluster with mariabackup as sst starts but all other nodes failed with same error?
I did a fresh reinstallation of mariadb-server on all the nodes (I removed using `sudo apt purge mariadb-*`) I started the first node using sudo galera_new_cluster it went fine and is still running. but other nodes threw this error: ● mariadb.service - MariaDB 10.3.27 database serverLoaded: loaded (...
I did a fresh reinstallation of mariadb-server on all the nodes
(I removed using
sudo apt purge mariadb-*
)
I started the first node using sudo galera_new_cluster it went fine and is still running. but other nodes threw this error:
● mariadb.service - MariaDB 10.3.27 database serverLoaded:
loaded (/lib/systemd/system/mariadb.service;
enabled; vendor preset: enabled)
Active: failed (Result: exit-code) since Sat 2020-12-19 20:23:19 IST; 2min 9s ago
Docs: man:mysqld(8)
Process: 7089
ExecStartPre=/usr/bin/install -m 755 -o mysql -g root -d /var/run/mysqld (code=exited, status=0/SUCCESS)Process: 7090 ExecStartPre=/bin/sh -c systemctl unset-environment _WSREP_START_POSITION (code=exited, status=0/SUCCESS)Process: 7092 ExecStartPre=/bin/sh -c [ ! -e /usr/bin/galera_recovery ] && VAR= || VAR=\cd /usr/bin/..; /usr/bin/galera_recovery\; [ $? -eq 0 ] && s`
Process: 7330 ExecStart=/usr/sbin/mysqld $MYSQLD_OPTS $_WSREP_NEW_CLUSTER $_WSREP_START_POSITION (code=exited, status=1/FAILURE)Main PID: 7330 (code=exited, status=1/FAILURE)
Status: "MariaDB server is down"
Dec 19 20:22:53 phl-pi-3 systemd: Starting MariaDB 10.3.27 database server...
Dec 19 20:22:59 phl-pi-3 sh: WSREP: Recovered position 00000000-0000-0000-0000-000000000000:-1
Dec 19 20:22:59 phl-pi-3 mysqld: 2020-12-19 20:22:59 0 [Note] /usr/sbin/mysqld (mysqld 10.3.27-MariaDB-0+deb10u1-log) starting as process 7330 ...
Dec 19 20:22:59 phl-pi-3 mysqld: 2020-12-19 20:22:59 0 [Warning] Could not increase number of max_open_files to more than 16384 (request: 32186)
Dec 19 20:23:19 phl-pi-3 systemd: mariadb.service: Main process exited, code=exited, status=1/FAILURE
Dec 19 20:23:19 phl-pi-3 systemd: mariadb.service: Failed with result 'exit-code'.
Dec 19 20:23:19 phl-pi-3 systemd: Failed to start MariaDB 10.3.27 database server.
`
this is my galera config:
[mysqld]#mysql settings
binlog_format=ROW
default-storage-engine=innodb
innodb_autoinc_lock_mode=2
innodb_doublewrite=1
query_cache_size=0query_cache_type=0
bind-address=0.0.0.0
#galera settings
wsrep_on=ON
wsrep_provider=/usr/lib/galera/libgalera_smm.so
wsrep_cluster_name="test_cluster"
wsrep_cluster_address=gcomm://192.168.0.15,192.168.0.16,192.168.0.12,10.8.0.6
wsrep_node_address="192.168.0.15"
wsrep_sst_method=mariabackup
wsrep_sst_donor=192.168.0.16
all other nodes have same galera config except different wsrep_address and dont have wsrep_sst_donor set.
And the other server config is as below:
$ cat 50-server.cnf
# These groups are read by MariaDB server.
# Use it for options that only the server (but not clients) should see
## See the examples of server my.cnf files in /usr/share/mysql/
## this is read by the standalone daemon and embedded servers
[server]
skip_name_resolve = 1
# this is only for the mysqld standalone daemon
[mysqld]
transaction_isolation = READ-COMMITTED
binlog_format = ROW
innodb_large_prefix=on
innodb_file_format=barracuda
innodb_file_per_table=1
innodb_io_capacity=4000
# * Basic Settings
user = mysqlpid-file = /var/run/mysqld/mysqld.pid
socket = /var/run/mysqld/mysqld.sock
port = 3306
basedir = /usr
datadir = /var/lib/mysql
tmpdir = /tmp
lc-messages-dir = /usr/share/mysql
skip-external-locking
bind-address = 0.0.0.0
# * Fine Tuning
key_buffer_size = 16
Mmax_allowed_packet = 16M
thread_stack = 192K
thread_cache_size = 8
myisam_recover_options = BACKUP
#max_connections = 100
#table_cache = 64
#thread_concurrency = 10
## * Query Cache Configuration
#query_cache_limit = 1M
query_cache_type = 1
query_cache_limit = 2M
query_cache_min_res_unit = 2k
query_cache_size = 64M
## * Logging and Replication
## Error log - should be very few entries.
#log_error = /var/log/mysql/error.log
server-id = 16
log_bin = mariadb_bin
expire_logs_days = 10
max_binlog_size = 100M
#binlog_do_db = include_database_name
#binlog_ignore_db = exclude_database_name
innodb_buffer_pool_size = 128M
innodb_buffer_pool_instances = 1
innodb_flush_log_at_trx_commit = 2
innodb_log_buffer_size = 32M
innodb_max_dirty_pages_pct = 90
# For generating SSL certificates you can use for example the GUI tool "tinyca".
## ssl-ca=/etc/mysql/cacert.pem
# ssl-cert=/etc/mysql/server-cert.pem
# ssl-key=/etc/mysql/server-key.pem
## Accept only connections using the latest and most secure TLS protocol version.
# ..when MariaDB is compiled with OpenSSL:
# ssl-cipher=TLSv1.2
# ..when MariaDB is compiled with YaSSL (default in Debian):
# ssl=on
## * Character sets## MySQL/MariaDB default is Latin1, but in Debian we rather default to the full
# utf8 4-byte character set. See also client.cnf
#character-set-server = utf8mb4
collation-server = utf8mb4_general_ci
tmp_table_size= 64Mmax_heap_table_size= 64M
slow_query_log = 1
slow_query_log_file = /var/log/mysql/slow.log
long_query_time = 1
all other nodes have same as above except different server-id
Ciasto piekarz
(139 rep)
Dec 19, 2020, 03:25 PM
0
votes
1
answers
243
views
Mariadb 10.1 / Galera - bootstrap node always doesn't lose quorum
I'm using mariadb10.1.37, wsrep_provider_version 25.3.24(r3825). On my dev cluster (2node+1arbitrator) I found the following behaviour: **node1:** bootstrap new cluster (using galera_new_cluster, with pc.weight=3) **node2:** join cluster (pc.weight=3) **arbitrator:** join cluster (pc.weight =1) So,...
I'm using mariadb10.1.37, wsrep_provider_version 25.3.24(r3825). On my dev cluster (2node+1arbitrator) I found the following behaviour:
**node1:** bootstrap new cluster (using galera_new_cluster, with pc.weight=3)
**node2:** join cluster (pc.weight=3)
**arbitrator:** join cluster (pc.weight =1)
So, I have a three node cluster, with total pc.weight of 7. Then I do:
**arbitrator:** shutdown garbd
**node2:** shutdown mysqld
At this point, I expect node1, being only node out of three still alive, should have lost wsrep_cluster_status=Primary status, and no longer accept writes.
Instead, I find wsrep_cluster_status = Primary, wsrep_cluster_size=1, and yet node1 will still accept writes.
Is there something I am missing?
Thanks in advance!
Mark S
(132 rep)
Nov 7, 2018, 03:24 PM
• Last activity: Nov 8, 2018, 03:56 PM
1
votes
1
answers
1253
views
WSREP: Writeset deserialization failed: Unsupported RecordSet version: 2: 71 (Protocol error)
We are running a Mariadb galera cluster with 3 data nodes. Recently one of them crashed and I had to do a SST (State Snapshot Transfer). Nothing I haven't done or seen before. However, after the SST was completed the process crashed on the following WSREP error: [ERROR] WSREP: Writeset deserializati...
We are running a Mariadb galera cluster with 3 data nodes. Recently one of them crashed and I had to do a SST (State Snapshot Transfer). Nothing I haven't done or seen before. However, after the SST was completed the process crashed on the following WSREP error:
[ERROR] WSREP: Writeset deserialization failed: Unsupported RecordSet version: 2: 71 (Protocol error)
at galerautils/src/gu_rset.cpp:header_version():272
at galera/src/trx_handle.cpp:unserialize():268
complete logs:
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] InnoDB: 128 rollback segment(s) are active.
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] InnoDB: Waiting for purge to start
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] InnoDB: Percona XtraDB (http://www.percona.com) 5.6.39-83.1 started; log sequence number 335729536918
mysqld: 2018-10-24 15:03:41 139764732757760 [Note] InnoDB: Dumping buffer pool(s) not yet started
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] Plugin 'FEEDBACK' is disabled.
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] Recovering after a crash using /var/log/mysql/mariadb-bin
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] Starting crash recovery...
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] Crash recovery finished.
mysqld: 2018-10-24 15:03:41 139769088514304 [Note] Server socket created on IP: '0.0.0.0'.
mysqld: 2018-10-24 15:03:42 139769088514304 [Note] WSREP: Signalling provider to continue.
mysqld: 2018-10-24 15:03:42 139769088514304 [Note] WSREP: SST received: 1b859078-cacc-11e8-8e3e-4381b13e7545:4661474
mysqld: 2018-10-24 15:03:42 139769088514304 [Note] Reading of all Master_info entries succeded
mysqld: 2018-10-24 15:03:42 139769088514304 [Note] Added new Master_info '' to hash table
mysqld: 2018-10-24 15:03:42 139769088514304 [Note] /usr/sbin/mysqld: ready for connections.
mysqld: Version: '10.1.36-MariaDB-1~xenial' socket: '/var/run/mysqld/mysqld.sock' port: 3306 mariadb.org binary distribution
mysqld: 2018-10-24 15:03:42 139769088199424 [ERROR] WSREP: Writeset deserialization failed: Unsupported RecordSet version: 2: 71 (Protocol error)
mysqld: at galerautils/src/gu_rset.cpp:header_version():272
mysqld: at galera/src/trx_handle.cpp:unserialize():268
mysqld: WS flags: 0
mysqld: Trx proto: 3
mysqld: Trx source: 00000000-0000-0000-0000-000000000000
mysqld: Trx conn_id: 18446744073709551615
mysqld: Trx trx_id: 18446744073709551615
mysqld: Trx last_seen: -1
mysqld: 2018-10-24 15:03:42 139769088199424 [ERROR] WSREP: Unsupported RecordSet version: 2: 71 (Protocol error)
mysqld: at galerautils/src/gu_rset.cpp:header_version():272
mysqld: at galera/src/trx_handle.cpp:unserialize():268
mysqld: 2018-10-24 15:03:42 139769088199424 [Note] WSREP: applier thread exiting (code:7)
mysqld: 2018-10-24 15:03:42 139769088199424 [ERROR] WSREP: node consistency compromised, aborting
mysqld: 2018-10-24 15:03:42 139769088199424 [Note] WSREP: starting shutdown
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] /usr/sbin/mysqld: Normal shutdown
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: Stop replication
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: Closing send monitor...
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: Closed send monitor.
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: gcomm: terminating thread
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: gcomm: joining thread
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: gcomm: closing backend
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: 3.0 (dbserver08): State transfer from 2.0 (DB) complete.
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Shifting JOINER -> JOINED (TO: 4666631)
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: view(view_id(NON_PRIM,2b30f5eb,167) memb {
mysqld: 7bffa222,0
mysqld: } joined {
mysqld: } left {
mysqld: } partitioned {
mysqld: 2b30f5eb,0
mysqld: 30d0c86b,0
mysqld: 5a7b95e7,0
mysqld: b0f6da74,0
mysqld: f7e81556,0
mysqld: })
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: view((empty))
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: gcomm: closed
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Flow-control interval: [16, 16]
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Trying to continue unpaused monitor
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Received NON-PRIMARY.
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Shifting JOINED -> OPEN (TO: 4666631)
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Received self-leave message.
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Flow-control interval: [0, 0]
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Trying to continue unpaused monitor
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Received SELF-LEAVE. Closing connection.
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 4666631)
mysqld: 2018-10-24 15:03:42 139768797067008 [Note] WSREP: RECV thread exiting 0: Success
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: recv_thread() joined.
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: Closing replication queue.
mysqld: 2018-10-24 15:03:42 139769010707200 [Note] WSREP: Closing slave action queue.
mysqld: 2018-10-24 15:03:44 139769088502528 [Note] WSREP: rollbacker thread exiting
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] Event Scheduler: Purging the queue. 0 events
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: dtor state: JOINING
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: apply mon: entered 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: apply mon: entered 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: mon: entered 1 oooe fraction 0 oool fraction 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: cert index usage at exit 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: cert trx map usage at exit 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: deps set usage at exit 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: avg deps dist 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: avg cert interval 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: cert index size 0
mysqld: 2018-10-24 15:03:44 139768863397632 [Note] WSREP: Service thread queue flushed.
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: wsdb trx map usage 0 conn query map usage 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: MemPool(LocalTrxHandle): hit ratio: 0, misses: 0, in use: 0, in pool: 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Note] WSREP: MemPool(SlaveTrxHandle): hit ratio: 0, misses: 1, in use: 1, in pool: 0
mysqld: 2018-10-24 15:03:44 139769010707200 [Warning] WSREP: Waiting for 5168 items to be fetched.
What does this WSREP error means?
I tried googling this specific error message but nothing came up.
I also checked mariadb versions differences between the nodes as I recently had to reinstall a node. But I couldn't spot any difference between them.
Thomas Wiersema
(111 rep)
Oct 24, 2018, 03:03 PM
• Last activity: Oct 25, 2018, 05:19 AM
0
votes
2
answers
1735
views
wsrep_sst_xtrabackup socat starts and stops right away
What I have done: I have restarted mysql on the JOINERNODE to apply some database settings including increasing the back_log and query_cache_size settings. What I am seeing: When I start mysql on the JOINERNODE, I see socat launch and listen on port 4444 then stop about 1-2 seconds later. Joiner MYS...
What I have done:
I have restarted mysql on the JOINERNODE to apply some database settings including increasing the back_log and query_cache_size settings.
What I am seeing:
When I start mysql on the JOINERNODE, I see socat launch and listen on port 4444 then stop about 1-2 seconds later.
Joiner MYSQL logs:
161107 19:16:43 mysqld_safe Starting mysqld daemon with databases from /var/lib/mysql
161107 19:16:43 mysqld_safe WSREP: Running position recovery with --log_error='/var/lib/mysql/wsrep_recovery.JKtTSU' --pid-file='/var/lib/mysql/JOINERNODE-recover.pid'
161107 19:16:56 mysqld_safe WSREP: Recovered position 3bf24a64-e806-11e5-8238-ea129650fffe:4050608949
161107 19:16:56 [Note] WSREP: wsrep_start_position var submitted: '3bf24a64-e806-11e5-8238-ea129650fffe:4050608949'
161107 19:16:56 [Note] WSREP: Read nil XID from storage engines, skipping position init
161107 19:16:56 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib64/galera/libgalera_smm.so'
161107 19:16:56 [Note] WSREP: wsrep_load(): Galera 25.3.5(rXXXX) by Codership Oy loaded successfully.
161107 19:16:56 [Note] WSREP: CRC-32C: using hardware acceleration.
161107 19:16:56 [Note] WSREP: Found saved state: 00000000-0000-0000-0000-000000000000:-1
161107 19:16:56 [Note] WSREP: Passing config to GCS: base_host = 170.71.77.88; base_port = 4567; cert.log_conflicts = no; debug = no; evs.inactive_check_period = PT0.5S; evs.inactive_timeout = PT15S; evs.join_retrans_period = PT1S; evs.max_install_timeouts = 1; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.user_send_window = 2; evs.view_forget_timeout = PT24H; gcache.dir = /var/lib/mysql/; gcache.keep_pages_size = 0; gcache.mem_size = 0; gcache.name = /var/lib/mysql//galera.cache; gcache.page_size = 1G; gcache.size = 4G; gcs.fc_debug = 0; gcs.fc_factor = 1.0; gcs.fc_limit = 16; gcs.fc_master_slave = no; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = no; gmcast.segment = 0; gmcast.version = 0; pc.announce_timeout = PT3S; pc.checksum = false; pc.ignore_quorum = false; pc.ignore_sb = false; pc.npvo = false; pc.version = 0; pc.wait_prim = true; pc.wait_prim_timeout = P30S; pc.weight = 1; protonet.ba
161107 19:16:56 [Note] WSREP: Service thread queue flushed.
161107 19:16:56 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
161107 19:16:56 [Note] WSREP: wsrep_sst_grab()
161107 19:16:56 [Note] WSREP: Start replication
161107 19:16:56 [Note] WSREP: Setting initial position to 00000000-0000-0000-0000-000000000000:-1
161107 19:16:56 [Note] WSREP: protonet asio version 0
161107 19:16:56 [Note] WSREP: Using CRC-32C (optimized) for message checksums.
161107 19:16:56 [Note] WSREP: backend: asio
161107 19:16:56 [Note] WSREP: GMCast version 0
161107 19:16:56 [Note] WSREP: (0a872720-a551-11e6-9bc2-eedf51200d4b, 'tcp://0.0.0.0:4567') listening at tcp://0.0.0.0:4567
161107 19:16:56 [Note] WSREP: (0a872720-a551-11e6-9bc2-eedf51200d4b, 'tcp://0.0.0.0:4567') multicast: , ttl: 1
161107 19:16:56 [Note] WSREP: EVS version 0
161107 19:16:56 [Note] WSREP: PC version 0
161107 19:16:56 [Note] WSREP: gcomm: connecting to group 'galera_cluster', peer 'DONORNODE:'
161107 19:16:56 [Note] WSREP: declaring 4159cb50-6323-11e6-946f-f77a9bdd31e2 stable
161107 19:16:56 [Note] WSREP: Node 4159cb50-6323-11e6-946f-f77a9bdd31e2 state prim
161107 19:16:56 [Note] WSREP: view(view_id(PRIM,0a872720-a551-11e6-9bc2-eedf51200d4b,85) memb {
0a872720-a551-11e6-9bc2-eedf51200d4b,0
4159cb50-6323-11e6-946f-f77a9bdd31e2,0
} joined {
} left {
} partitioned {
})
161107 19:16:57 [Note] WSREP: gcomm: connected
161107 19:16:57 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
161107 19:16:57 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
161107 19:16:57 [Note] WSREP: Opened channel 'galera_cluster'
161107 19:16:57 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 2
161107 19:16:57 [Note] WSREP: Waiting for SST to complete.
161107 19:16:57 [Note] WSREP: STATE_EXCHANGE: sent state UUID: 0ad4f230-a551-11e6-9d07-8244e22ea7fb
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: sent state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: got state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb from 0 (JOINERNODE)
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: got state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb from 1 (DONORNODE)
161107 19:16:57 [Note] WSREP: Quorum results:
version = 3,
component = PRIMARY,
conf_id = 72,
members = 1/2 (joined/total),
act_id = 4053449525,
last_appl. = -1,
protocols = 0/5/2 (gcs/repl/appl),
group UUID = 3bf24a64-e806-11e5-8238-ea129650fffe
161107 19:16:57 [Note] WSREP: Flow-control interval: [23, 23]
161107 19:16:57 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 4053449525)
161107 19:16:57 [Note] WSREP: State transfer required:
Group state: 3bf24a64-e806-11e5-8238-ea129650fffe:4053449525
Local state: 00000000-0000-0000-0000-000000000000:-1
161107 19:16:57 [Note] WSREP: New cluster view: global state: 3bf24a64-e806-11e5-8238-ea129650fffe:4053449525, view# 73: Primary, number of nodes: 2, my index: 0, protocol version 2
161107 19:16:57 [Warning] WSREP: Gap in state sequence. Need state transfer.
161107 19:16:59 [Note] WSREP: Running: 'wsrep_sst_xtrabackup --role 'joiner' --address '170.71.77.88' --auth 'replication:replication' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --parent '18657''
WSREP_SST: [INFO] Streaming with xbstream (20161107 19:16:59.521)
WSREP_SST: [INFO] Using socat as streamer (20161107 19:16:59.524)
WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:4444,reuseaddr stdio | xbstream -x; RC=( ${PIPESTATUS[@]} ) (20161107 19:16:59.663)
161107 19:17:02 [Note] WSREP: Prepared SST request: xtrabackup|170.71.77.88:4444/xtrabackup_sst
161107 19:17:02 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
161107 19:17:02 [Note] WSREP: REPL Protocols: 5 (3, 1)
161107 19:17:02 [Note] WSREP: Service thread queue flushed.
161107 19:17:02 [Note] WSREP: Assign initial position for certification: 4053449525, protocol version: 3
161107 19:17:02 [Note] WSREP: Service thread queue flushed.
161107 19:17:02 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-0000-0000-0000-000000000000) does not match group state UUID (3bf24a64-e806-11e5-8238-ea129650fffe): 1 (Operation not permitted)
at galera/src/replicator_str.cpp:prepare_for_IST():447. IST will be unavailable.
161107 19:17:02 [Note] WSREP: Member 0.0 (JOINERNODE) requested state transfer from '*any*'. Selected 1.0 (DONORNODE)(SYNCED) as donor.
161107 19:17:02 [Note] WSREP: Shifting PRIMARY -> JOINER (TO: 4053450570)
161107 19:17:02 [Note] WSREP: Requesting state transfer: success, donor: 1
xbstream: Can't create/write to file './ibdata1' (Errcode: 17 - File exists)
xbstream: failed to create file.
2016/11/07 19:17:38 socat E write(1, 0x84b3e0, 8192): Broken pipe
WSREP_SST: [ERROR] Xbstream failed (20161107 19:17:38.164)
WSREP_SST: [ERROR] Data directory /var/lib/mysql/ may not be empty: lp:1193240 Manual intervention required in that case (20161107 19:17:38.167)
WSREP_SST: [ERROR] Cleanup after exit with status:32 (20161107 19:17:38.169)
WSREP_SST: [INFO] Removing the sst_in_progress file (20161107 19:17:38.172)
161107 19:17:38 [ERROR] WSREP: Process completed with error: wsrep_sst_xtrabackup --role 'joiner' --address '170.71.77.88' --auth 'replication:replication' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --parent '18657': 32 (Broken pipe)
161107 19:17:38 [ERROR] WSREP: Failed to read uuid:seqno from joiner script.
161107 19:17:38 [ERROR] WSREP: SST failed: 32 (Broken pipe)
161107 19:17:38 [ERROR] Aborting
161107 19:17:38 [Warning] WSREP: 1.0 (DONORNODE): State transfer to 0.0 (JOINERNODE) failed: -22 (Invalid argument)
161107 19:17:38 [ERROR] WSREP: gcs/src/gcs_group.c:gcs_group_handle_join_msg():723: Will never receive state. Need to abort.
161107 19:17:38 [Note] WSREP: gcomm: terminating thread
161107 19:17:38 [Note] WSREP: gcomm: joining thread
161107 19:17:38 [Note] WSREP: gcomm: closing backend
161107 19:17:39 [Note] WSREP: view(view_id(NON_PRIM,0a872720-a551-11e6-9bc2-eedf51200d4b,85) memb {
0a872720-a551-11e6-9bc2-eedf51200d4b,0
} joined {
} left {
} partitioned {
4159cb50-6323-11e6-946f-f77a9bdd31e2,0
})
161107 19:17:39 [Note] WSREP: view((empty))
161107 19:17:39 [Note] WSREP: gcomm: closed
161107 19:17:39 [Note] WSREP: /usr/sbin/mysqld: Terminated.
161107 19:17:39 mysqld_safe mysqld from pid file /var/run/mysqld/mysqld.pid ended
Donor Mysql Logs:
161107 19:16:56 [Note] WSREP: declaring 0a872720-a551-11e6-9bc2-eedf51200d4b stable
161107 19:16:56 [Note] WSREP: Node 4159cb50-6323-11e6-946f-f77a9bdd31e2 state prim
161107 19:16:56 [Note] WSREP: view(view_id(PRIM,0a872720-a551-11e6-9bc2-eedf51200d4b,85) memb {
0a872720-a551-11e6-9bc2-eedf51200d4b,0
4159cb50-6323-11e6-946f-f77a9bdd31e2,0
} joined {
} left {
} partitioned {
})
161107 19:16:56 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 1, memb_num = 2
161107 19:16:56 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: sent state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: got state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb from 0 (JOINERNODE)
161107 19:16:57 [Note] WSREP: STATE EXCHANGE: got state msg: 0ad4f230-a551-11e6-9d07-8244e22ea7fb from 1 (DONORNODE)
161107 19:16:57 [Note] WSREP: Quorum results:
version = 3,
component = PRIMARY,
conf_id = 72,
members = 1/2 (joined/total),
act_id = 4053449525,
last_appl. = 4053449492,
protocols = 0/5/2 (gcs/repl/appl),
group UUID = 3bf24a64-e806-11e5-8238-ea129650fffe
161107 19:16:57 [Note] WSREP: Flow-control interval: [23, 23]
161107 19:16:57 [Note] WSREP: New cluster view: global state: 3bf24a64-e806-11e5-8238-ea129650fffe:4053449525, view# 73: Primary, number of nodes: 2, my index: 1, protocol version 2
161107 19:16:57 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
161107 19:16:57 [Note] WSREP: REPL Protocols: 5 (3, 1)
161107 19:16:57 [Note] WSREP: Service thread queue flushed.
161107 19:16:57 [Note] WSREP: Assign initial position for certification: 4053449525, protocol version: 3
161107 19:16:57 [Note] WSREP: Service thread queue flushed.
161107 19:16:57 [Warning] WSREP: Releasing seqno 4053449525 before 4053449526 was assigned.
161107 19:17:02 [Note] WSREP: Member 0.0 (JOINERNODE) requested state transfer from '*any*'. Selected 1.0 (DONORNODE)(SYNCED) as donor.
161107 19:17:02 [Note] WSREP: Shifting SYNCED -> DONOR/DESYNCED (TO: 4053450570)
161107 19:17:02 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
161107 19:17:02 [Note] WSREP: Running: 'wsrep_sst_xtrabackup --role 'donor' --address '170.71.77.88:4444/xtrabackup_sst' --auth 'replication:replication' --socket '/var/lib/mysql/mysql.sock' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --gtid '3bf24a64-e806-11e5-8238-ea129650fffe:4053450570''
161107 19:17:02 [Note] WSREP: sst_donor_thread signaled with 0
WSREP_SST: [INFO] Streaming with xbstream (20161107 19:17:02.257)
WSREP_SST: [INFO] Using socat as streamer (20161107 19:17:02.260)
WSREP_SST: [INFO] Streaming the backup to joiner at 170.71.77.88 4444 (20161107 19:17:02.273)
WSREP_SST: [INFO] Evaluating innobackupex --defaults-file=/etc/my.cnf $INNOEXTRA --galera-info --stream=$sfmt ${TMPDIR} 2>${DATA}/innobackup.backup.log | socat -u stdio TCP:170.71.77.88:4444; RC=( ${PIPESTATUS[@]} ) (20161107 19:17:02.277)
2016/11/07 19:17:38 socat E write(3, 0x689200, 8192): Broken pipe
WSREP_SST: [ERROR] innobackupex finished with error: 1. Check /var/lib/mysql//innobackup.backup.log (20161107 19:17:38.181)
WSREP_SST: [ERROR] Cleanup after exit with status:22 (20161107 19:17:38.185)
161107 19:17:38 [ERROR] WSREP: Failed to read from: wsrep_sst_xtrabackup --role 'donor' --address '170.71.77.88:4444/xtrabackup_sst' --auth 'replication:replication' --socket '/var/lib/mysql/mysql.sock' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --gtid '3bf24a64-e806-11e5-8238-ea129650fffe:4053450570'
161107 19:17:38 [ERROR] WSREP: Process completed with error: wsrep_sst_xtrabackup --role 'donor' --address '170.71.77.88:4444/xtrabackup_sst' --auth 'replication:replication' --socket '/var/lib/mysql/mysql.sock' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --gtid '3bf24a64-e806-11e5-8238-ea129650fffe:4053450570': 22 (Invalid argument)
161107 19:17:38 [ERROR] WSREP: Command did not run: wsrep_sst_xtrabackup --role 'donor' --address '170.71.77.88:4444/xtrabackup_sst' --auth 'replication:replication' --socket '/var/lib/mysql/mysql.sock' --datadir '/var/lib/mysql/' --defaults-file '/etc/my.cnf' --gtid '3bf24a64-e806-11e5-8238-ea129650fffe:4053450570'
161107 19:17:38 [Warning] WSREP: 1.0 (DONORNODE): State transfer to 0.0 (JOINERNODE) failed: -22 (Invalid argument)
161107 19:17:38 [Note] WSREP: Shifting DONOR/DESYNCED -> JOINED (TO: 4053457328)
161107 19:17:39 [Note] WSREP: Member 1.0 (DONORNODE) synced with group.
161107 19:17:39 [Note] WSREP: Shifting JOINED -> SYNCED (TO: 4053457328)
161107 19:17:39 [Note] WSREP: Node 4159cb50-6323-11e6-946f-f77a9bdd31e2 state prim
161107 19:17:39 [Note] WSREP: Synchronized with group, ready for connections
161107 19:17:39 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
161107 19:17:39 [Note] WSREP: view(view_id(PRIM,4159cb50-6323-11e6-946f-f77a9bdd31e2,86) memb {
4159cb50-6323-11e6-946f-f77a9bdd31e2,0
} joined {
} left {
} partitioned {
0a872720-a551-11e6-9bc2-eedf51200d4b,0
})
161107 19:17:39 [Note] WSREP: forgetting 0a872720-a551-11e6-9bc2-eedf51200d4b (tcp://170.71.77.88:4567)
161107 19:17:39 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 0, memb_num = 1
161107 19:17:39 [Note] WSREP: STATE_EXCHANGE: sent state UUID: 23bcece3-a551-11e6-aef5-b26eb62eab01
161107 19:17:39 [Note] WSREP: STATE EXCHANGE: sent state msg: 23bcece3-a551-11e6-aef5-b26eb62eab01
161107 19:17:39 [Note] WSREP: STATE EXCHANGE: got state msg: 23bcece3-a551-11e6-aef5-b26eb62eab01 from 0 (DONORNODE)
161107 19:17:39 [Note] WSREP: Quorum results:
version = 3,
component = PRIMARY,
conf_id = 73,
members = 1/1 (joined/total),
act_id = 4053457328,
last_appl. = 4053457319,
protocols = 0/5/2 (gcs/repl/appl),
group UUID = 3bf24a64-e806-11e5-8238-ea129650fffe
161107 19:17:39 [Note] WSREP: Flow-control interval: [16, 16]
161107 19:17:39 [Note] WSREP: New cluster view: global state: 3bf24a64-e806-11e5-8238-ea129650fffe:4053457328, view# 74: Primary, number of nodes: 1, my index: 0, protocol version 2
161107 19:17:39 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
161107 19:17:39 [Note] WSREP: REPL Protocols: 5 (3, 1)
161107 19:17:39 [Note] WSREP: Service thread queue flushed.
161107 19:17:39 [Note] WSREP: Assign initial position for certification: 4053457328, protocol version: 3
161107 19:17:39 [Note] WSREP: Service thread queue flushed.
161107 19:17:39 [Warning] WSREP: Releasing seqno 4053457328 before 4053457329 was assigned.
161107 19:17:44 [Note] WSREP: cleaning up 0a872720-a551-11e6-9bc2-eedf51200d4b (tcp://170.71.77.88:4567)
innobackup.backup.log from
161107 19:17:02 innobackupex: Starting the backup operation
IMPORTANT: Please check that the backup run completes successfully.
At the end of a successful backup run innobackupex
prints "completed OK!".
161107 19:17:02 version_check Connecting to MySQL server with DSN 'dbi:mysql:;mysql_read_default_group=xtrabackup;port=3306;mysql_socket=/var/lib/mysql/mysql.sock' as 'replication' (using password: YES).
161107 19:17:02 version_check Connected to MySQL server
161107 19:17:02 version_check Executing a version check against the server...
161107 19:17:03 version_check Done.
161107 19:17:03 Connecting to MySQL server host: localhost, user: replication, password: set, port: 3306, socket: /var/lib/mysql/mysql.sock
Using server version 5.5.38-MariaDB-wsrep-log
innobackupex version 2.3.4 based on MySQL server 5.6.24 Linux (x86_64) (revision id: e80c779)
xtrabackup: uses posix_fadvise().
xtrabackup: cd to /var/lib/mysql
xtrabackup: open files limit requested 0, set to 5005
xtrabackup: using the following InnoDB configuration:
xtrabackup: innodb_data_home_dir = ./
xtrabackup: innodb_data_file_path = ibdata1:10M:autoextend
xtrabackup: innodb_log_group_home_dir = ./
xtrabackup: innodb_log_files_in_group = 2
xtrabackup: innodb_log_file_size = 2097152000
xtrabackup: using O_DIRECT
161107 19:17:35 >> log scanned up to (182177566706404)
xtrabackup: Generating a list of tablespaces
161107 19:17:35 Streaming ./ibdata1
161107 19:17:36 >> log scanned up to (182177569423322)
161107 19:17:37 >> log scanned up to (182177571865574)
innobackupex: Error writing file 'UNOPENED' (Errcode: 32 - Broken pipe)
xb_stream_write_data() failed.
innobackupex: Error writing file 'UNOPENED' (Errcode: 32 - Broken pipe)
xtrabackup: Error: xtrabackup_copy_datafile() failed.
xtrabackup: Error: failed to copy datafile.
My.cnf settings:
[client]
port = 3306
socket = /var/lib/mysql/mysql.sock
[mysqld]
back_log = 1000
expire_logs_days = 1
innodb_autoextend_increment = 64
innodb_buffer_pool_instances = 64
innodb_buffer_pool_size = 100G
innodb_log_buffer_size = 128M
innodb_thread_concurrency = 0
thread_cache_size = 512
server-id = 3
port = 3306
binlog_cache_size = 4M
binlog-do-db = zabbix
binlog_format = ROW
binlog-row-event-max-size = 8192
datadir = /var/lib/mysql
innodb_concurrency_tickets = 5000
innodb_flush_log_at_trx_commit = 0
innodb_flush_method = O_DIRECT
innodb_log_file_size = 2000M
innodb_old_blocks_time = 1000
innodb_stats_on_metadata = OFF
ignore-db-dir = lost+found
join_buffer_size = 1M
log-bin = mysql-bin
log-error = /var/log/mysql/mysqld.log
max_allowed_packet = 64M
max_connect_errors = 10000
max_connections = 1000
max_heap_table_size = 256M
net_buffer_length = 8K
pid-file = /var/run/mysqld/mysqld.pid
query_cache_size = 64000000
query_cache_type = 1
read_buffer_size = 1M
relay-log-recovery = 1
relay-log-space-limit = 2G
replicate-do-db = zabbix
replicate-ignore-db = mysql, performance_schema, lost+found
slave-skip-error = 1062
socket = /var/lib/mysql/mysql.sock
sort_buffer_size = 1M
table_open_cache = 4096
tmp_table_size = 1G
wait_timeout = 28800
key_buffer_size = 16M
binlog-format = row
innodb_flush_neighbor_pages = cont
innodb_max_dirty_pages_pct = 30
innodb_io_capacity = 6000
log-slave-updates = true
read_rnd_buffer_size = 16M
relay-log-purge = 1
thread_concurrency = 24
tmpdir = /dev/shm
innodb_file_per_table
skip-slave-start
[sst]
streamfmt=xbstream
[mysqldump]
max_allowed_packet = 64M
quick
[mysql]
no-auto-rehash
[mysqlhotcopy]
interactive-timeout
[mysqld_safe]
pid-file = /var/run/mysqld/mysqld.pid
!includedir /etc/my.cnf.d/
[mysqld]
large-pages
skip-external-locking
[mysqld]
large-pages
skip-external-locking
[mysqld]
wsrep_provider = /usr/lib64/galera/libgalera_smm.so
wsrep_provider_options = gcache.size=4G; gcache.page_size=1G
wsrep_cluster_address = gcomm://cernzbxdb201.cernerasp.com
wsrep_cluster_name = galera_cluster
default_storage_engine = InnoDB
innodb_autoinc_lock_mode = 2
innodb_locks_unsafe_for_binlog = 1
wsrep_sst_method = xtrabackup-v2
wsrep_slave_threads = 64
wsrep_sst_auth =replication:replication
Som3guy
(75 rep)
Nov 8, 2016, 01:42 AM
• Last activity: Mar 9, 2018, 10:11 AM
1
votes
1
answers
8205
views
Should I increase my key_buffer_size?
I have 4 node and database has InnoDB tables. my key_buffer_size is 128M. Should i increase it in my system? My innodb_buffer_pool_size is 75G and innodb_log_buffer_size = 256M. Mem: 96688 92580 4107 0 116 8501 -/+ buffers/cache: 83962 12725 Swap: 10239 5104 5135 MariaDB [mydata]> SHOW STATUS LIKE "...
I have 4 node and database has InnoDB tables. my key_buffer_size is 128M. Should i increase it in my system? My innodb_buffer_pool_size is 75G and innodb_log_buffer_size = 256M.
Mem: 96688 92580 4107 0 116 8501
-/+ buffers/cache: 83962 12725
Swap: 10239 5104 5135
MariaDB [mydata]> SHOW STATUS LIKE "key%";
+------------------------+--------+
| Variable_name | Value |
+------------------------+--------+
| Key_blocks_not_flushed | 0 |
| Key_blocks_unused | 107171 |
| Key_blocks_used | 4 |
| Key_blocks_warm | 0 |
| Key_read_requests | 25 |
| Key_reads | 4 |
| Key_write_requests | 14 |
| Key_writes | 11 |
+------------------------+--------+
8 rows in set (0.00 sec)
I got that error, one of my node closed so I have that question. I use 5.5.46-MariaDB-1~trusty wsrep_25.12.r4f8102 Thanks
RECORD LOCKS space id 513 page no 16 n bits 296 index
GEN_CLUST_INDEX
of table mydata
.user_counter
trx id 65CFC8177 lock_mode X locks rec but not gap
161230 19:08:36 [ERROR] mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
To report this bug, see http://kb.askmonty.org/en/reporting-bugs
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Server version: 5.5.46-MariaDB-1~trusty-wsrep-log
key_buffer_size=134217728
read_buffer_size=2097152
max_used_connections=809
max_threads=2002
thread_count=224
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 12467846 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x0x7feb728c6000
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0x7ff378886a00 thread_stack 0x48000
(my_addr_resolve failure: fork)
/usr/sbin/mysqld(my_print_stacktrace+0x2e) [0x7ff37c2db1ae]
/usr/sbin/mysqld(handle_fatal_signal+0x457) [0x7ff37bebffc7]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x10340) [0x7ff37a90e340]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x39) [0x7ff379f65cc9]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x148) [0x7ff379f690d8]
/usr/sbin/mysqld(+0x2ded4e) [0x7ff37bcafd4e]
/usr/sbin/mysqld(+0x835033) [0x7ff37c206033]
/usr/sbin/mysqld(+0x83b811) [0x7ff37c20c811]
/usr/sbin/mysqld(+0x83c35b) [0x7ff37c20d35b]
/usr/sbin/mysqld(+0x75a2b2) [0x7ff37c12b2b2]
/usr/sbin/mysqld(+0x75e7ad) [0x7ff37c12f7ad]
/usr/sbin/mysqld(+0x729202) [0x7ff37c0fa202]
/usr/sbin/mysqld(Rows_log_event::find_row(Relay_log_info const*)+0x665) [0x7ff37bfa0a45]
/usr/sbin/mysqld(Update_rows_log_event::do_exec_row(Relay_log_info const*)+0x9c) [0x7ff37bfa0e8c]
/usr/sbin/mysqld(Rows_log_event::do_apply_event(Relay_log_info const*)+0x25c) [0x7ff37bf944ac]
/usr/sbin/mysqld(wsrep_apply_cb(void*, void const*, unsigned long, unsigned int, wsrep_trx_meta const*)+0x7ba) [0x7ff37be70afa]
/usr/lib/galera/libgalera_smm.so(galera::TrxHandle::apply(void*, wsrep_cb_status (*)(void*, void const*, unsigned long, unsigned int, wsrep_trx_meta const*), wsrep_trx_meta const&) const+0xd8) [0x7ff377b188f8]
/usr/lib/galera/libgalera_smm.so(+0x1df27d) [0x7ff377b4f27d]
/usr/lib/galera/libgalera_smm.so(galera::ReplicatorSMM::apply_trx(void*, galera::TrxHandle*)+0xd2) [0x7ff377b51b32]
/usr/lib/galera/libgalera_smm.so(galera::ReplicatorSMM::process_trx(void*, galera::TrxHandle*)+0x10e) [0x7ff377b5498e]
/usr/lib/galera/libgalera_smm.so(galera::GcsActionSource::dispatch(void*, gcs_action const&, bool&)+0x1b8) [0x7ff377b33668]
/usr/lib/galera/libgalera_smm.so(galera::GcsActionSource::process(void*, bool&)+0x58) [0x7ff377b33ef8]
/usr/lib/galera/libgalera_smm.so(galera::ReplicatorSMM::async_recv(void*)+0x73) [0x7ff377b54ef3]
/usr/lib/galera/libgalera_smm.so(galera_recv+0x18) [0x7ff377b634e8]
/usr/sbin/mysqld(+0x4a0744) [0x7ff37be71744]
/usr/sbin/mysqld(start_wsrep_THD+0x48e) [0x7ff37bcccc0e]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x8182) [0x7ff37a906182]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7ff37a02947d]
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (0x0): is an invalid pointer
Connection ID (thread ID): 10
Status: NOT_KILLED
Optimizer switch: index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,index_merge_sort_intersection=off,engine_condition_pushdown=off,index_condition_pushdown=on,derived_merge=on,derived_with_keys=on,firstmatch=on,loosescan=on,materialization=on,in_to_exists=on,semijoin=on,partial_match_rowid_merge=on,partial_match_table_scan=on,subquery_cache=on,mrr=off,mrr_cost_based=off,mrr_sort_keys=off,outer_join_with_cache=on,semijoin_with_cache=on,join_cache_incremental=on,join_cache_hashed=on,join_cache_bka=on,optimize_join_buffer_size=off,table_elimination=on,extended_keys=off
The manual page at http://dev.mysql.com/doc/mysql/en/crashing.html contains
information that should help you find out what is causing the crash.
161230 19:08:40 mysqld_safe Number of processes running now: 0
161230 19:08:40 mysqld_safe WSREP: not restarting wsrep node automatically
161230 19:08:40 mysqld_safe mysqld from pid file /var/run/mysqld/mysqld.pid ended
Melih
(284 rep)
Jan 6, 2017, 08:03 AM
• Last activity: Jan 6, 2017, 11:37 AM
1
votes
0
answers
440
views
Galera log has WSREP: failed to initialize io-cache
I'm having trouble tracking down the cause of an error message that is appearing in my logs every 15 minutes or so. The cause of the error happened whilst performance testing a 3 node Galera Cluster (running on MariaDB 10.1.13). Two of the read-only nodes fell too far behind the write master causing...
I'm having trouble tracking down the cause of an error message that is appearing in my logs every 15 minutes or so.
The cause of the error happened whilst performance testing a 3 node Galera Cluster (running on MariaDB 10.1.13). Two of the read-only nodes fell too far behind the write master causing them to request an SST sync and pulling the cluster off-line whilst the write master fulfilled the sync.
Since that point though the master has regularly been posting the following message into the logs:
2016-08-26 18:03:16 139975496911616 [ERROR] WSREP: failed to initialize io-cache
2016-08-26 18:03:16 139975496911616 [Warning] WSREP: binlog trx cache not empty (0 bytes) @ connection close 6510
2016-08-26 18:03:16 139975496911616 [Warning] WSREP: binlog stmt cache not empty (0 bytes) @ connection close 6510
Google is sparse on any details and the only good hit I've had is from the source code , which appears to suggest that it's unable to initialise a cache for reading or writing (hmmm, much like the error message suggests).
32 if (reinit_io_cache(cache, READ_CACHE, 0, 0, 0))
33 {
34 WSREP_ERROR("failed to initialize io-cache");
35 return ER_ERROR_ON_WRITE;
36 }
This message continues despite a reboot and full SST sync from the other nodes, so I'm a little lost as to the cause. Does anyone know what this means and how to fix it?
Dan
(111 rep)
Aug 26, 2016, 07:22 PM
Showing page 1 of 10 total questions