Skip to content
  • Andrei Elkin's avatar
    9ad3b3f9
    Bug#14258884 RPL.RPL_HEARTBEAT_BASIC FAILS SPORADICALLY ON PB2 · 9ad3b3f9
    Andrei Elkin authored
    Bug#13627066 RPL.RPL_DEADLOCK_INNODB FAILS ON PB2 SPRADICALLY
    
    Sporadic and long time standing mismatch at the test run
     
      include/start_slave.inc
      SET @@global.event_scheduler=1;
      -Number of received heartbeat events: 0
      +Number of received heartbeat events: 1
    
    was caused by a incorrect assumption the no hearbeat event should
    be sent in the context of that test's snippet.
    In fact, there is no guarantee that empty binlog status won't last
    over the hearbeat period. Even though period of scheduled by the server scheduler
    UPDATE queries is 1/5 th of the heartbeat period, the actual time in between of two
    successive bin-logging actions can last as long as the HB period. That's what PB run
    proves in practice.
    
    Fixed with removing ineffecient piece of the test.
    
    Bug#13627066 RPL.RPL_DEADLOCK_INNODB FAILS ON PB2 SPRADICALLY
    
    A possible reason of SQL thread to fail to increment slave_transaction_retries
    status is a failure to start the slave threads that went unnoticible thanks to unblocking
    style of the slave start. 
    combined with also small of innodb_lock_wait_timeout. That could allow a race of 
    the SQL thread retrying after timeout and the mtr user thread counting through polling in a interval.
    
    Attempted to fix with correcting the start slave.
    A separate failure in this test that radomly happens on PB near
    
     source include/wait_for_slave_sql_error.inc;
    
    is addressed with adding a debug print-outs via rpl_debug=1.
    Extra info will be necessary to actually tackle this issue (to be repoted once the extra info will
    be available in PB saved test logs).
    9ad3b3f9
    Bug#14258884 RPL.RPL_HEARTBEAT_BASIC FAILS SPORADICALLY ON PB2
    Andrei Elkin authored
    Bug#13627066 RPL.RPL_DEADLOCK_INNODB FAILS ON PB2 SPRADICALLY
    
    Sporadic and long time standing mismatch at the test run
     
      include/start_slave.inc
      SET @@global.event_scheduler=1;
      -Number of received heartbeat events: 0
      +Number of received heartbeat events: 1
    
    was caused by a incorrect assumption the no hearbeat event should
    be sent in the context of that test's snippet.
    In fact, there is no guarantee that empty binlog status won't last
    over the hearbeat period. Even though period of scheduled by the server scheduler
    UPDATE queries is 1/5 th of the heartbeat period, the actual time in between of two
    successive bin-logging actions can last as long as the HB period. That's what PB run
    proves in practice.
    
    Fixed with removing ineffecient piece of the test.
    
    Bug#13627066 RPL.RPL_DEADLOCK_INNODB FAILS ON PB2 SPRADICALLY
    
    A possible reason of SQL thread to fail to increment slave_transaction_retries
    status is a failure to start the slave threads that went unnoticible thanks to unblocking
    style of the slave start. 
    combined with also small of innodb_lock_wait_timeout. That could allow a race of 
    the SQL thread retrying after timeout and the mtr user thread counting through polling in a interval.
    
    Attempted to fix with correcting the start slave.
    A separate failure in this test that radomly happens on PB near
    
     source include/wait_for_slave_sql_error.inc;
    
    is addressed with adding a debug print-outs via rpl_debug=1.
    Extra info will be necessary to actually tackle this issue (to be repoted once the extra info will
    be available in PB saved test logs).
Loading