TOPIC: SR Slave constantly disconnects

SR Slave constantly disconnects 4 months 3 weeks ago #1907

Hi,
I installed HA-Lizard with iscsi-ha in the default configuration on two computers.
Host list:
xenserver1 master
xenserver2 slave

In XenCenter, the iSCSI virtual disk storage status is:
xenserver1 Connected
xenserver2 Connected, then after a while Unplugged, then after a while Connected again
[root@xenserver1 ~]# iscsi-cfg status
-----------------------------------------------------------
| iscsi-ha is in manual mode - current status shown below |
-----------------------------------------------------------
Storage role:   Primary [expected primary]
Replication IP: 10.10.10.1/24 10.10.10.3/24 [10.10.10.3 expected here]
iSCSI target:   Running [expected running]

[root@xenserver2 ~]# iscsi-cfg status
-----------------------------------------------------------
| iscsi-ha is in manual mode - current status shown below |
-----------------------------------------------------------
Storage role:   Secondary [expected secondary]
Replication IP: 10.10.10.2/24 [10.10.10.3 not expected here]
iSCSI target:   Stopped [expected stopped]

[root@xenserver2 ~]# cat /proc/drbd
version: 8.4.3 (api:1/proto:86-101)
srcversion: FB3AC7056350AC64629E395
 1: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----
    ns:0 nr:2830008 dw:2830008 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
Here the problem begins:
[root@xenserver2 log]# tgtadm --lld iscsi --op show --mode target
tgtadm: failed to send request hdr to tgt daemon, Transport endpoint is not connected

[root@xenserver2 log]# pvdisplay
File descriptor 4 (/dev/pts/1) leaked on pvdisplay invocation. Parent PID 15065: bash
File descriptor 8 (pipe:[28560344]) leaked on pvdisplay invocation. Parent PID 15065: bash
  --- Physical volume ---
  PV Name               /dev/sdb
  VG Name               VG_XenStorage-0295f4ac-0c07-60dd-fb5f-e5c130d57ab7
  PV Size               1.82 TiB / not usable 13.09 MiB
  Allocatable           yes
  PE Size               4.00 MiB
  Total PE              476929
  Free PE               220426
  Allocated PE          256503
  PV UUID               3CHkUO-pakZ-i4wY-nEF7-7kM3-Sm0y-x1rXpM
pvdisplay shows only the additional backup disk.
Running /etc/iscsi-ha/scripts/replug_pbd, running service tgtd restart, and rebooting the host did not help.

Help please

SR Slave constantly disconnects 4 months 3 weeks ago #1908

  • Salvatore Costantino
The error message from "tgtadm --lld iscsi --op show --mode target" on the secondary host is normal, since the tgtd daemon runs on only one host, the primary.

Regarding the intermittent loss of connectivity, some more details are needed and some things need to be ruled out.

Firstly, can you ensure that there are no intermittent network problems preventing xenserver2 from reaching 10.10.10.3?
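
A short ping run can miss brief outages, so a longer-running check may be more telling. The following is only a minimal sketch, assuming a POSIX shell and the iputils `ping`; `watch_target`, `PROBE_CMD` and `PROBE_INTERVAL` are hypothetical names made up for this example and are not part of iscsi-ha.

```shell
# Hypothetical helper: probe TARGET once per interval, COUNT times, and print
# a timestamped line for every missed reply plus a final summary.
# PROBE_CMD and PROBE_INTERVAL are assumed override knobs, not iscsi-ha settings.
watch_target() (
    target=$1
    count=$2
    probe=${PROBE_CMD:-ping -c 1 -W 1}   # one echo request, 1 s timeout
    interval=${PROBE_INTERVAL:-1}
    i=0
    failures=0
    while [ "$i" -lt "$count" ]; do
        # $probe is left unquoted on purpose so it splits into command + options
        if ! $probe "$target" >/dev/null 2>&1; then
            failures=$((failures + 1))
            echo "$(date '+%b %e %H:%M:%S') no reply from $target"
        fi
        i=$((i + 1))
        sleep "$interval"
    done
    echo "failures: $failures/$count"
)
```

For example, `watch_target 10.10.10.3 3600` would probe the virtual IP for roughly an hour. The timestamps use the syslog layout so that any misses can be lined up against the iscsi-ha log. Note that this only exercises ICMP; it says nothing about the iSCSI port itself.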

If networking proves to be OK, an iscsi-ha log that spans the time of lost connectivity would be helpful.
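
To cut the log down to the window around a disconnect, something like the sketch below could be used. It assumes syslog-style lines that start with month, day and HH:MM:SS, and a window that stays within a single day; `log_window` is a hypothetical helper name.

```shell
# Hypothetical helper: print only the lines from stdin whose syslog timestamp
# falls on MONTH DAY between START and END (HH:MM:SS, inclusive).
# Usage: log_window MONTH DAY START END < logfile
log_window() {
    awk -v mon="$1" -v day="$2" -v start="$3" -v end="$4" \
        '$1 == mon && $2 == day && $3 >= start && $3 <= end'
}
```

For example, `log_window Oct 3 02:00:00 02:05:00 < /var/log/iscsi-ha.log`, where the log path is an assumption and should be adjusted to wherever iscsi-ha logs on your host. The time comparison is a plain string comparison, which works because HH:MM:SS is fixed-width.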

SR Slave constantly disconnects 4 months 3 weeks ago #1909

Thanks for the reply.
The network seems OK:
[root@xenserver2 ~]# ping 10.10.10.3
PING 10.10.10.3 (10.10.10.3) 56(84) bytes of data.
64 bytes from 10.10.10.3: icmp_seq=1 ttl=64 time=0.085 ms
64 bytes from 10.10.10.3: icmp_seq=2 ttl=64 time=0.125 ms
64 bytes from 10.10.10.3: icmp_seq=3 ttl=64 time=0.096 ms
64 bytes from 10.10.10.3: icmp_seq=4 ttl=64 time=0.104 ms
64 bytes from 10.10.10.3: icmp_seq=5 ttl=64 time=0.095 ms
64 bytes from 10.10.10.3: icmp_seq=6 ttl=64 time=0.117 ms
64 bytes from 10.10.10.3: icmp_seq=7 ttl=64 time=0.103 ms
64 bytes from 10.10.10.3: icmp_seq=8 ttl=64 time=0.115 ms
64 bytes from 10.10.10.3: icmp_seq=9 ttl=64 time=0.106 ms
64 bytes from 10.10.10.3: icmp_seq=10 ttl=64 time=0.109 ms
64 bytes from 10.10.10.3: icmp_seq=11 ttl=64 time=0.109 ms
64 bytes from 10.10.10.3: icmp_seq=12 ttl=64 time=0.103 ms
64 bytes from 10.10.10.3: icmp_seq=13 ttl=64 time=1.15 ms
64 bytes from 10.10.10.3: icmp_seq=14 ttl=64 time=0.110 ms
64 bytes from 10.10.10.3: icmp_seq=15 ttl=64 time=0.112 ms

[root@xenserver2 ~]# ifconfig
...
xenbr1: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
        inet 10.10.10.2  netmask 255.255.255.0  broadcast 10.10.10.255
        ether 84:2b:2b:45:20:f9  txqueuelen 0  (Ethernet)
        RX packets 52142421  bytes 71806928844 (66.8 GiB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 24551603  bytes 2059235679 (1.9 GiB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0

But the iscsi-ha log shows:
Oct  3 02:02:22 xenserver2 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
Oct  3 02:02:23 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Successfully re-plugged PBD: b2b6f8d4-8d39-4fef-750d-69647b9086f9
Oct  3 02:02:26 xenserver2 iscsi-ha: 17856 Spawning new instance of iscsi-ha
Oct  3 02:02:26 xenserver2 iscsi-ha: 17856 check_logger_processes Checking logger processes
Oct  3 02:02:26 xenserver2 iscsi-ha: 17856 check_logger_processes No processes to clear
Oct  3 02:02:26 xenserver2 iscsi-ha:  Normalized ISCSI_TARGET_SERVICE [ tgtd ]
Oct  3 02:02:26 xenserver2 iscsi-ha:  XenServer Major Release = [ 7 ]
Oct  3 02:02:26 xenserver2 iscsi-ha:  system is currently in manual mode - checking manual role
Oct  3 02:02:26 xenserver2 iscsi-ha:  manual role is set to [ become_secondary ]
Oct  3 02:02:26 xenserver2 iscsi-ha:  Mail Spool Directory Found /dev/shm/iscsi-ha-mail
Oct  3 02:02:26 xenserver2 iscsi-ha:  This iteration is count 3501
Oct  3 02:02:26 xenserver2 iscsi-ha:  Checking if this host is a Pool Master or Slave
Oct  3 02:02:26 xenserver2 iscsi-ha:  This host's pool status = slave:172.20.200.21
Oct  3 02:02:26 xenserver2 iscsi-ha:  service_execute: Execute [ status ] on [ iscsi-ha ]
Oct  3 02:02:26 xenserver2 iscsi-ha:  service_execute: System V mode detected
Oct  3 02:02:26 xenserver2 iscsi-ha:  service_execute: [  OK  ]#015iscsi-ha running: 3912
Oct  3 02:02:26 xenserver2 iscsi-ha:  service_execute: Returning exit status [ 0 ]
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 local_ip_list: Local IP list returned 127.0.0.1#012172.20.200.22#01210.10.10.2
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 service_execute: Execute [ status ] on [ tgtd ]
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 service_execute: systemctl mode being used
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 service_execute: ● tgtd.service - tgtd iSCSI target daemon#012   Loaded: loaded (/usr/lib/systemd/system/tgtd.service; disabled; vendor preset: disabled)#012  Drop-In: /etc/systemd/system/tgtd.service.d#012           └─local.conf#012   Active: failed (Result: exit-code) since Wed 2019-10-02 16:16:21 CEST; 9h ago#012  Process: 31097 ExecStop=/usr/sbin/tgtadm --op delete --mode system (code=exited, status=0/SUCCESS)#012  Process: 31092 ExecStop=/usr/sbin/tgt-admin --update ALL -c /dev/null (code=exited, status=0/SUCCESS)#012  Process: 31090 ExecStop=/usr/sbin/tgtadm --op update --mode sys --name State -v offline (code=exited, status=0/SUCCESS)#012  Process: 31052 ExecStartPost=/usr/sbin/tgt-admin -e -c $TGTD_CONFIG (code=exited, status=22)#012  Process: 31050 ExecStartPost=/usr/sbin/tgtadm --op update --mode sys --name State -v offline (code=exited, status=0/SUCCESS)#012  Process: 31012 ExecStartPost=/bin/sleep 5 (code=exited, status=0/SUCCESS)#012  Process: 31011 ExecStart=/usr/sbin/tgtd -f $TGTD_OPTS (code=exited, status=0/SUCCESS)#012 Main PID: 31011 (code=exited, status=0/SUCCESS)
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 service_execute: Returning exit status [ 3 ]
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 iSCSI target: tgtd status stopped. Expected Stopped . [failed (Result: exit-code) since Wed 2019-10-02 16:16:21 CEST; 9h ago]
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 DRBD Running on this host: version: 8.4.3 (api:1/proto:86-101)#012srcversion: FB3AC7056350AC64629E395 #012#012 1: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----#012    ns:0 nr:54109564 dw:54109564 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 validate_drbd_resources_loaded: Checking DRBD has loaded with resources. Checking [ 5 ] > [ 2 ]
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 validate_drbd_resources_loaded: Resources loaded
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 check_drbd_resource_state: DRBD Resource: iscsi1 in Secondary mode
Oct  3 02:02:26 xenserver2 iscsi-ha: 18312 DRBD Resource: iscsi1 in Connected state
Oct  3 02:02:26 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Scanning for Volume Group -> iscsi-sr: 2950cb5d-4db5-2836-a68c-031b41223022
Oct  3 02:02:26 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Volume Group for iSCSI-SR 2950cb5d-4db5-2836-a68c-031b41223022 not found - attemping to re-plug
Oct  3 02:02:26 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: replugging PBD: b2b6f8d4-8d39-4fef-750d-69647b9086f9
Oct  3 02:02:27 xenserver2 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
Oct  3 02:02:32 xenserver2 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
Oct  3 02:02:33 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Successfully re-plugged PBD: b2b6f8d4-8d39-4fef-750d-69647b9086f9
Oct  3 02:02:36 xenserver2 iscsi-ha: 18237 Spawning new instance of iscsi-ha
Oct  3 02:02:36 xenserver2 iscsi-ha: 18237 check_logger_processes Checking logger processes
Oct  3 02:02:36 xenserver2 iscsi-ha: 18237 check_logger_processes No processes to clear
Oct  3 02:02:36 xenserver2 iscsi-ha:  Normalized ISCSI_TARGET_SERVICE [ tgtd ]
Oct  3 02:02:36 xenserver2 iscsi-ha:  XenServer Major Release = [ 7 ]
Oct  3 02:02:36 xenserver2 iscsi-ha:  system is currently in manual mode - checking manual role
Oct  3 02:02:36 xenserver2 iscsi-ha:  manual role is set to [ become_secondary ]
Oct  3 02:02:36 xenserver2 iscsi-ha:  Mail Spool Directory Found /dev/shm/iscsi-ha-mail
Oct  3 02:02:36 xenserver2 iscsi-ha:  This iteration is count 3502
Oct  3 02:02:36 xenserver2 iscsi-ha:  Checking if this host is a Pool Master or Slave
Oct  3 02:02:36 xenserver2 iscsi-ha:  This host's pool status = slave:172.20.200.21
Oct  3 02:02:36 xenserver2 iscsi-ha:  service_execute: Execute [ status ] on [ iscsi-ha ]
Oct  3 02:02:36 xenserver2 iscsi-ha:  service_execute: System V mode detected
Oct  3 02:02:36 xenserver2 iscsi-ha:  service_execute: [  OK  ]#015iscsi-ha running: 3912
Oct  3 02:02:36 xenserver2 iscsi-ha:  service_execute: Returning exit status [ 0 ]
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 local_ip_list: Local IP list returned 127.0.0.1#012172.20.200.22#01210.10.10.2
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 service_execute: Execute [ status ] on [ tgtd ]
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 service_execute: systemctl mode being used
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 service_execute: ● tgtd.service - tgtd iSCSI target daemon#012   Loaded: loaded (/usr/lib/systemd/system/tgtd.service; disabled; vendor preset: disabled)#012  Drop-In: /etc/systemd/system/tgtd.service.d#012           └─local.conf#012   Active: failed (Result: exit-code) since Wed 2019-10-02 16:16:21 CEST; 9h ago#012  Process: 31097 ExecStop=/usr/sbin/tgtadm --op delete --mode system (code=exited, status=0/SUCCESS)#012  Process: 31092 ExecStop=/usr/sbin/tgt-admin --update ALL -c /dev/null (code=exited, status=0/SUCCESS)#012  Process: 31090 ExecStop=/usr/sbin/tgtadm --op update --mode sys --name State -v offline (code=exited, status=0/SUCCESS)#012  Process: 31052 ExecStartPost=/usr/sbin/tgt-admin -e -c $TGTD_CONFIG (code=exited, status=22)#012  Process: 31050 ExecStartPost=/usr/sbin/tgtadm --op update --mode sys --name State -v offline (code=exited, status=0/SUCCESS)#012  Process: 31012 ExecStartPost=/bin/sleep 5 (code=exited, status=0/SUCCESS)#012  Process: 31011 ExecStart=/usr/sbin/tgtd -f $TGTD_OPTS (code=exited, status=0/SUCCESS)#012 Main PID: 31011 (code=exited, status=0/SUCCESS)
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 service_execute: Returning exit status [ 3 ]
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 iSCSI target: tgtd status stopped. Expected Stopped . [failed (Result: exit-code) since Wed 2019-10-02 16:16:21 CEST; 9h ago]
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 DRBD Running on this host: version: 8.4.3 (api:1/proto:86-101)#012srcversion: FB3AC7056350AC64629E395 #012#012 1: cs:Connected ro:Secondary/Primary ds:UpToDate/UpToDate C r-----#012    ns:0 nr:54112628 dw:54112628 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 validate_drbd_resources_loaded: Checking DRBD has loaded with resources. Checking [ 5 ] > [ 2 ]
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 validate_drbd_resources_loaded: Resources loaded
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 check_drbd_resource_state: DRBD Resource: iscsi1 in Secondary mode
Oct  3 02:02:36 xenserver2 iscsi-ha: 18898 DRBD Resource: iscsi1 in Connected state
Oct  3 02:02:37 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Scanning for Volume Group -> iscsi-sr: 2950cb5d-4db5-2836-a68c-031b41223022
Oct  3 02:02:37 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Volume Group for iSCSI-SR 2950cb5d-4db5-2836-a68c-031b41223022 not found - attemping to re-plug
Oct  3 02:02:37 xenserver2 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: replugging PBD: b2b6f8d4-8d39-4fef-750d-69647b9086f9
Oct  3 02:02:37 xenserver2 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
It does not find the volume group, so it replugs the PBD.

SR Slave constantly disconnects 4 months 2 weeks ago #1910

I read the documentation on DRBD and tgtd, and I don't know where to look. Maybe this is the problem:
tail /var/log/daemon.log  
Oct  3 12:36:39 xenserver2 iscsid: Connection17228:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is shutdown.
Oct  3 12:36:39 xenserver2 iscsid: Connection17229:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is operational now
Oct  3 12:36:49 xenserver2 multipathd: sdc: remove path (uevent)
Oct  3 12:36:49 xenserver2 iscsid: Connection17229:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is shutdown.
Oct  3 12:36:49 xenserver2 iscsid: Connection17230:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is operational now
Oct  3 12:36:49 xenserver2 multipathd: sdc: add path (uevent)
Oct  3 12:36:49 xenserver2 multipathd: Warning: should_multipath() only based on wwids. dev = sdc
Oct  3 12:36:59 xenserver2 multipathd: sdc: remove path (uevent)
Oct  3 12:36:59 xenserver2 iscsid: Connection17230:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is shutdown.
Oct  3 12:36:59 xenserver2 multipathd: sdc: add path (uevent)
Oct  3 12:36:59 xenserver2 multipathd: Warning: should_multipath() only based on wwids. dev = sdc
Oct  3 12:37:00 xenserver2 iscsid: Connection17231:0 to [target: iqn.2015.com.halizard:noSAN, portal: 10.10.10.3,3260] through [iface: default] is operational now
But multipathing is disabled in XenCenter.

On the master, iscsi-ha logs:
Oct  3 13:32:49 xenserver1 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
Oct  3 13:32:50 xenserver1 iscsi-ha: 8558 Spawning new instance of iscsi-ha
Oct  3 13:32:50 xenserver1 iscsi-ha: 8558 check_logger_processes Checking logger processes
Oct  3 13:32:50 xenserver1 iscsi-ha: 8558 check_logger_processes No processes to clear
Oct  3 13:32:50 xenserver1 iscsi-ha:  Normalized ISCSI_TARGET_SERVICE [ tgtd ]
Oct  3 13:32:50 xenserver1 iscsi-ha:  XenServer Major Release = [ 7 ]
Oct  3 13:32:50 xenserver1 iscsi-ha:  system is currently in manual mode - checking manual role
Oct  3 13:32:50 xenserver1 iscsi-ha:  manual role is set to [ become_primary ]
Oct  3 13:32:50 xenserver1 iscsi-ha:  Mail Spool Directory Found /dev/shm/iscsi-ha-mail
Oct  3 13:32:50 xenserver1 iscsi-ha:  This iteration is count 6899
Oct  3 13:32:50 xenserver1 iscsi-ha:  Checking if this host is a Pool Master or Slave
Oct  3 13:32:50 xenserver1 iscsi-ha:  This host's pool status = master
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: Execute [ status ] on [ iscsi-ha ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: System V mode detected
Oct  3 13:32:50 xenserver1 iscsi-ha:  auto_plug_pbd: Found LVMoISCSI SR List: 2950cb5d-4db5-2836-a68c-031b41223022
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: [  OK  ]#015iscsi-ha running: 2831
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: Returning exit status [ 0 ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 DRBD Running on this host: version: 8.4.3 (api:1/proto:86-101) srcversion: FB3AC7056350AC64629E395 1: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r----- ns:120111940 nr:0 dw:212152032 dr:97670596 al:41522 bm:4545 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 check_drbd_resource_state: DRBD Resource: iscsi1 in Primary mode
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 DRBD Resource: iscsi1 in Connected state
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: Execute [ status ] on [ tgtd ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: systemctl mode being used
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: ● tgtd.service - tgtd iSCSI target daemon#012   Loaded: loaded (/usr/lib/systemd/system/tgtd.service; disabled; vendor preset: disabled)#012  Drop-In: /etc/systemd/system/tgtd.service.d#012           └─local.conf#012   Active: active (running) since Tue 2019-10-01 12:09:36 CEST; 2 days ago#012  Process: 5848 ExecStartPost=/usr/sbin/tgtadm --op update --mode sys --name State -v ready (code=exited, status=0/SUCCESS)#012  Process: 5694 ExecStartPost=/usr/sbin/tgt-admin -e -c $TGTD_CONFIG (code=exited, status=0/SUCCESS)#012  Process: 5689 ExecStartPost=/usr/sbin/tgtadm --op update --mode sys --name State -v offline (code=exited, status=0/SUCCESS)#012  Process: 5348 ExecStartPost=/bin/sleep 5 (code=exited, status=0/SUCCESS)#012 Main PID: 5347 (tgtd)#012   CGroup: /system.slice/tgtd.service#012           └─5347 /usr/sbin/tgtd -f
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 service_execute: Returning exit status [ 0 ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 iSCSI target: tgtd status = OK. [ active (running) since Tue 2019-10-01 12:09:36 CEST; 2 days ago ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 local_ip_list: Local IP list returned 127.0.0.1#01210.10.10.1#01210.10.10.3#012172.20.200.21
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 CHECKING IP 127.0.0.1
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 CHECKING IP 10.10.10.1
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 CHECKING IP 10.10.10.3
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 Virtual IP: 10.10.10.3 discovered on host xenserver1
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 send_replication_network_arp: Sending ARP update to peer
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 send_replication_network_arp: IP address list for [ xenbr1 ] = [ 10.10.10.1#01210.10.10.3 ]
Oct  3 13:32:50 xenserver1 iscsi-ha: 9145 send_replication_network_arp: Updating ARP for device [ xenbr1 ] IP [ 10.10.10.1 ]
Oct  3 13:32:50 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Scanning for Volume Group -> iscsi-sr: 2950cb5d-4db5-2836-a68c-031b41223022
Oct  3 13:32:50 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Volume Group for iSCSI-SR found OK: 2950cb5d-4db5-2836-a68c-031b41223022
Oct  3 13:32:51 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: ARPING 10.10.10.1 from 10.10.10.1 xenbr1
Oct  3 13:32:51 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Sent 2 probes (2 broadcast(s))
Oct  3 13:32:51 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Received 0 response(s)
Oct  3 13:32:51 xenserver1 iscsi-ha: 9145 send_replication_network_arp: Updating ARP for device [ xenbr1 ] IP [ 10.10.10.3 ]
Oct  3 13:32:52 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: ARPING 10.10.10.3 from 10.10.10.3 xenbr1
Oct  3 13:32:52 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Sent 2 probes (2 broadcast(s))
Oct  3 13:32:52 xenserver1 iscsi-ha-NOTICE-/etc/iscsi-ha/init/iscsi-ha.mon: Received 0 response(s)
Oct  3 13:32:54 xenserver1 iscsi-ha:  iscsi-ha Watchdog: iscsi-ha running - OK
[root@xenserver1 log]#

Last edit: by Alfred.

SR Slave constantly disconnects 4 months 2 weeks ago #1911

  • Salvatore Costantino
There are 2 possibilities in my opinion.
1) a race condition caused by iscsi-ha, which interferes with XenServer's own attempts to replug the SR

2) a problem in XenServer outside of iscsi-ha

A simple test that would narrow things down would be to disable iscsi-ha on the secondary host. This would prevent the replug attempt. If the problem continues, the cause is something in XenServer or the underlying host. If it stops, then we need to look at the replug_pbd logic in iscsi-ha.

Try this on the secondary host:

service iscsi-ha-watchdog stop && service iscsi-ha stop

Let me know if the problem persists after stopping the services.
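
To compare the behaviour before and after stopping the services, the replug attempts can be counted per minute from the log. This is a rough sketch only: it assumes the "replugging PBD" wording seen in the log excerpts earlier in this thread, and `replugs_per_minute` is a hypothetical helper name.

```shell
# Hypothetical helper: read syslog-style lines on stdin and print one line per
# minute in which at least one "replugging PBD" message appears, followed by
# the number of such messages in that minute.
replugs_per_minute() {
    awk '/replugging PBD/ {
        key = $1 " " $2 " " substr($3, 1, 5)      # month, day, HH:MM
        if (key != prev && prev != "") print prev, count
        if (key != prev) count = 0
        prev = key
        count++
    }
    END { if (prev != "") print prev, count }'
}
```

Feed it the iscsi-ha log on stdin, for example `replugs_per_minute < /var/log/iscsi-ha.log` (the path is an assumption). With both services stopped on the secondary, no new minutes should show up in the output.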

SR Slave constantly disconnects 4 months 2 weeks ago #1912

Thanks, the problem disappeared! But why?
vgscan:
[root@xenserver2 /]# vgscan
File descriptor 4 (/dev/pts/2) leaked on vgscan invocation. Parent PID 27481: bash
File descriptor 8 (pipe:[50749070]) leaked on vgscan invocation. Parent PID 27481: bash
  Reading all physical volumes.  This may take a while...
  Found volume group "VG_XenStorage-0295f4ac-0c07-60dd-fb5f-e5c130d57ab7" using metadata type lvm2
[root@xenserver2 /]#
It does not show the volume group for the SR returned by xe sr-list:
[root@xenserver2 /]# xe sr-list type=lvmoiscsi --minimal
2950cb5d-4db5-2836-a68c-031b41223022
[root@xenserver2 /]#

Last edit: by Alfred.