Colossus Guardian Failover Procedure

Now that colossus has become a file server, this procedure is rather complicated. This hasn't actually been tried, but these are the basic steps that should be followed:

If guardian (g) dies, we recreate it on colossus (c). Power down both computers, take the bottom two internal hard drives (the boot drives) out of c, and store them safely. Then remove all the drives from g and put them in the corresponding slots in c. Then boot. From here, different things could or should be done.

At this point g is back up, but c and all the files served from it are down. Here is what I'd do:

  1. The fibre-channel drivers will not be installed, so that must be done first. The current driver package is /home/data197/jwilson/sun/sol8/JNIC.pkg, so cd to /home/data197/jwilson/sun/sol8 and run pkgadd -d JNIC.pkg, then reboot -- -r
  2. When the machine comes back up, run format(1M) and check that all the FC drives are there. They should look like this:

       2. c3t0d0 <SEAGATE-ST173404FCV-4303 cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,2000/fibre-channel@1/sd@0,0
       3. c3t1d0 <SEAGATE-ST173404FCV-4303 cyl 14087 alt 2 hd 24 sec 424>
          /pci@1f,2000/fibre-channel@1/sd@1,0
       4. c3t2d0 <SEAGATE-ST173404FCV-4303 cyl 14087 alt 2 hd 24 sec 424>

  3. I keep copies of /etc/vfstab in /home/jwilson/config/(hostname), so get /home/jwilson/config/colossus/vfstab. It may need modification because the controller numbers may have changed (e.g. the c3 in c3t0d0s6 may now be c5, giving c5t0d0s6). Make those changes, recreate the directories in /export, and mount the file systems.
  4. The last things to do are to export the new file systems (there is also a copy of /etc/dfs/dfstab in /home/jwilson/config/colossus), and to bring up the virtual interface with c's IP address. The interface should already be set up; do ifconfig -a and you should see:
lo0: flags=849<UP,LOOPBACK,RUNNING,MULTICAST> mtu 8232
inet 127.0.0.1 netmask ff000000
ge0: flags=863<UP,BROADCAST,NOTRAILERS,RUNNING,MULTICAST> mtu 1500
inet 128.111.101.250 netmask ffffff00 broadcast 128.111.101.255
ether 8:0:20:9f:2a:8e
ge0:1: flags=842<BROADCAST,RUNNING,MULTICAST> mtu 1500
inet 128.111.101.249 netmask ffffff00 broadcast 128.111.101.255
 
at which point you can do an ifconfig ge0:1 up. Also, in /etc/rc3.d there is a fo_S99ifcon that could be renamed to S99ifcon to automatically bring up the interface at boot time.
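
The controller renumbering in the vfstab step above can be done by hand in an editor, or with a small filter along the lines of the sketch below. It assumes the saved vfstab uses the usual /dev/dsk/cNtNdNsN and /dev/rdsk/cNtNdNsN device names. The name remap_controller is made up for illustration, and 3 and 5 are only the example controller numbers from that step, so check the format(1M) output for the real ones.

```shell
#!/bin/sh
# Sketch only: remap_controller is a hypothetical helper, not an existing tool.
# Usage: remap_controller OLD NEW < saved-vfstab > new-vfstab
# Rewrites cOLD to cNEW in disk device names, e.g. c3t0d0s6 -> c5t0d0s6.
remap_controller() {
    old=$1
    new=$2
    # "dsk/c3t" matches both /dev/dsk/c3t... and /dev/rdsk/c3t...;
    # requiring the trailing "t" keeps c3 from matching c30, c31, and so on.
    sed "s#dsk/c${old}t#dsk/c${new}t#g"
}
```

Something like remap_controller 3 5 < /home/jwilson/config/colossus/vfstab > /tmp/vfstab.new (the output path is arbitrary) writes a modified copy. Look it over before putting anything into /etc/vfstab.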
 
If colossus dies, the main thing will be to move the Fibre Channel disks to g. This will basically involve moving the FC cards from c to g and then following steps 1-4 above. Of course, ge0:1 has the IP address 128.111.101.250 on colossus. There is also a spare FC card on the third shelf next to my desk, the shelf with all the TrUX documentation.
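
In either direction the failover is not finished until the virtual interface is actually up. A rough way to check this from ifconfig -a output is sketched below. The name iface_is_up is invented for this example, and the sketch assumes the flags line looks like the output shown earlier (interface name at the start of the line, followed by flags=...<...>).

```shell
#!/bin/sh
# Sketch only: iface_is_up is a hypothetical helper, not an existing tool.
# Usage: ifconfig -a | iface_is_up ge0:1
# Exit status 0 if the named interface's flags include UP, non-zero otherwise.
iface_is_up() {
    # First grep picks out the flags line for exactly this interface
    # ("ge0: flags=" does not match "ge0:1: flags=" and vice versa);
    # second grep looks for UP as a whole flag between "<", "," and ">".
    grep "^$1: flags=" | grep '[<,]UP[,>]' >/dev/null
}
```

For example, ifconfig -a | iface_is_up ge0:1 || echo "ge0:1 is not up" prints the warning for the output shown earlier, where ge0:1 is configured but lacks the UP flag.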
As always, see me with any questions or comments. - James
Last updated 7/3/01