Difference between revisions of "SAC:Admin and Troubleshooting"

From OSGeo
Jump to navigation Jump to search
(Emergency plans moved here)
(Contacts in SAC:Primary Administrators)
Line 1: Line 1:
 
= Troubleshooting =  
 
= Troubleshooting =  
 +
 +
[[SAC:Primary Administrators]]
  
 
== NFS Mount Missing ==  
 
== NFS Mount Missing ==  
Line 26: Line 28:
  
 
* ISP/DNS problem: what to do? do we need to call anyone?
 
* ISP/DNS problem: what to do? do we need to call anyone?
* hardware reset: Shawn Barnes (+1 613.565.5056 - Ottawa business hours), Howard Butler, Tyler Mitchell, Frank Warmderdam (+1 613.754.2041 - anytime). One option is a power cycle on the UPS to restart osgeo.org, using the "Reboot Immediate" item on the UPS.
+
* hardware reset: Contact [[SAC:Primary Administrators]]. One option is a power cycle on the UPS to restart osgeo.org, using the "Reboot Immediate" item on the UPS.
  
 
TODO: Define rescue plan with responsible people
 
TODO: Define rescue plan with responsible people
Line 32: Line 34:
 
=== Contact User with Shell Access ===
 
=== Contact User with Shell Access ===
 
If services or o/s need restarting or something else needs emergency attention contact one of the following people with shell access directly:
 
If services or o/s need restarting or something else needs emergency attention contact one of the following people with shell access directly:
* Tyler Mitchell - +1-250-277-1621 - tmitchell at osgeo.org - timezone GMT-7
+
* Contact [[SAC:Primary Administrators]]
* who else...?
 
  
 
=== PEER 1 Trouble Ticket Process ===
 
=== PEER 1 Trouble Ticket Process ===
 
TODO: add details here or point to elsewhere?
 
TODO: add details here or point to elsewhere?
 +
* Contact [[SAC:Primary Administrators]]
  
 
[[Category: Infrastructure]]
 
[[Category: Infrastructure]]

Revision as of 02:35, 6 September 2007

Troubleshooting

SAC:Primary Administrators

NFS Mount Missing

For some reason, the /home mount on the various servers is often lost.

To fix...

LDAP Server Down

The LDAP server runs on .220, and if it needs to be restarted it can be done as root with the command:

/opt/fedora/slapd-ldapt/start-slapd

Peer1 LDAP Server Hanging

If there is a power outage like there was on 2-20-07, slapd's database will need to be recovered.

sudo /usr/sbin/slapd_db_recover -h /var/lib/ldap/osgeo2

MySQL Cleanup for Drupal

If a report in drupal starts saying a table is crashed and needs repair, log into mysql and run the following, for example for the cache table:

REPAIR TABLE cache QUICK;

Entire www.osgeo.org down

  • ISP/DNS problem: what to do? do we need to call anyone?
  • hardware reset: Contact SAC:Primary Administrators. One option is a power cycle on the UPS to restart osgeo.org, using the "Reboot Immediate" item on the UPS.

TODO: Define rescue plan with responsible people

Contact User with Shell Access

If services or o/s need restarting or something else needs emergency attention contact one of the following people with shell access directly:

PEER 1 Trouble Ticket Process

TODO: add details here or point to elsewhere?