Difference between revisions of "SAC:Admin and Troubleshooting"

From OSGeo
Jump to navigation Jump to search
(Emergency plans moved here)
(Add Administrative Access heading and link.)
(11 intermediate revisions by 5 users not shown)
Line 1: Line 1:
 +
= Administrative Access =
 +
[[SAC#Members|SAC Members]] may have [[SAC:Administrative Access|administrative access]] to one or more services, depending on their responsibilities.
 +
 
= Troubleshooting =  
 
= Troubleshooting =  
  
== NFS Mount Missing ==
+
* [[SAC:Primary Administrators]] (with contact info)
 +
* Discuss issues in irc (#telascience)
  
For some reason, the /home mount on the various servers is often lost.
+
== VM hanging on OSUOSL ==
  
To fix...
+
see [[OSL]] for how to open a ticket with OSUOSL's support
  
 
== LDAP Server Down ==
 
== LDAP Server Down ==
  
The LDAP server runs on .220, and if it needs to be restarted it can be done as root with the command:
+
The LDAP server runs on ldap.osgeo.org ([[SAC_Service_Status#Secure|secure vm]]).
 
+
If it's down, refer to [[SAC:LDAP Restarting LDAP server]]
/opt/fedora/slapd-ldapt/start-slapd
 
 
 
== Peer1 LDAP Server Hanging ==
 
 
 
If there is a power outage like there was on 2-20-07, slapd's database will need to be recovered.
 
  
 
  sudo /usr/sbin/slapd_db_recover -h /var/lib/ldap/osgeo2
 
  sudo /usr/sbin/slapd_db_recover -h /var/lib/ldap/osgeo2
Line 23: Line 22:
 
  REPAIR TABLE cache QUICK;
 
  REPAIR TABLE cache QUICK;
  
== Entire www.osgeo.org down ==
 
 
* ISP/DNS problem: what to do? do we need to call anyone?
 
* hardware reset: Shawn Barnes (+1 613.565.5056 - Ottawa business hours), Howard Butler, Tyler Mitchell, Frank Warmderdam (+1 613.754.2041 - anytime). One option is a power cycle on the UPS to restart osgeo.org, using the "Reboot Immediate" item on the UPS.
 
 
TODO: Define rescue plan with responsible people
 
 
=== Contact User with Shell Access ===
 
If services or o/s need restarting or something else needs emergency attention contact one of the following people with shell access directly:
 
* Tyler Mitchell - +1-250-277-1621 - tmitchell at osgeo.org - timezone GMT-7
 
* who else...?
 
  
=== PEER 1 Trouble Ticket Process ===
 
TODO: add details here or point to elsewhere?
 
  
 
[[Category: Infrastructure]]
 
[[Category: Infrastructure]]

Revision as of 15:32, 19 February 2018

Administrative Access

SAC Members may have administrative access to one or more services, depending on their responsibilities.

Troubleshooting

VM hanging on OSUOSL

see OSL for how to open a ticket with OSUOSL's support

LDAP Server Down

The LDAP server runs on ldap.osgeo.org (secure vm). If it's down, refer to SAC:LDAP Restarting LDAP server

sudo /usr/sbin/slapd_db_recover -h /var/lib/ldap/osgeo2

MySQL Cleanup for Drupal

If a report in drupal starts saying a table is crashed and needs repair, log into mysql and run the following, for example for the cache table:

REPAIR TABLE cache QUICK;