Emergency Maintenance
Emergency Maintenance: VPN server to be Reboot
Problem: VPN server to be Reboot Cause: Attempt to Fix VPN session instability Issue Affects: Case VPN service Started: 07/16/2008 12:00 PM Resolved: 07/16/2008 12:10 PM
Notes:
7/16 15:30 Engineer switched VPN server hardware.
7/16 12:18 VPN server has been power cycled. VPN service has been restored. Engineer will continue monitor the VPN service stability issue.
VPN server will be power cycled in attempt to correct the VPN instability issue.
Created: 07/16/2008 09:59:29 by wxc16
Updates: 07/16/2008 12:18:15 by wxc16, 07/16/2008 15:43:48 by wxc16
Problem Report
Problem Report: HIgh temp alarm in KSL data center
Problem: HIgh temp alarm in KSL data center Cause: TBD Affects: nobody...yet Started: 07/17/2008 10:37 AM Resolved:
Notes:
Plant services has been contacted
Created: 07/17/2008 11:50:37 by jan3
Updates:
Problem Report: Norton SER 1 is overheating
Problem: Norton SER 1 is overheating Cause: Cooling problem in Norton SER Affects: Currently module 7 of network switch Started: 07/16/2008 06:26 PM Resolved:
Notes:
2008 Jul 16 18:26:37 EDT -04:00 %SYS-2-MOD_TEMPMINORFAIL:Module 7 minor temperature threshold exceeded
2008 Jul 16 18:26:37 EDT -04:00 %SNMP-5-ENTITYMODTRAP:Module 7 status changed to "failed(7)"
Created: 07/17/2008 11:15:28 by roo
Updates:
Problem Report: Bingham hub overheating
Problem: Bingham hub overheating Cause: Cooling problem in Bingham Hub Affects: See note Started: 07/17/2008 12:00 AM Resolved:
Notes:
[2008 July 18th Thursday 06:00 AM]
The A/C Unit wasn't running right again this morning,
so therefore, first completely shut-down and shut-off
its entire electrical circuit, second allowed the whole
circuit to rest for a good while, while openning
the door to the Bingham Hub in the mean-while, to allow
the building A/C to cool and dry the room, from
the outside of the room, third turn the whole circuit
back on, watching it for a little while, to make sure
that it at least stays on, up and running, for a short
while, but with the outside humidity already more than
fifty percent now, and the out side temperature predicted
to rise up to the nineties again today, just like
yesterday, which would to keep a watch on the room's
environment conditions, throughout the rest of the day,
today.
[2008 July 17th Thursday 09:00 AM]
The Circuit Breaker for the A/C Unit wasn't tripped, but
the A/C Unit needed to be restarted again this morning.
[2008 July 17th Thursday 12:00 AM]
Affects 2 modules on Hub 1 and 1 module on hub 2
bingham-h0-e1
1 0009.11f7.e830 to 0009.11f7.e83f 1.0 7.2(1) 8.5(0.46)RFW MinFail
9 Distributed Forwarding Card WS-F6700-DFC3A SAD074805CH 1.0 MinFail
bingham-h0-e2
9 Distributed Forwarding Card WS-F6700-DFC3A SAD074805N0 1.0 MinFail
Created: 07/17/2008 05:43:34 by roo
Updates: 07/17/2008 09:11:04 by euw, 07/18/2008 07:30:24 by euw
Problem Report: Stone Hub is overheating
Problem: Stone Hub is overheating Cause: no cooling and drying Affects: Switch Module Five Started: 07/16/2008 05:29 AM Resolved:
Notes:
[Wed Jul 16 05:29:43 2008]
stone-h0-e1#show environment temperature
VTT 1 outlet temperature: 34C
VTT 2 outlet temperature: 39C
VTT 3 outlet temperature: 47C
module 1 outlet temperature: 49C
module 1 inlet temperature: 33C
module 5 outlet temperature: 42C
module 5 inlet temperature: 35C
module 5 device-1 temperature: 47C
module 5 device-2 temperature: 47C
module 5 asic-1 (SSO-1) temp: 36C
module 5 asic-2 (SSO-2) temp: 35C
module 5 asic-3 (SSO-3) temp: 35C
module 5 asic-4 (SSO-4) temp: 35C
module 5 asic-5 (SSA-1) temp: 35C
module 5 asic-6 (HYPERION-1) temp: 36C
module 5 RP outlet temperature: 41C
module 5 RP inlet temperature: 42C
module 5 EARL outlet temperature: 48C
module 5 EARL inlet temperature: 31C
stone-h0-e1#
Created: 07/16/2008 08:04:34 by euw
Updates:
Problem Report: stone-h0-e1-lpbk1 - conn
Problem: stone-h0-e1-lpbk1 - conn Cause: unknown Affects: all of the Stone Commons Cisco Area Started: 07/16/2008 05:29 AM Resolved: 07/16/2008 07:44 AM
Notes:
[Wed Jul 16 07:44:42 2008]
Remedied.
[Wed Jul 16 05:29:43 2008]
A Network Technician is on his way to investigate right now.
Created: 07/16/2008 07:16:36 by euw
Updates: 07/16/2008 07:52:16 by euw
Problem Report: Cutler SER overheating
Problem: Cutler SER overheating Cause: no cooling Affects: Switch module 7 Started: 07/15/2008 05:50 PM Resolved:
Notes:
2008 Jul 15 17:48:20 EDT -04:00 %SNMP-5-ENVMONTEMPTRAP:Environmental Monitor Temperature Trap: Module 7 Intake state: warning
2008 Jul 15 17:48:20 EDT -04:00 %SNMP-5-ENVMONTEMPTRAP:Environmental Monitor Temperature Trap: Module 7 Intake state: warning
7 7 96 10/100BaseTX Ethernet WS-X6148X2-RJ-45 yes temp-minor
Created: 07/15/2008 17:52:32 by roo
Updates:
Problem Report: Docshare unavailable
Problem: Docshare unavailable Cause: unknown - the systems appears to have hung Affects: Docshare users Started: 07/15/2008 02:45 AM Resolved: 07/15/2008 08:41 AM
Notes:
[07/15/08 08:40 AM] - The system had lost its virtual connection to disk and needed to be rebooted.
Server engineering is working on the problem. We will post updates as available.
Created: 07/15/2008 07:46:43 by dak
Updates: 07/15/2008 08:41:12 by dak
Problem Report: VPN connectivity issues
Problem: VPN connectivity issues Cause: Unknown Affects: At least users using VPN from the wireless network on campus; possibly others Started: 07/14/2008 10:00 AM Resolved:
Notes:
We are receiving reports of connectivity issues with VPN -- both initially and problems using the Internet after getting connected. Network Engineering is investigating.
Created: 07/14/2008 11:07:25 by cpr
Updates: 07/14/2008 11:16:20 by cpr
Problem Report: VPN Connectivity Lost (7/11 11:00pm)
Problem: VPN Connectivity Lost (7/11 11:00pm) Cause: Unknown Affects: Case VPN Services Started: 07/11/2008 11:00 PM Resolved: 07/11/2008 11:15 PM
Notes:
Engineer rebooted the VPN server. VPN Connection looks more stable now at the point. Engineer will continue to monitor VPN service status.
Engineer was notified of problem around 10:50pm. Engineer is investigating the issue right now.
Created: 07/11/2008 23:07:00 by wxc16
Updates: 07/11/2008 23:12:13 by wxc16
Problem Report: VPN Clients lost connectivity
Problem: VPN Clients lost connectivity Cause: Bad network cable found Affects: Case VPN service Started: 07/10/2008 02:30 PM Resolved: 07/10/2008 02:55 PM
Notes:
Bad cable replaced. Service restored.
VPN users experience lost of connectivity to on and off campus due to a bad network cable. Engineer is currently replacing the bad cable. Services should be restore in 10 minutes.
Created: 07/10/2008 14:55:50 by wxc16
Updates:
Problem Report: VPN Clients lost connectivity
Problem: VPN Clients lost connectivity Cause: Bad network cable found Affects: Case VPN service Started: 07/10/2008 02:30 PM Resolved: 07/10/2008 02:55 PM
Notes:
Bad cable replaced. Service restored
VPN users experience lost of connectivity to on and off campus due to a bad network cable. Engineer is currently replacing the bad cable. Services should be restore in 10 minutes.
Created: 07/10/2008 14:44:18 by wxc16
Updates: 07/10/2008 14:58:03 by wxc16
Problem Report: Yahoo.com issues
Problem: Yahoo.com issues Cause: security feed included ip space that broke some functionality on site Affects: yahoo.com site Started: 07/10/2008 11:23 AM Resolved: 07/10/2008 11:23 AM
Notes:
security feed included ip space that broke some functionality on site
Created: 07/10/2008 11:25:59 by lxc152
Updates:
Problem Report: Legato Networker backup system is down
Problem: Legato Networker backup system is down Cause: corrupted configuration file Affects: All Networker backup clients Started: 07/09/2008 12:01 PM Resolved: 07/10/2008 01:50 AM
Notes:
Service was restored last night. We expect minor issues over the next couple days as we clear the backlog of disk-to-tape migration & get all of the regularly scheduled backup jobs running again, but operations are essentially back to normal.
During a reconfiguration of the backup system (to make it more resilient to externally-caused failures like this weekend's power & A/C issue), a critical configuration file became corrupted such that backup system was no longer runnable at all.
The most critical issue has been resolved, but the system will not be usable until we complete a large amount of reconfiguration work. We expect to have the system functional again by noon tomorrow (Thursday 7/10) for restores, and regularly scheduled backups should begin running Thursday night/Friday morning.
Created: 07/09/2008 22:17:16 by jan3
Updates: 07/10/2008 13:50:12 by jan3
Problem Report: Clients using VPN can't get to sites on and off campus
Problem: Clients using VPN can't get to sites on and off campus Cause: Unknown Affects: Case VPN Users Started: 07/09/2008 08:38 PM Resolved: 07/09/2008 08:48 PM
Notes:
VPN clients can't get to resource on or off campus. Engineers are looking into the problem.
Problem was determined to be bad network cable.
Created: 07/09/2008 20:41:57 by dnd
Updates: 07/09/2008 20:58:19 by dnd
Problem Report: IP registration of new systems problem
Problem: IP registration of new systems problem Cause: Failed script Affects: Newly registered systems registered yesterday and today Started: 07/08/2008 04:00 PM Resolved: 07/09/2008 02:33 PM
Notes:
The failed script was run by hand. Newly registered systems have had their registrations processed and added into our IP Management system.
Created: 07/09/2008 14:39:40 by dnd
Updates:
Problem Report: Problems accessing Google Apps applications
Problem: Problems accessing Google Apps applications Cause: It appears to be either a firewall or routing issue Affects: People trying to get to Google Mail, Calendar or theportal page. Started: 07/09/2008 08:00 AM Resolved: 07/09/2008 09:21 AM
Notes:
Security data feeds listed the google redirector as a malware site
Our connection to the Google server that redirects the following addresses appears to be unreachable:
webstart.case.edu
webmail.case.edu
webcalendar.case.edu
webdocs.case.edu
sites.case.edu
As a work-around we suggest that you connect to http://partnerpage.google.com/case.edu and jump to the application you want to reach from there.
Network engineers are working on the problem.
We will post updates as they are available.
Created: 07/09/2008 09:05:52 by dak
Updates: 07/09/2008 09:21:59 by lxc152
Problem Report: ERP Student down
Problem: ERP Student down Cause: Failed circuit breaker Affects: ERP Student Started: 07/08/2008 09:05 AM Resolved: 07/08/2008 11:30 AM
Notes:
ETA to repair is at least 1 hour. Replacing the breaker will take more than 3 hours. We are researching other options to restore power.
Databases moved to backup server until power situation is resolved. ERP student back up and running.
Created: 07/08/2008 10:02:30 by bsc4
Updates: 07/08/2008 13:01:56 by man27
Problem Report: Internet Sluggish After Heat Issues
Problem: Internet Sluggish After Heat Issues Cause: ISP Router Reduced Performance Affects: Sluggish Internet Connectivity Started: 07/06/2008 01:24 AM Resolved: 07/06/2008 12:22 PM
Notes:
OneCleveland border router suffered reduced availability and performance due to heat related problems resulting in sluggish responses and loss of BGP session with our edge router. Ticket opened with OneCleveland who worked to revive their router and connection successfully.
Initial BGP problem:
BGP neighbor is 209.130.203.245, remote AS 19009, external link
BGP version 4, remote router ID 0.0.0.0
BGP state = Active
Last read 08:35:49, hold time is 180, keepalive interval is 60 seconds
Message statistics:
InQ depth is 0
OutQ depth is 0
Sent Rcvd
Opens: 4 4
Notifications: 1 0
Updates: 751219 20482172
Keepalives: 415159 415159
Route Refresh: 0 0
Total: 1166383 20897335
Default minimum time between advertisement runs is 30 seconds
For address family: IPv4 Unicast
BGP table version 32504998, neighbor version 0
Index 4, Offset 0, Mask 0x10
4 update-group member
Inbound soft reconfiguration allowed
Inbound path policy configured
Outbound path policy configured
Route map for incoming advertisements is as19009-in
Route map for outgoing advertisements is as19009-out
Sent Rcvd
Prefix activity: ---- ----
Prefixes Current: 0 0
Prefixes Total: 0 0
Implicit Withdraw: 0 0
Explicit Withdraw: 0 0
Used as bestpath: n/a 0
Used as multipath: n/a 0
Outbound Inbound
Local Policy Denied Prefixes: -------- -------
Total: 0 0
Number of NLRIs in the update sent: max 0, min 0
For address family: IPv4 Multicast
BGP table version 2736620, neighbor version 0
Index 1, Offset 0, Mask 0x2
1 update-group member
Community attribute sent to this neighbor
Uses NEXT_HOP attribute for MBGP NLRIs
Sent Rcvd
Prefix activity: ---- ----
Prefixes Current: 0 0
Prefixes Total: 0 0
Implicit Withdraw: 0 0
Explicit Withdraw: 0 0
Used as bestpath: n/a 0
Used as multipath: n/a 0
Outbound Inbound
Local Policy Denied Prefixes: -------- -------
Total: 0 0
Number of NLRIs in the update sent: max 0, min 0
Connections established 4; dropped 4
Last reset 08:37:02, due to BGP Notification sent, hold time expired
No active TCP connection
Created: 07/06/2008 21:52:14 by jxo63
Updates:
Problem Report: Many services in Crawford were unavailable
Problem: Many services in Crawford were unavailable Cause: A/C problems in Crawford Hall combined with routing problems at OneCommunity Affects: Several systems were unreachable or running very slowly Started: 07/06/2008 01:15 AM Resolved: 07/06/2008 05:28 PM
Notes:
One community engineers rebooted and re-configured their premise router in Crawford this afternoon which reestablished normal services
[07/06/08 5:28 PM] - The mail list manager (Sympa) is back up and operational. This was the last service that was unavailable as far as we are aware.
[07/06/08 4:00PM] - We have managed to get docshare back up and running. We have some Server Engineering staff working on Sympa now.
Most services are back up as of 12:30 PM although wiki was unavailable until about 1:30 PM and we are still having problems with Docshare and Sympa.
We will post updates on docshare and sympa as they become available.
Created: 07/06/2008 14:38:07 by dak
Updates: 07/06/2008 16:13:22 by dak, 07/06/2008 17:28:03 by dak, 07/06/2008 21:45:26 by lxc152
Problem Report: Faculty and Staff members are incorrectly listed on
Problem: Faculty and Staff members are incorrectly listed on Cause: Problems with data feed from HR Affects: Faculty and staff Started: 07/03/2008 12:00 PM Resolved: 07/03/2008 06:33 PM
Notes:
We are having a problem with several staff and faculty members who are incorrectly being listed as no longer an employee - on grace period. This is due to a problem in processing our data feed from Human Resources. We are investigating and will send an update once we know exactly what the problem is and have it fixed.
The problem has been rectified.
Created: 07/03/2008 13:27:27 by dak
Updates: 07/03/2008 18:33:28 by jms18
Problem Report: phonesetup.case.edu is unavailable
Problem: phonesetup.case.edu is unavailable Cause: User Authentication issue Affects: Case End Users who are trying to login to phonesetup.case.edu Started: 07/03/2008 12:14 AM Resolved: 07/03/2008 06:56 PM
Notes:
End users received "Login Failed" response when trying to log into http://phonesetup.case.edu.
Engineer is working on resolving the problem.
Created: 07/03/2008 12:16:51 by wxc16
Updates:
Problem Report: Mail to smtp.case.edu being rejected
Problem: Mail to smtp.case.edu being rejected Cause: It appears to be a load balancer or firewall issue Affects: Everyone using mail Started: 07/02/2008 04:00 AM Resolved: 07/02/2008 08:21 AM
Notes:
The problem has been resolved and mail is once again being accepted by smtp.case.edu - the issues appears to have been a problem with some firewall rules.
Each of the mail system machines are individually available, but are not reachable through the smtp.case.edu alias for them. Network Engineering has been called and is looking into the problem.
Created: 07/02/2008 07:47:24 by dak
Updates: 07/02/2008 08:21:10 by dak
Problem Report: Wireless network access in Crawford
Problem: Wireless network access in Crawford Cause: unknown Affects: Crawford Hall wireless network Started: 07/01/2008 02:00 PM Resolved: 07/01/2008 03:00 PM
Notes:
Investigating source of the problem
Problem has been resolved.
Created: 07/01/2008 14:15:29 by man27
Updates: 07/01/2008 15:36:34 by cpr
Problem Report: Leutner-m1-e1 is down
Problem: Leutner-m1-e1 is down Cause: unknow Affects: all network connections - phones, wired and wireless in leutner Started: 07/01/2008 10:06 AM Resolved: 07/01/2008 11:58 AM
Notes:
Construction crew mistakenly cut power to the building and caused one hour outage. This has been restored.
Investigating
Created: 07/01/2008 11:08:42 by roo
Updates: 07/01/2008 12:53:28 by roo
Problem Report: Emergency shutdown of Kusch-m1-e1
Problem: Emergency shutdown of Kusch-m1-e1 Cause: water leak in switch room Affects: Wired, wireless and phones in kush Started: 06/30/2008 02:00 PM Resolved: 06/30/2008 04:00 PM
Notes:
The cause of the leak is the Air conditioner which is currently shutdown. The AC will remain powered down till the leakage is stopped. We will deal with temperature failure when the time comes.
The switch has been shutdown to avoid further water damage to the line cards. Plant services have been notified and the cleanup is going on
Created: 06/30/2008 14:18:38 by roo
Updates: 06/30/2008 16:51:34 by roo
Problem Report: ERP Student and DataWarehouse Database Backup
Problem: ERP Student and DataWarehouse Database Backup Cause: Unable to mount filesystems on backup server Affects: Backup of ERP Student and DataWarehouse Databases Started: 06/19/2008 11:34 AM Resolved: 06/19/2008 09:35 PM
Notes:
Waiting on call back from Veritas.
Fixed. I cleanly unmounted the SAN/NAS filesystems and rebooted the server.
I ran a script to mount only the problem filesystems and they mounted okay. I unmounted the filesystems since it was after the time that the umount script would have ran.
Created: 06/19/2008 18:37:32 by rfw
Updates: 06/19/2008 21:43:12 by rfw
Problem Report: New checkpoint firewall nodes require ports opened to managment server
Problem: New checkpoint firewall nodes require ports opened to managment server Cause: new firewall Affects: KSL data center level 3 context Started: 06/19/2008 04:00 AM Resolved: 06/19/2008 05:39 AM
Notes:
firewall context for ksl level 3 will be pushed again tomorrow to allow for the firewalls to talk to the management system
Created: 06/18/2008 16:55:37 by lxc152
Updates: 06/19/2008 05:38:56 by lxc152
Problem Report: its-services host machine was in a bad state
Problem: its-services host machine was in a bad state Cause: Unknown - appears to have been some problems left from the power shutdown Affects: Software Center, Login Service, Services that use the login service Started: 06/18/2008 07:00 AM Resolved: 06/18/2008 08:00 AM
Notes:
The machine in question was one of the first back up after the power shutdown and it seems to have been getting into a progressively more uncommunicative state since then. The graceful restart of the web server this morning seems to have put it into a state where it would not open and close connections. Attempts to bring it back to a better state by restarting services did not work. We were finally required to actually reboot the machine to get it back into an operational state. We will continue to monitor the system through the day to verify that it is now operating properly.
Created: 06/18/2008 08:14:42 by dak
Updates:
Problem Report: Google Start Page (http://webstart.case.edu) Is Performing Infinite Redirects to http://login.case.edu
Problem: Google Start Page (http://webstart.case.edu) Is Performing Infinite Redirects to http://login.case.edu Cause: Google Is Redirecting Improperly Affects: http://webstart.case.edu Started: 03/21/2008 02:00 PM Resolved:
Notes:
Google has been notified of the problem.
In the meantime, to work around the issue, after getting to the error message in your browser, go back to the URL bar and manually enter "webstart.case.edu" and navigate back to the page.
Created: 03/21/2008 15:27:23 by jms18
Updates:
Problem Report: Case IM gateway to Yahoo Messenger not working
Problem: Case IM gateway to Yahoo Messenger not working Cause: Yahoo changed their protocol Affects: Case IM (Spark client) users using the Yahoo gateway Started: 12/18/2007 12:47 AM Resolved:
Notes:
Yahoo changed something with their Messenger protocol which is preventing the Case IM gateway from working. You can find more information about the problem at
http://www.igniterealtime.org/community/thread/30590?tstart=15
In the meantime, the Yahoo gateway has been disabled until the Openfire IM gateway plugin is upgraded.
Created: 12/18/2007 12:53:07 by sdh7
Updates:
Scheduled Maintenance
Scheduled Maintenance: Network unavailable on 6th & 7th floors of Robbins Building
Problem: Network unavailable on 6th & 7th floors of Robbins Building Cause: 5th floor construction will relocate backbone to SER 6 and SER 7 Affects: All data network, wireless, VoIP, and analog phones on 6th and 7th floors Started: 07/26/2008 06:00 AM Resolved: 07/26/2008 04:00 PM
Notes:
Contractor will pull-back fiber and copper to 4th floor; re-pull through new route; and re-terminate.
Created: 07/18/2008 15:42:42 by dar5
Updates:
Scheduled Maintenance: TIS Network Statistic Web Server will be offline
Problem: TIS Network Statistic Web Server will be offline Cause: Need to physically relocate the server Affects: Network Statistic Webpage Started: 07/18/2008 04:00 PM Resolved: 07/18/2008 05:00 PM
Notes:
Server need to be moved.
Created: 07/17/2008 13:10:34 by rfw
Updates:
Scheduled Maintenance: Final RubyCAS testing on MyCase
Problem: Final RubyCAS testing on MyCase Cause: testing with the new RubyCAS Affects: all users of the MyCase portal Started: 07/17/2008 05:00 AM Resolved: 07/17/2008 05:20 AM
Notes:
Test is complete and was successful.
I will be taking the MyCase portal down to test it against the new RubyCAS implementation for final testing. The service should only be down for 15-20 mins, but leaving an hour just incase.
Created: 07/15/2008 13:43:17 by gsr9
Updates: 07/17/2008 05:27:17 by gsr9
Scheduled Maintenance: Core and Distribution switches IOS Upgrade
Problem: Core and Distribution switches IOS Upgrade Cause: IOS Upgrade Affects: Brief 15 minutes network outage Started: 07/14/2008 03:00 AM Resolved: 07/17/2008 06:00 PM
Notes:
This upgrade includes a 15min reboot of each switch. The switches have been divided into 4 groups to be completed in 4 days only during the maintenance window. See list below:
Monday July 14, 2008 3:00 - 6:00AM
bingham-h0-e2
brb-h0-e2
crawford-h0-e2
fribley-h0-e2
ksl-h0-e2
meds-h0-e2
nrv-h0-e2
pbl-h0-e2
wade-h0-e2
westwing-h0-e2
wrb-h0-e2
Tuesday July 15, 2008 3:00 - 6:00AM
core1
voipgw-crawford-h0-e1
Wednesday July 16, 2008 3:00 - 6:00AM
bingham-h0-e1
brb-h0-e1
crawford-h0-e1
eastwing-h0-e1
fribley-h0-e1
ksl-h0-e1
meds-h0-e1
nrv-h0-e1
pbl-h0-e1
stone-h0-e1
wade-h0-e1
westwing-h0-e1
wrb-h0-e1
Thursday July 17, 2008 3:00 - 6:00AM
core0
voipgw-ksl-h0-e1
Questions, send email to roo@case.edu
Created: 07/10/2008 17:19:35 by roo
Updates:
Scheduled Maintenance: System Board Replacement on UNIX server h-129-22-9-202
Problem: System Board Replacement on UNIX server h-129-22-9-202 Cause: Defective System Board. Affects: No end user affected, development server. Started: 07/10/2008 05:00 PM Resolved: 07/10/2008 07:00 PM
Notes:
System board needs to be replaced to fix and issue with the server randomly rebooting.
SMS will replace system board and Tim Wildow will coordinate the work.
Created: 07/10/2008 16:35:42 by tpw9
Updates:
Scheduled Maintenance: OARnet Planned Maintenance Notification - Ticket 48977
Problem: OARnet Planned Maintenance Notification - Ticket 48977 Cause: OARnet Planned Maintenance Notification - Ticket 48977 Affects: OARnet DNS Servers Started: 07/16/2008 12:00 AM Resolved: 07/18/2008 06:00 AM
Notes:
OARnet Network Operations Center
1-800-627-6420
support@oar.net
Planned Maintenance Notification
Affected Ring or Area: OARnet DNS Servers
Start Date & Time: ns1.oar.net name server Tuesday - July 16th, 2008 @ 12:01 AM
ns2.oar.net name server Thursday - July 18th, 2008 @ 12:01 AM
End Date & Time: ns1.oar.net name server Tuesday - July 16th, 2008 @ 06:00 AM
ns2.oar.net name server Thursday - July 18th, 2008 @ 06:00 AM
Summary of Work to be performed: OARnet will be upgrading its
name servers (ns1.oar.net and ns2.oar.net) to bind version 9.4.3b2. This is to address US-CERT vulnerability warnings and more information can be found at http://www.kb.cert.org/vuls/id/800113 .
This is informational only and no downtime is expected.
Risk Assessment: 0 = No downtime/informational only
OARnet Trouble Ticket Number: 48977
If you have any questions or concerns regarding this planned work, please contact the OARnet NOC and reference the above ticket number.
---------------------------------------------------------------------------------------------------------------------
Network Operations Center
OARnet - Networking Division of OSC
Phone: 1-800-627-6420
Email: support@oar.net
Created: 07/10/2008 10:55:38 by euw
Updates:
Scheduled Maintenance: Upgrade of Single Sign On Service
Problem: Upgrade of Single Sign On Service Cause: Newer more fault-tolerant redundant setup is available Affects: Users of the Single Sign On Service Started: 07/21/2008 05:00 AM Resolved: 07/21/2008 06:00 AM
Notes:
ITS will be deploying a more current version of our Single Sign On (SSO) system that is redundant across ITS computer rooms and provides better load balancing than the previous system during the maintenance window on July 21, 2008.
The new system is still based on Yale's Central Authentication System (CAS), but uses a different code set than the previous system that is easier to maintain and which is still under active maintenance by its developers.
While the base (CAS) system is still the same and it functions nearly identically to the previous version, we are offering an open beta period to our users to allow them to test the new SSO system against their own web sites to verify that the new system works as they expect.
To test the new system against your own SSO-protected web sites, set the authentication to https://sso-dev.case.edu rather than https://login.case.edu. Such testing should be done either on a test page or during maintenance times as the current and new systems do not share login information databases. The beta is open to our users starting immediately. Please direct any questions or problems you encounter to sso-admin@case.edu.
Created: 07/09/2008 10:28:50 by dak
Updates:
Scheduled Maintenance: Memory Errors on Windows Server Kamino
Problem: Memory Errors on Windows Server Kamino Cause: Defective Memory Module. Affects: Work to be done Scheduled outage window Started: 07/10/2008 05:00 AM Resolved: 07/10/2008 06:00 AM
Notes:
DIMM in Bank2_B is reporting single bit errors and it has reached its threshold. Replacement of defective DIMM is necessary.
ITS Windows Engineering staff will replace the memory module.
Work will be done during normal Scheduled outage window of 3AM to 6AM.
Created: 07/08/2008 08:43:20 by tpw9
Updates:
Scheduled Maintenance: the A.R.C. Construction Project
Problem: the A.R.C. Construction Project Cause: an Electrical Shut-Down Affects: All the Departments of the Robbins Building Started: 07/05/2008 07:00 AM Resolved: 07/05/2008 05:00 PM
Notes:
On Saturday July 5, 2008 from 7:00 a.m. until 5:00 p.m. the electrical service for the Robbins Building will be shutdown for work related to the A.R.C. construction project.
The normal electrical power will be off first for approximately 5 hours and then the emergency power for 5 hours.
There will be temporary power available for lab areas upon request to Sharon Callahan by email or phone(368-5908, slc17@case.edu) no later than Wednesday June 25, 2008.
Please shutdown all computers and computer operated equipment during this shutdown time.
Any question concerning this matter will be answered by Dan Davis at ext.6383.
Created: 06/18/2008 13:03:17 by euw
Updates: 07/05/2008 10:44:02 by euw
