
Emergency Maintenance

Emergency Maintenance: VPN server to be rebooted

Problem:   VPN server to be rebooted
Cause:     Attempt to fix VPN session instability issue
Affects:   Case VPN service
Started:   07/16/2008 12:00 PM
Resolved:  07/16/2008 12:10 PM

Notes:

7/16 15:30 Engineer switched VPN server hardware.

7/16 12:18 VPN server has been power cycled. VPN service has been restored. Engineer will continue to monitor the VPN service stability issue.

VPN server will be power cycled in an attempt to correct the VPN instability issue.


Created: 07/16/2008 09:59:29 by wxc16

Updates: 07/16/2008 12:18:15 by wxc16, 07/16/2008 15:43:48 by wxc16



Problem Report

Problem Report: High temp alarm in KSL data center

Problem:   High temp alarm in KSL data center
Cause:     TBD
Affects:   nobody...yet
Started:   07/17/2008 10:37 AM
Resolved:  

Notes:

Plant services has been contacted


Created: 07/17/2008 11:50:37 by jan3

Updates:


Problem Report: Norton SER 1 is overheating

Problem:   Norton SER 1 is overheating
Cause:     Cooling problem in Norton SER
Affects:   Currently module 7 of network switch
Started:   07/16/2008 06:26 PM
Resolved:  

Notes:

2008 Jul 16 18:26:37 EDT -04:00 %SYS-2-MOD_TEMPMINORFAIL:Module 7 minor temperature threshold exceeded
2008 Jul 16 18:26:37 EDT -04:00 %SNMP-5-ENTITYMODTRAP:Module 7 status changed to "failed(7)"
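
The two switch messages above follow the usual Cisco %FACILITY-SEVERITY-MNEMONIC layout, where the middle digit is the severity (lower is more serious). The following is only a rough sketch of how such lines could be split into fields for triage; the function name, the regular expression, and the assumption that every line carries this exact timestamp format are illustrative and not part of any tool used here.

```python
import re

# Standard Cisco syslog severity levels (0 is the most severe).
SEVERITY_NAMES = {
    0: "emergency", 1: "alert", 2: "critical", 3: "error",
    4: "warning", 5: "notification", 6: "informational", 7: "debugging",
}

# Assumed line layout, taken from the two messages in this report:
#   2008 Jul 16 18:26:37 EDT -04:00 %SYS-2-MOD_TEMPMINORFAIL:text
LINE_RE = re.compile(
    r"^(?P<timestamp>\d{4} \w{3} +\d{1,2} \d{2}:\d{2}:\d{2}) "
    r"(?P<tz>\S+ [+-]\d{2}:\d{2}) "
    r"%(?P<facility>[A-Z0-9_]+)-(?P<severity>[0-7])-(?P<mnemonic>[A-Z0-9_]+):"
    r"(?P<text>.*)$"
)


def parse_syslog_line(line):
    """Return the fields of one message as a dict, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["severity"] = int(fields["severity"])
    fields["severity_name"] = SEVERITY_NAMES[fields["severity"]]
    return fields


sample = ("2008 Jul 16 18:26:37 EDT -04:00 "
          "%SYS-2-MOD_TEMPMINORFAIL:Module 7 minor temperature threshold exceeded")
parsed = parse_syslog_line(sample)
print(parsed["facility"], parsed["severity_name"], parsed["mnemonic"])
```

Read this way, the temperature message is severity 2 (critical) from the SYS facility, while the SNMP trap that follows it is severity 5 (notification).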


Created: 07/17/2008 11:15:28 by roo

Updates:


Problem Report: Bingham hub overheating

Problem:   Bingham hub overheating
Cause:     Cooling problem in Bingham Hub
Affects:   See note
Started:   07/17/2008 12:00 AM
Resolved:  

Notes:

[2008 July 18th Friday 06:00 AM]
The A/C unit wasn't running properly again this morning.
First, its entire electrical circuit was shut down
completely. Second, the circuit was left off for a good
while, with the door to the Bingham Hub open so that the
building A/C could cool and dry the room from outside.
Third, the circuit was turned back on and watched for a
short while to make sure that it stayed up and running.
With the outside humidity already above fifty percent and
the outside temperature predicted to rise into the
nineties again today, just like yesterday, the room's
environmental conditions will need to be watched
throughout the rest of the day.
[2008 July 17th Thursday 09:00 AM]
The Circuit Breaker for the A/C Unit wasn't tripped, but
the A/C Unit needed to be restarted again this morning.
[2008 July 17th Thursday 12:00 AM]
Affects 2 modules on Hub 1 and 1 module on hub 2
bingham-h0-e1
1 0009.11f7.e830 to 0009.11f7.e83f 1.0 7.2(1) 8.5(0.46)RFW MinFail
9 Distributed Forwarding Card WS-F6700-DFC3A SAD074805CH 1.0 MinFail

bingham-h0-e2
9 Distributed Forwarding Card WS-F6700-DFC3A SAD074805N0 1.0 MinFail


Created: 07/17/2008 05:43:34 by roo

Updates: 07/17/2008 09:11:04 by euw, 07/18/2008 07:30:24 by euw


Problem Report: Stone Hub is overheating

Problem:   Stone Hub is overheating
Cause:     no cooling and drying
Affects:   Switch Module Five
Started:   07/16/2008 05:29 AM
Resolved:  

Notes:

[Wed Jul 16 05:29:43 2008]
stone-h0-e1#show environment temperature
   VTT 1 outlet temperature: 34C
   VTT 2 outlet temperature: 39C
   VTT 3 outlet temperature: 47C
   module 1 outlet temperature: 49C
   module 1 inlet temperature: 33C
   module 5 outlet temperature: 42C
   module 5 inlet temperature: 35C
   module 5 device-1 temperature: 47C
   module 5 device-2 temperature: 47C
   module 5 asic-1 (SSO-1) temp: 36C
   module 5 asic-2 (SSO-2) temp: 35C
   module 5 asic-3 (SSO-3) temp: 35C
   module 5 asic-4 (SSO-4) temp: 35C
   module 5 asic-5 (SSA-1) temp: 35C
   module 5 asic-6 (HYPERION-1) temp: 36C
   module 5 RP outlet temperature: 41C
   module 5 RP inlet temperature: 42C
   module 5 EARL outlet temperature: 48C
   module 5 EARL inlet temperature: 31C
stone-h0-e1#
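
For readers who want to pick the hot spots out of output like the above, here is a minimal sketch that parses the "name: NNC" lines and lists the sensors at or above a chosen limit. The 45C limit and the function names are assumptions made for illustration; they are not the switch's own alarm thresholds.

```python
import re

# Matches lines such as "   module 5 EARL outlet temperature: 48C".
READING_RE = re.compile(r"^\s*(?P<sensor>.+?):\s*(?P<celsius>\d+)C\s*$")


def parse_temperatures(output):
    """Map each sensor name to its reading in degrees Celsius."""
    readings = {}
    for line in output.splitlines():
        match = READING_RE.match(line)
        if match:
            readings[match.group("sensor")] = int(match.group("celsius"))
    return readings


def hottest(readings, limit_c):
    """Return (sensor, temperature) pairs at or above limit_c, hottest first."""
    over = [(name, temp) for name, temp in readings.items() if temp >= limit_c]
    return sorted(over, key=lambda pair: -pair[1])


# A few lines copied from the output in this report.
sample = """stone-h0-e1#show environment temperature
   VTT 1 outlet temperature: 34C
   VTT 3 outlet temperature: 47C
   module 1 outlet temperature: 49C
   module 5 EARL outlet temperature: 48C
   module 5 EARL inlet temperature: 31C
stone-h0-e1#"""

for name, temp in hottest(parse_temperatures(sample), limit_c=45):  # 45C is an assumed limit
    print(f"{name}: {temp}C")
```

Prompt lines such as "stone-h0-e1#" are skipped because they do not end in a Celsius reading.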


Created: 07/16/2008 08:04:34 by euw

Updates:


Problem Report: stone-h0-e1-lpbk1 - conn

Problem:   stone-h0-e1-lpbk1 - conn
Cause:     unknown
Affects:   all of the Stone Commons Cisco Area
Started:   07/16/2008 05:29 AM
Resolved:  07/16/2008 07:44 AM

Notes:

[Wed Jul 16 07:44:42 2008]
Remedied.
[Wed Jul 16 05:29:43 2008]
A Network Technician is on his way to investigate right now.


Created: 07/16/2008 07:16:36 by euw

Updates: 07/16/2008 07:52:16 by euw


Problem Report: Cutler SER overheating

Problem:   Cutler SER overheating
Cause:     no cooling
Affects:   Switch module 7
Started:   07/15/2008 05:50 PM
Resolved:  

Notes:

2008 Jul 15 17:48:20 EDT -04:00 %SNMP-5-ENVMONTEMPTRAP:Environmental Monitor Temperature Trap: Module 7 Intake state: warning
2008 Jul 15 17:48:20 EDT -04:00 %SNMP-5-ENVMONTEMPTRAP:Environmental Monitor Temperature Trap: Module 7 Intake state: warning

7 7 96 10/100BaseTX Ethernet WS-X6148X2-RJ-45 yes temp-minor


Created: 07/15/2008 17:52:32 by roo

Updates:


Problem Report: Docshare unavailable

Problem:   Docshare unavailable
Cause:     unknown - the system appears to have hung
Affects:   Docshare users
Started:   07/15/2008 02:45 AM
Resolved:  07/15/2008 08:41 AM

Notes:

[07/15/08 08:40 AM] - The system had lost its virtual connection to disk and needed to be rebooted.

Server engineering is working on the problem. We will post updates as available.


Created: 07/15/2008 07:46:43 by dak

Updates: 07/15/2008 08:41:12 by dak


Problem Report: VPN connectivity issues

Problem:   VPN connectivity issues
Cause:     Unknown
Affects:   At least users using VPN from the wireless network on campus; possibly others
Started:   07/14/2008 10:00 AM
Resolved:  

Notes:

We are receiving reports of connectivity issues with VPN, both when initially connecting and when using the Internet after getting connected. Network Engineering is investigating.


Created: 07/14/2008 11:07:25 by cpr

Updates: 07/14/2008 11:16:20 by cpr


Problem Report: VPN Connectivity Lost (7/11 11:00pm)

Problem:   VPN Connectivity Lost (7/11 11:00pm)
Cause:     Unknown
Affects:   Case VPN Services
Started:   07/11/2008 11:00 PM
Resolved:  07/11/2008 11:15 PM

Notes:

Engineer rebooted the VPN server. The VPN connection looks more stable at this point. Engineer will continue to monitor VPN service status.

Engineer was notified of problem around 10:50pm. Engineer is investigating the issue right now.


Created: 07/11/2008 23:07:00 by wxc16

Updates: 07/11/2008 23:12:13 by wxc16


Problem Report: VPN Clients lost connectivity

Problem:   VPN Clients lost connectivity
Cause:     Bad network cable found
Affects:   Case VPN service
Started:   07/10/2008 02:30 PM
Resolved:  07/10/2008 02:55 PM

Notes:

Bad cable replaced. Service restored.

VPN users are experiencing loss of connectivity to on- and off-campus sites due to a bad network cable. Engineer is currently replacing the bad cable. Service should be restored in 10 minutes.


Created: 07/10/2008 14:55:50 by wxc16

Updates:


Problem Report: VPN Clients lost connectivity

Problem:   VPN Clients lost connectivity
Cause:     Bad network cable found
Affects:   Case VPN service
Started:   07/10/2008 02:30 PM
Resolved:  07/10/2008 02:55 PM

Notes:

Bad cable replaced. Service restored.

VPN users are experiencing loss of connectivity to on- and off-campus sites due to a bad network cable. Engineer is currently replacing the bad cable. Service should be restored in 10 minutes.


Created: 07/10/2008 14:44:18 by wxc16

Updates: 07/10/2008 14:58:03 by wxc16


Problem Report: Yahoo.com issues

Problem:   Yahoo.com issues
Cause:     Security feed included IP space that broke some functionality on the site
Affects:   yahoo.com site
Started:   07/10/2008 11:23 AM
Resolved:  07/10/2008 11:23 AM

Notes:

Security feed included IP space that broke some functionality on the site.


Created: 07/10/2008 11:25:59 by lxc152

Updates:


Problem Report: Legato Networker backup system is down

Problem:   Legato Networker backup system is down
Cause:     corrupted configuration file
Affects:   All Networker backup clients
Started:   07/09/2008 12:01 PM
Resolved:  07/10/2008 01:50 AM

Notes:

Service was restored last night. We expect minor issues over the next couple days as we clear the backlog of disk-to-tape migration & get all of the regularly scheduled backup jobs running again, but operations are essentially back to normal.



During a reconfiguration of the backup system (to make it more resilient to externally-caused failures like this weekend's power & A/C issue), a critical configuration file became corrupted such that the backup system could no longer run at all.

The most critical issue has been resolved, but the system will not be usable until we complete a large amount of reconfiguration work. We expect to have the system functional again by noon tomorrow (Thursday 7/10) for restores, and regularly scheduled backups should begin running Thursday night/Friday morning.


Created: 07/09/2008 22:17:16 by jan3

Updates: 07/10/2008 13:50:12 by jan3


Problem Report: Clients using VPN can't get to sites on and off campus

Problem:   Clients using VPN can't get to sites on and off campus 
Cause:     Unknown
Affects:   Case VPN Users
Started:   07/09/2008 08:38 PM
Resolved:  07/09/2008 08:48 PM

Notes:

VPN clients can't get to resources on or off campus. Engineers are looking into the problem.

Problem was determined to be a bad network cable.


Created: 07/09/2008 20:41:57 by dnd

Updates: 07/09/2008 20:58:19 by dnd


Problem Report: IP registration of new systems problem

Problem:   IP registration of new systems problem
Cause:     Failed script
Affects:   Systems newly registered yesterday and today
Started:   07/08/2008 04:00 PM
Resolved:  07/09/2008 02:33 PM

Notes:

The failed script was run by hand. Newly registered systems have had their registrations processed and added into our IP Management system.


Created: 07/09/2008 14:39:40 by dnd

Updates:


Problem Report: Problems accessing Google Apps applications

Problem:   Problems accessing Google Apps applications
Cause:     It appears to be either a firewall or routing issue
Affects:   People trying to get to Google Mail, Calendar or the portal page.
Started:   07/09/2008 08:00 AM
Resolved:  07/09/2008 09:21 AM

Notes:

Security data feeds listed the Google redirector as a malware site.

The Google server that redirects the following addresses appears to be unreachable from our network:

   webstart.case.edu
   webmail.case.edu
   webcalendar.case.edu
   webdocs.case.edu
   sites.case.edu

As a work-around we suggest that you connect to http://partnerpage.google.com/case.edu and jump to the application you want to reach from there.

Network engineers are working on the problem.

We will post updates as they are available.


Created: 07/09/2008 09:05:52 by dak

Updates: 07/09/2008 09:21:59 by lxc152


Problem Report: ERP Student down

Problem:   ERP Student down
Cause:     Failed circuit breaker
Affects:   ERP Student
Started:   07/08/2008 09:05 AM
Resolved:  07/08/2008 11:30 AM

Notes:

ETA to repair is at least 1 hour. Replacing the breaker will take more than 3 hours. We are researching other options to restore power.

Databases were moved to the backup server until the power situation is resolved. ERP Student is back up and running.


Created: 07/08/2008 10:02:30 by bsc4

Updates: 07/08/2008 13:01:56 by man27


Problem Report: Internet Sluggish After Heat Issues

Problem:   Internet Sluggish After Heat Issues
Cause:     ISP Router Reduced Performance
Affects:   Sluggish Internet Connectivity
Started:   07/06/2008 01:24 AM
Resolved:  07/06/2008 12:22 PM

Notes:

OneCleveland's border router suffered reduced availability and performance due to heat-related problems, resulting in sluggish responses and the loss of the BGP session with our edge router. A ticket was opened with OneCleveland, who successfully revived their router and the connection.
   Initial BGP problem:

BGP neighbor is 209.130.203.245, remote AS 19009, external link
   BGP version 4, remote router ID 0.0.0.0
   BGP state = Active
   Last read 08:35:49, hold time is 180, keepalive interval is 60 seconds
   Message statistics:
   InQ depth is 0
   OutQ depth is 0
                         Sent       Rcvd
   Opens:                   4          4
   Notifications:           1          0
   Updates:            751219   20482172
   Keepalives:         415159     415159
   Route Refresh:           0          0
   Total:             1166383   20897335
   Default minimum time between advertisement runs is 30 seconds

   For address family: IPv4 Unicast
   BGP table version 32504998, neighbor version 0
   Index 4, Offset 0, Mask 0x10
   4 update-group member
   Inbound soft reconfiguration allowed
   Inbound path policy configured
   Outbound path policy configured
   Route map for incoming advertisements is as19009-in
   Route map for outgoing advertisements is as19009-out
                             Sent       Rcvd
   Prefix activity:          ----       ----
   Prefixes Current:            0          0
   Prefixes Total:              0          0
   Implicit Withdraw:           0          0
   Explicit Withdraw:           0          0
   Used as bestpath:          n/a          0
   Used as multipath:         n/a          0

                                    Outbound    Inbound
   Local Policy Denied Prefixes:    --------    -------
   Total:                                  0          0
   Number of NLRIs in the update sent: max 0, min 0

   For address family: IPv4 Multicast
   BGP table version 2736620, neighbor version 0
   Index 1, Offset 0, Mask 0x2
   1 update-group member
   Community attribute sent to this neighbor
   Uses NEXT_HOP attribute for MBGP NLRIs
                             Sent       Rcvd
   Prefix activity:          ----       ----
   Prefixes Current:            0          0
   Prefixes Total:              0          0
   Implicit Withdraw:           0          0
   Explicit Withdraw:           0          0
   Used as bestpath:          n/a          0
   Used as multipath:         n/a          0

                                    Outbound    Inbound
   Local Policy Denied Prefixes:    --------    -------
   Total:                                  0          0
   Number of NLRIs in the update sent: max 0, min 0

   Connections established 4; dropped 4
   Last reset 08:37:02, due to BGP Notification sent, hold time expired
   No active TCP connection
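
In the output above, the telling fields are "BGP state = Active" (the router is still trying to open the session; only "Established" means the peering is up) and the "Last reset" reason. The sketch below shows one way those fields could be pulled out of such text. The function name and the choice of fields are assumptions made for illustration, and the patterns are only known to fit the output shown in this report.

```python
import re


def summarize_bgp_neighbor(output):
    """Extract a few key fields from neighbor detail text like the output above."""
    patterns = {
        "neighbor": r"BGP neighbor is (\S+?),",
        "remote_as": r"remote AS (\d+)",
        "state": r"BGP state = (\w+)",
        "last_reset": r"Last reset [\d:]+, due to (.+)",
    }
    summary = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, output)
        summary[key] = match.group(1).strip() if match else None
    # Only the Established state means the session is actually up.
    summary["session_up"] = summary["state"] == "Established"
    return summary


# Lines copied from the output in this report.
sample = """BGP neighbor is 209.130.203.245, remote AS 19009, external link
   BGP version 4, remote router ID 0.0.0.0
   BGP state = Active
   Connections established 4; dropped 4
   Last reset 08:37:02, due to BGP Notification sent, hold time expired
   No active TCP connection"""

info = summarize_bgp_neighbor(sample)
print(info["neighbor"], info["state"], info["session_up"])
print("last reset:", info["last_reset"])
```

The reset reason, hold time expired, means our router stopped hearing keepalives from the neighbor within the 180-second hold time, which fits a struggling router on the far end.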


Created: 07/06/2008 21:52:14 by jxo63

Updates:


Problem Report: Many services in Crawford were unavailable

Problem:   Many services in Crawford were unavailable
Cause:     A/C problems in Crawford Hall combined with routing problems at OneCommunity
Affects:   Several systems were unreachable or running very slowly
Started:   07/06/2008 01:15 AM
Resolved:  07/06/2008 05:28 PM

Notes:

OneCommunity engineers rebooted and reconfigured their premises router in Crawford this afternoon, which reestablished normal services.


[07/06/08 5:28 PM] - The mail list manager (Sympa) is back up and operational. This was the last service that was unavailable as far as we are aware.

[07/06/08 4:00PM] - We have managed to get docshare back up and running. We have some Server Engineering staff working on Sympa now.

Most services are back up as of 12:30 PM although wiki was unavailable until about 1:30 PM and we are still having problems with Docshare and Sympa.

We will post updates on docshare and sympa as they become available.


Created: 07/06/2008 14:38:07 by dak

Updates: 07/06/2008 16:13:22 by dak, 07/06/2008 17:28:03 by dak, 07/06/2008 21:45:26 by lxc152


Problem Report: Faculty and Staff members are incorrectly listed on

Problem:   Faculty and Staff members are incorrectly listed on 
Cause:     Problems with data feed from HR
Affects:   Faculty and staff
Started:   07/03/2008 12:00 PM
Resolved:  07/03/2008 06:33 PM

Notes:

We are having a problem with several staff and faculty members who are incorrectly being listed as no longer an employee - on grace period. This is due to a problem in processing our data feed from Human Resources. We are investigating and will send an update once we know exactly what the problem is and have it fixed.

The problem has been rectified.


Created: 07/03/2008 13:27:27 by dak

Updates: 07/03/2008 18:33:28 by jms18


Problem Report: phonesetup.case.edu is unavailable

Problem:   phonesetup.case.edu is unavailable
Cause:     User Authentication issue
Affects:   Case End Users who are trying to login to phonesetup.case.edu
Started:   07/03/2008 12:14 AM
Resolved:  07/03/2008 06:56 PM

Notes:

End users received "Login Failed" response when trying to log into http://phonesetup.case.edu.

Engineer is working on resolving the problem.


Created: 07/03/2008 12:16:51 by wxc16

Updates:


Problem Report: Mail to smtp.case.edu being rejected

Problem:   Mail to smtp.case.edu being rejected
Cause:     It appears to be a load balancer or firewall issue 
Affects:   Everyone using mail
Started:   07/02/2008 04:00 AM
Resolved:  07/02/2008 08:21 AM

Notes:

The problem has been resolved and mail is once again being accepted by smtp.case.edu - the issue appears to have been a problem with some firewall rules.

Each of the mail system machines is individually available, but they are not reachable through the smtp.case.edu alias. Network Engineering has been called and is looking into the problem.


Created: 07/02/2008 07:47:24 by dak

Updates: 07/02/2008 08:21:10 by dak


Problem Report: Wireless network access in Crawford

Problem:   Wireless network access in Crawford
Cause:     unknown
Affects:   Crawford Hall wireless network
Started:   07/01/2008 02:00 PM
Resolved:  07/01/2008 03:00 PM

Notes:

Investigating source of the problem

Problem has been resolved.


Created: 07/01/2008 14:15:29 by man27

Updates: 07/01/2008 15:36:34 by cpr


Problem Report: Leutner-m1-e1 is down

Problem:   Leutner-m1-e1 is down
Cause:     unknown
Affects:   all network connections - phones, wired and wireless in Leutner
Started:   07/01/2008 10:06 AM
Resolved:  07/01/2008 11:58 AM

Notes:

Construction crew mistakenly cut power to the building and caused a one-hour outage. Power has been restored.

Investigating


Created: 07/01/2008 11:08:42 by roo

Updates: 07/01/2008 12:53:28 by roo


Problem Report: Emergency shutdown of Kusch-m1-e1

Problem:   Emergency shutdown of Kusch-m1-e1
Cause:     water leak in switch room
Affects:   Wired, wireless and phones in Kusch
Started:   06/30/2008 02:00 PM
Resolved:  06/30/2008 04:00 PM

Notes:

The cause of the leak is the air conditioner, which is currently shut down. The A/C will remain powered down until the leak is stopped. We will deal with temperature failures when the time comes.

The switch has been shut down to avoid further water damage to the line cards. Plant Services has been notified and the cleanup is under way.


Created: 06/30/2008 14:18:38 by roo

Updates: 06/30/2008 16:51:34 by roo


Problem Report: ERP Student and DataWarehouse Database Backup

Problem:   ERP Student and DataWarehouse Database Backup
Cause:     Unable to mount filesystems on backup server
Affects:   Backup of ERP Student and DataWarehouse Databases
Started:   06/19/2008 11:34 AM
Resolved:  06/19/2008 09:35 PM

Notes:

Waiting on call back from Veritas.

Fixed. I cleanly unmounted the SAN/NAS filesystems and rebooted the server.

I ran a script to mount only the problem filesystems and they mounted okay. I unmounted the filesystems since it was after the time that the umount script would have run.


Created: 06/19/2008 18:37:32 by rfw

Updates: 06/19/2008 21:43:12 by rfw


Problem Report: New Check Point firewall nodes require ports opened to management server

Problem:   New Check Point firewall nodes require ports opened to management server
Cause:     new firewall
Affects:   KSL data center level 3 context
Started:   06/19/2008 04:00 AM
Resolved:  06/19/2008 05:39 AM

Notes:

The firewall context for KSL level 3 will be pushed again tomorrow to allow the firewalls to talk to the management system.


Created: 06/18/2008 16:55:37 by lxc152

Updates: 06/19/2008 05:38:56 by lxc152


Problem Report: its-services host machine was in a bad state

Problem:   its-services host machine was in a bad state
Cause:     Unknown - appears to have been some problems left from the power shutdown
Affects:   Software Center, Login Service, Services that use the login service
Started:   06/18/2008 07:00 AM
Resolved:  06/18/2008 08:00 AM

Notes:

The machine in question was one of the first back up after the power shutdown and it seems to have been getting into a progressively more uncommunicative state since then. The graceful restart of the web server this morning seems to have put it into a state where it would not open and close connections. Attempts to bring it back to a better state by restarting services did not work. We were finally required to actually reboot the machine to get it back into an operational state. We will continue to monitor the system through the day to verify that it is now operating properly.


Created: 06/18/2008 08:14:42 by dak

Updates:


Problem Report: Google Start Page (http://webstart.case.edu) Is Performing Infinite Redirects to http://login.case.edu

Problem:   Google Start Page (http://webstart.case.edu) Is Performing Infinite Redirects to http://login.case.edu
Cause:     Google Is Redirecting Improperly
Affects:   http://webstart.case.edu
Started:   03/21/2008 02:00 PM
Resolved:  

Notes:

Google has been notified of the problem.

In the meantime, to work around the issue, after getting to the error message in your browser, go back to the URL bar and manually enter "webstart.case.edu" and navigate back to the page.


Created: 03/21/2008 15:27:23 by jms18

Updates:


Problem Report: Case IM gateway to Yahoo Messenger not working

Problem:   Case IM gateway to Yahoo Messenger not working
Cause:     Yahoo changed their protocol
Affects:   Case IM (Spark client) users using the Yahoo gateway
Started:   12/18/2007 12:47 AM
Resolved:  

Notes:

Yahoo changed something with their Messenger protocol which is preventing the Case IM gateway from working. You can find more information about the problem at
http://www.igniterealtime.org/community/thread/30590?tstart=15

In the meantime, the Yahoo gateway has been disabled until the Openfire IM gateway plugin is upgraded.


Created: 12/18/2007 12:53:07 by sdh7

Updates:



Scheduled Maintenance

Scheduled Maintenance: Network unavailable on 6th & 7th floors of Robbins Building

Problem:   Network unavailable on 6th & 7th floors of Robbins Building 
Cause:     5th floor construction will relocate backbone to SER 6 and SER 7
Affects:   All data network, wireless, VoIP, and analog phones on 6th and 7th floors
Started:   07/26/2008 06:00 AM
Resolved:  07/26/2008 04:00 PM

Notes:

Contractor will pull back fiber and copper to the 4th floor, re-pull it through the new route, and re-terminate.


Created: 07/18/2008 15:42:42 by dar5

Updates:


Scheduled Maintenance: TIS Network Statistic Web Server will be offline

Problem:   TIS Network Statistic Web Server will be offline
Cause:     Need to physically relocate the server
Affects:   Network Statistic Webpage
Started:   07/18/2008 04:00 PM
Resolved:  07/18/2008 05:00 PM

Notes:

Server needs to be moved.


Created: 07/17/2008 13:10:34 by rfw

Updates:


Scheduled Maintenance: Final RubyCAS testing on MyCase

Problem:   Final RubyCAS testing on MyCase
Cause:     testing with the new RubyCAS
Affects:   all users of the MyCase portal
Started:   07/17/2008 05:00 AM
Resolved:  07/17/2008 05:20 AM

Notes:

Test is complete and was successful.

I will be taking the MyCase portal down for final testing against the new RubyCAS implementation. The service should only be down for 15-20 minutes, but I am leaving an hour just in case.


Created: 07/15/2008 13:43:17 by gsr9

Updates: 07/17/2008 05:27:17 by gsr9


Scheduled Maintenance: Core and Distribution switches IOS Upgrade

Problem:   Core and Distribution switches IOS Upgrade
Cause:     IOS Upgrade
Affects:   Brief 15-minute network outage
Started:   07/14/2008 03:00 AM
Resolved:  07/17/2008 06:00 PM

Notes:

This upgrade includes a 15-minute reboot of each switch. The switches have been divided into 4 groups, to be completed over 4 days and only during the maintenance window. See list below:

Monday July 14, 2008 3:00 - 6:00AM
bingham-h0-e2
brb-h0-e2
crawford-h0-e2
fribley-h0-e2
ksl-h0-e2
meds-h0-e2
nrv-h0-e2
pbl-h0-e2
wade-h0-e2
westwing-h0-e2
wrb-h0-e2

Tuesday July 15, 2008 3:00 - 6:00AM
core1
voipgw-crawford-h0-e1

Wednesday July 16, 2008 3:00 - 6:00AM
bingham-h0-e1
brb-h0-e1
crawford-h0-e1
eastwing-h0-e1
fribley-h0-e1
ksl-h0-e1
meds-h0-e1
nrv-h0-e1
pbl-h0-e1
stone-h0-e1
wade-h0-e1
westwing-h0-e1
wrb-h0-e1

Thursday July 17, 2008 3:00 - 6:00AM
core0
voipgw-ksl-h0-e1

Questions? Send email to roo@case.edu


Created: 07/10/2008 17:19:35 by roo

Updates:


Scheduled Maintenance: System Board Replacement on UNIX server h-129-22-9-202

Problem:   System Board Replacement on UNIX server h-129-22-9-202
Cause:     Defective System Board.
Affects:   No end user affected, development server. 
Started:   07/10/2008 05:00 PM
Resolved:  07/10/2008 07:00 PM

Notes:

System board needs to be replaced to fix an issue with the server randomly rebooting.

SMS will replace system board and Tim Wildow will coordinate the work.


Created: 07/10/2008 16:35:42 by tpw9

Updates:


Scheduled Maintenance: OARnet Planned Maintenance Notification - Ticket 48977

Problem:   OARnet Planned Maintenance Notification - Ticket 48977
Cause:     OARnet Planned Maintenance Notification - Ticket 48977
Affects:   OARnet DNS Servers
Started:   07/16/2008 12:00 AM
Resolved:  07/18/2008 06:00 AM

Notes:



OARnet Network Operations Center
1-800-627-6420
support@oar.net

Planned Maintenance Notification

Affected Ring or Area: OARnet DNS Servers

Start Date & Time:
   ns1.oar.net name server Tuesday - July 16th, 2008 @ 12:01 AM
   ns2.oar.net name server Thursday - July 18th, 2008 @ 12:01 AM

End Date & Time:
   ns1.oar.net name server Tuesday - July 16th, 2008 @ 06:00 AM
   ns2.oar.net name server Thursday - July 18th, 2008 @ 06:00 AM

Summary of Work to be performed: OARnet will be upgrading its name servers (ns1.oar.net and ns2.oar.net) to bind version 9.4.3b2. This is to address US-CERT vulnerability warnings and more information can be found at http://www.kb.cert.org/vuls/id/800113 .

This is informational only and no downtime is expected.

Risk Assessment: 0 = No downtime/informational only

OARnet Trouble Ticket Number: 48977

If you have any questions or concerns regarding this planned work, please contact the OARnet NOC and reference the above ticket number.

---------------------------------------------------------------------------------------------------------------------


Network Operations Center
OARnet - Networking Division of OSC
Phone: 1-800-627-6420
Email: support@oar.net


Created: 07/10/2008 10:55:38 by euw

Updates:


Scheduled Maintenance: Upgrade of Single Sign On Service

Problem:   Upgrade of Single Sign On Service
Cause:     Newer more fault-tolerant redundant setup is available
Affects:   Users of the Single Sign On Service
Started:   07/21/2008 05:00 AM
Resolved:  07/21/2008 06:00 AM

Notes:

During the maintenance window on July 21, 2008, ITS will deploy a more current version of our Single Sign On (SSO) system that is redundant across ITS computer rooms and provides better load balancing than the previous system.

The new system is still based on Yale's Central Authentication Service (CAS), but it uses a different code base that is easier to maintain and is still under active maintenance by its developers.

While the base (CAS) system is still the same and it functions nearly identically to the previous version, we are offering an open beta period to our users to allow them to test the new SSO system against their own web sites to verify that the new system works as they expect.

To test the new system against your own SSO-protected web sites, set the authentication to https://sso-dev.case.edu rather than https://login.case.edu. Such testing should be done either on a test page or during maintenance times as the current and new systems do not share login information databases. The beta is open to our users starting immediately. Please direct any questions or problems you encounter to sso-admin@case.edu.
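
As a rough illustration of what pointing a site at the beta server involves: in the CAS protocol, a protected site sends the browser to the server's login endpoint with the site's own URL in a `service` parameter. The sketch below builds that URL for either server. The `/login` path directly under the host and the example application URL are assumptions; check your CAS client's configuration for the actual path used here.

```python
from urllib.parse import urlencode


def cas_login_url(cas_base, service_url):
    """Build the CAS login URL that returns the user to service_url after sign-in."""
    # Assumes the login endpoint sits at <cas_base>/login.
    return cas_base.rstrip("/") + "/login?" + urlencode({"service": service_url})


# Hypothetical SSO-protected test page.
test_page = "https://example.case.edu/app"

print(cas_login_url("https://login.case.edu", test_page))    # current system
print(cas_login_url("https://sso-dev.case.edu", test_page))  # open beta system
```

Because the current and new systems do not share login information, a session opened against one server will not be recognized by the other.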


Created: 07/09/2008 10:28:50 by dak

Updates:


Scheduled Maintenance: Memory Errors on Windows Server Kamino

Problem:   Memory Errors on Windows Server Kamino 
Cause:     Defective Memory Module. 
Affects:   Work to be done during scheduled outage window
Started:   07/10/2008 05:00 AM
Resolved:  07/10/2008 06:00 AM

Notes:

DIMM in Bank2_B is reporting single bit errors and it has reached its threshold. Replacement of defective DIMM is necessary.

ITS Windows Engineering staff will replace the memory module.

Work will be done during the normal scheduled outage window of 3 AM to 6 AM.


Created: 07/08/2008 08:43:20 by tpw9

Updates:


Scheduled Maintenance: the A.R.C. Construction Project

Problem:   the A.R.C. Construction Project
Cause:     an Electrical Shut-Down
Affects:   All the Departments of the Robbins Building
Started:   07/05/2008 07:00 AM
Resolved:  07/05/2008 05:00 PM

Notes:

On Saturday July 5, 2008 from 7:00 a.m. until 5:00 p.m. the electrical service for the Robbins Building will be shut down for work related to the A.R.C. construction project.
The normal electrical power will be off first for approximately 5 hours, followed by the emergency power for 5 hours.
Temporary power will be available for lab areas upon request to Sharon Callahan by phone or email (368-5908, slc17@case.edu) no later than Wednesday June 25, 2008.
Please shut down all computers and computer-operated equipment during this shutdown time.

Any questions concerning this matter will be answered by Dan Davis at ext. 6383.


Created: 06/18/2008 13:03:17 by euw

Updates: 07/05/2008 10:44:02 by euw

