Norwegian version of this page

TSD Operational Log - Page 16

Published May 14, 2017 5:24 PM

Due to a jump host failure, TSD was not accessible 09:15 - 14:31 today. The failure has also caused NFS hangs on some Linux VMs, due to which we had to reboot those VMs. All thinlinc VMs now should be accessible. Some other Linux VMs my still have problems. We are working in fixing those ASAP

TSD@USIT

[SOLVED] Problems login to TSD via thinlinc

Published May 10, 2017 9:37 AM

Due to NFS hangs, some Linux hosts are now inaccessible. The issue may result in TSD being inaccessible via thinlinc. This should not affect log in through VMWare Horizon.

We doing our best for this issue to be resolved ASAP

TSD@USIT

---------------------------------------------------

The issue was resolved after the affected hosts have been rebooted

[SOLVED] TSD Maintenance: Tues. 02/05 between 13:00-15:00 CET (two hour)

Published Apr. 25, 2017 4:19 PM

There will be a maintenance stop on the 02/05 at 13:00 CET for one hour. During the downtime, the linux and windows VMs will not be accessible. The Colossus cluster will be under maintenance.

TSD: Login problem on 12/04 at 16:15

Published Apr. 12, 2017 4:31 PM

We are having an issue with the main gateway and the login to TSD in this moment is not possible. We are working to solve the problem as soon as possible.

We apologize for the inconvenience.

----

Update 14/04, 12:15 -- TSD partially operational. Still not possible to submit jobs to Colossus

----

Update 18/04, 09:40 -- Still there might be issues when submitting jobs to Colossus

[Solved]TSD: login problem on Friday 17/03 at 20:00

Published Mar. 6, 2017 2:00 PM

The problem was solved on Friday around 22:00. It might be that some of the linux VM are not yet reachable. In that case please mail us and we will restart the machine.

----

TSD is unreachable at this moment. We are trying to solve the problem as soon as possible. We apologize for the inconvenience.

[Solved] TSD login issues

Published Mar. 6, 2017 2:00 PM

The TSD gateway was not reachable between 15:43 and 16:15 today, the issue has been resolved and users should be able to log in again.

Some linux machines might still not be available, if you are unable to log into ThinLinc please contact us.

[Solved] TSD is not reachable at the moment - Saturday 25/02 at 10:39

Published Mar. 6, 2017 2:00 PM

The problem was solved the same day around 18:10. It might be that some of the linux VM are not yet reachable. In that case please mail us and we will restart the machine.

----------------

TSD's main gateway is not reachable at the moment and the TSD infrastructure is not accessible. We are trying to solve the situation as soon as possible. More information will appear on the operational log on Monday 27/02 in the morning.

We apologize for the inconvenience.

[Sloved] TSD: login problem on Friday 24/02 at 16:11

Published Mar. 6, 2017 2:00 PM

The problem was solved on Friday around 17:15. It might be that some of the linux VM are not yet reachable. In that case please mail us and we will restart the machine.

------------------

TSD is unreachable at this moment. We are trying to solve the problem as soon as possible. We apologize for the inconvenience.

[Solved]TSD: login problems on Friday 10/02 at 15:10 - Solved on 10/02 at 16:00

Published Feb. 10, 2017 3:28 PM

We have had a failure of the primary jumphost but the failover machanism took over and moved the system to the secondary one. The infrastructure shall be back to normal very soon. We are investigating what caused the failure.

We apologize for the inconvenience.

[Solved]Problems with login to TSD at the moment

Published Jan. 30, 2017 9:19 AM

We are having some network problem at the moment and the services in TSD is not reachable. We are working to investigate the causes in order to solve the problem as soon as possible.

We apologize for the inconvenience.

Regards

Nihal@TSD

[SOLVED]TSD Colossus Problem: Services on Colossus may not be accessible at the moment

Published Jan. 26, 2017 9:12 AM

Dear TSD-users!

There is an issue accessing the services on colossus at the moment.

This may result in:

/cluster/projects/pXX being inaccessible
Software modules being inaccessible
Issues submitting to Slurm

We are working on this issue to get it resolved ASAP.

We apologize for the inconvenience.

Regards,

Nihal @TSD

[SOLVED]Problems with access to /cluster/projects/pxx

Published Jan. 25, 2017 9:04 AM

Dear TSD-users!

There is an issue accessing the colossus storage at the moment. This may result in /cluster/projects/pXX being inaccessible.

We are working on this issue to get it resolved ASAP.

We apologize for the inconvenience.

Regards,

Nihal @TSD

[SOLVED] File import to TSD unavailable. (2017-01-23, 10:56)

Published Jan. 23, 2017 1:55 PM

Dear TSD-users!

Due to a disk failure on tsd-fx01.tsd.usit.no which happened 2017-01-20, 22:59, the file import in TSD has been unavailable until 2017-01-23, 10:56.

The issue has been resolved, and the system is back in production.

We apologize for the inconvenience.

Regards,
Benjamin

[SOLVED] TSD Colossus Problem: Colossus Disk not accessible at the moment (12:00 CET 23/12-2017)

Published Dec. 23, 2016 12:06 PM

There is an issue accessing the colossus storage at the moment. This may result in:

/cluster/projects/pXX being inaccessible
Software modules being inaccessible
Issues submitting to Slurm

We are working on this issue to get it resolved ASAP.

We apologize for the inconvenience.

Regards,

Abdulrahman

[SOLVED] TSD Colossus Problem: Colossus Disk not visible at the moment (16:00 CET 19/12-2017)

Published Dec. 20, 2016 10:04 AM

Due the network problem we had yesterday morning, the Colossus disk was not properly mounted and exported to the project machines. This results in problem accessing the /cluster/projects partition, mounting modulefiles and running slurm. Some of the jobs that were running when the network issue occurred might have been affected by problem.

We are on the issue now, hoping to solve it as soon as possible during the day.

We apologize for the inconvenience.

Francesca

[SOLVED] TSD: Network problem at the moment (19/12 at 8:00am)

Published Dec. 19, 2016 8:38 AM

Update: Problem is fixed. You can now login into TSD.

---------------------------------------------------------------------------------------------------

We are having some network problem at the moment and the TSD is not reachable. We are working to investigate the causes in order to solve the problem as soon as possible.

We apologize for the inconvenience.

Regards,

Erik

[SOLVED] TSD Colossus Problem: Cluster not available at the moment (05/12-2016 at 13:00)

Published Dec. 5, 2016 1:12 PM

Due to a failure in the heating system, the Colossus front-end and one of the rack went down this morning (05/12-2016) and the cluster is not available at the moment. We are working to reboot the system at the moment.

Jobs that were running on the rack that went down, unavoidably died while those running on the other rack are most likely running even though the front end is not available.

We apologize for the inconvenience.

TSD nettwork problem: some linux VMs are not accessible at the moment.

Published Dec. 1, 2016 9:43 AM

Dear TSD users,

There has been a network problem, which has been safely bypassed by our failover mechanism. However some of the linux VMs are still mounting the filesystem and therefore are not accessible at the moment. The process will take 2 hours. If after 11:00 today you still experience problem with your linux VM, please let us know (tsd-drift@usit.uio.no).

We are investigating the cause of the network problem.

We apologize for the inconvenience.

Regards,

Francesca

TSD filelock problem is solved.

Published Nov. 14, 2016 3:01 PM

Dear TSD-users,

The service is back to normal now.

We apologize for the inconvenience,

Regards

Nihal @TSD

TSD filelock problem: slow import/export - 14/11-2016 at 12:30 am CET.

Published Nov. 14, 2016 12:26 PM

Dear TSD-users,

we are experiencing problem on the Filelock filesystem. The service is in function but extremely slow. We are investigating the causes, hoping to solve the problem as soon as possible.

We apologize for the inconvenience,

Regards,

Francesca@TSD

TSD Operations: login to TSD not possible for 5 minutes (from 13:30 today, 4 Nov 2016)

Published Nov. 4, 2016 1:20 PM

Dear TSD-users,

we need to reboot the machine that allows to two factor authentication. The reboot will happen at 13:30 today and it will take around 5 minutes. During the reboot it will not be possible to perform any login to TSD. Sessions already opened will remain opened.

We apologize for the inconvenience,

Regards,

Francesca@TSD

Service for password change is not availabel at the moment. (brukerinfo in TSD is down)

Published Nov. 4, 2016 12:53 PM

Dear TSD-user

Users will not be able to change your password via "https://brukerinfo.tsd.usit.no inside TSD at the moment.

We apologize for the inconvenience,

Regards,

Nihal D. Perera

TSD: All the Linux machines in TSD has been rebooted due to a security vulnerability in the RHEL6 kernel.

Published Nov. 4, 2016 10:06 AM

Dear TSD-linux users,

it has been found a security VULNERABILITY on the linux RHEL6 kernel and the linux machines (physical and virtual) in TSD have been rebooted during the night. This was absolutely needed. We are now working to reset those virtual linux client that hasn’t rebooted properly. It will take up to one hour.

Please follow the operations on our operational log.

We apologise for the inconvenience.
Regards,
Francesca@TSD

TSD warning: quota statistics for usage of HNAS disk not available and disk quotas not enforced.

Published Nov. 1, 2016 4:26 PM

Dear TSD-user

We encountered a bug on our file server a couple of weeks ago, and to resolve this issue, quota statistics on the HNAS disk usage had to be reset. It will take up to two weeks until you will be able to see the correct quota statistics again. In the meantime you have the disk available but there is no management engine to enforce quotas on it. To avoid severe incident, we kindly request you, for the next three weeks, to inform us in advance if you need extra disk space (more then 1TiB) and we will try to adjust your request. Please do not import or produce amount of date for more then 1TiB without informing us.

Please notice that this message regards only the usage of HNAS disk, and not the usage of Colossus disk.

We apologize for any inconvenience this may cause you.

Regards,

Francesca@TSD

TSD Maintenance: Service downtime on Tue. 25/10 from 13:00 to 16:00 pm. - FINISHED

Published Oct. 25, 2016 7:33 PM

Dear TSD-users,

the maintenance is finished according to plan today at 25/10 at 16:00. The service is back in production.

Regards,

Francesca

Previous page 11 12 13 14 15 16 17 18 19 20 Next page

Feed from this page