Ruggieri, Timothy writes
I went over to UITS and took a look at Nebka. As you suspected, there seems to have been a filesystem corruption problem of some kind. The console was full of EXT3 errors. I shut down the computer and forced a complete fsck on restart. After the disk check, Nebka seems to be working again. I took the liberty of creating an account for myself so I could log in remotely via SSH. SSH seems to be working, along with Apache, although when you connect to nebka.uconn.edu via the web you get the Apache startup page. I don't know anything about what services Nebka is offering, so I have not checked much further.
Please try to connect to Nebka and see what, if anything, is still broken.
The disk has crashed again. I have no backup for the crucial data in /home/aras. The most important directories there are /home/aras/acis/userdata and /home/aras/acis/backup.
If the machine is kept running, that data may go away. If you guys don't have a local backup, we would be in a severe fix.
If someone can make it there as soon as possible, take a backup of /home/aras, and put in a new disk with a basic Debian OS on it, I can then work on restoring service. The disk that we have there now is a goner.
Cheers,
Thomas Krichel http://openlib.org/home/krichel RePEc:per:1965-06-05:thomas_krichel phone: +7 383 330 6813 skype: thomaskrichel
UConn has no backup under the understanding that you were backing up.
Christian Zimmermann FIGUGEGL! Department of Economics University of Connecticut 341 Mansfield Road, Unit 1063 Storrs, CT 06269-1063 http://ideas.repec.org/zimm/ christian.zimmermann@uconn.edu http://ideas.repec.org/e/pzi1.html
On Sat, 19 Jan 2008, Thomas Krichel wrote:
The disk has crashed again. I have no backup for the crucial data in /home/aras. The most important directories there are /home/aras/acis/userdata and /home/aras/acis/backup.
If the machine is kept running, that data may go away. If you guys don't have a local backup, we would be in a severe fix.
If someone can make it there as soon as possible, take a backup of /home/aras, and put in a new disk with a basic Debian OS on it, I can then work on restoring service. The disk that we have there now is a goner.
Cheers,
Thomas Krichel http://openlib.org/home/krichel RePEc:per:1965-06-05:thomas_krichel phone: +7 383 330 6813 skype: thomaskrichel
Christian Zimmermann writes
UConn has no backup under the understanding that you were backing up.
This is extremely bad news. Because of the problems that I had with fafner and raneb, backup has been shaky in the aftermath. I have no backup of /home/aras.
This is putting us into an extremely difficult situation. We need to stop the machine asap, run a disk check, and recover whatever we can from /home/aras onto another machine.
Let me know if there is anything I could do to help. I started running an rsync backup as soon as the machine was back up, checking against my local records. But it did not get to /home/aras before the machine crashed. This disk is toast.
We need more help with backup and system administration.
Cheers,
Thomas Krichel http://openlib.org/home/krichel RePEc:per:1965-06-05:thomas_krichel phone: +7 383 330 6813 skype: thomaskrichel
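The "recover whatever we can" step could be sketched roughly as follows. This is only an illustration in Python, not what was actually run on nebka: the function name `salvage_tree` and any destination path are made up for the example. The idea is to copy every file that can still be read, skip the ones that raise I/O errors on the damaged filesystem, and report what was lost, rather than abort on the first bad file.

```python
import os
import shutil
import sys
from typing import List


def salvage_tree(src: str, dst: str) -> List[str]:
    """Copy every readable file under src into dst.

    Files or directories that raise an OS-level error are skipped.
    Returns the list of source paths that could not be copied.
    """
    failed: List[str] = []

    def note_dir_error(err: OSError) -> None:
        # os.walk calls this when a directory itself cannot be listed.
        failed.append(err.filename or src)

    for dirpath, _dirnames, filenames in os.walk(src, onerror=note_dir_error):
        rel = os.path.relpath(dirpath, src)
        target_dir = os.path.normpath(os.path.join(dst, rel))
        os.makedirs(target_dir, exist_ok=True)
        for name in filenames:
            source = os.path.join(dirpath, name)
            try:
                # copy2 also tries to preserve timestamps and permissions.
                shutil.copy2(source, os.path.join(target_dir, name))
            except OSError:
                failed.append(source)
    return failed


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage (hypothetical paths): python salvage.py SOURCE_DIR DEST_DIR
    for path in salvage_tree(sys.argv[1], sys.argv[2]):
        print("could not copy:", path)
```

Under these assumptions it would make sense to run it first on /home/aras/acis/userdata and /home/aras/acis/backup, the two directories named as most important, and only then on the rest of /home/aras, with the destination on a different disk or machine.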
I am not sure the disk is toast. We had problems previously that were solved with a better directory structure. Maybe this is the same problem again. The CitEc AMF data has grown a lot recently; if some directory has a lot of files, this would replicate the earlier problem.
Christian Zimmermann FIGUGEGL! Department of Economics University of Connecticut 341 Mansfield Road, Unit 1063 Storrs, CT 06269-1063 http://ideas.repec.org/zimm/ christian.zimmermann@uconn.edu http://ideas.repec.org/e/pzi1.html
On Sat, 19 Jan 2008, Thomas Krichel wrote:
This is extremely bad news. Because of the problems that I had with fafner and raneb, backup has been shaky in the aftermath. I have no backup of /home/aras.
This is putting us into an extremely difficult situation. We need to stop the machine asap, run a disk check, and recover whatever we can from /home/aras onto another machine.
Let me know if there is anything I could do to help. I started running an rsync backup as soon as the machine was back up, checking against my local records. But it did not get to /home/aras before the machine crashed. This disk is toast.
We need more help with backup and system administration.
Cheers,
Thomas Krichel http://openlib.org/home/krichel RePEc:per:1965-06-05:thomas_krichel phone: +7 383 330 6813 skype: thomaskrichel
Christian Zimmermann writes
I am not sure the disk is toast. We had problems previously that were solved with a better directory structure. Maybe this is the same problem again. The CitEc AMF data has grown a lot recently; if some directory has a lot of files, this would replicate the earlier problem.
I hope, pray, and cry out that you are right. If that is the case, we have to recover to a local backup first. We cannot use rsync to back up if this is where the problem is.
Last time the problems came from JMBC storing tons of files in mutabor. This could also cause my backup to hang here. JMBC, please check locally.
I have stopped the crontab entry on mutabor that delivers the CitEc data:
adnetec@mutabor:~$ crontab -l
MAILTO=kurmanov@openlib.org
#
###33 03 * * * rsync -qa --delete adnetec@all.repec.org:/home/adrepec/RePEc/remo/ /home/adnetec/RePEc/remo
20 * * * * /home/adnetec/CitEc/bin/check_mysql >> /home/adnetec/var/log/check_mysql.log
#
#
# mirror RAS' cit.events exports
#
# old: 15 8 * * * rsync -t aras@nebka.openlib.org:citec-export/* /home/adnetec/ras-exports/
#15 8 * * * /home/adnetec/etc/ras-exports-cronjob.sh
#
# weekly update of files: iscited and hasreferences
#
10 10 * * sun /home/adnetec/CitEc/bin/weekly
Cheers,
Thomas Krichel http://openlib.org/home/krichel RePEc:per:1965-06-05:thomas_krichel phone: +7 383 330 6813 skype: thomaskrichel
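For the local check, something like the following Python sketch could help find directories that hold an unusually large number of entries, which is the pattern suspected above. The function name `crowded_dirs` and the default threshold of 10000 are assumptions made for this illustration; nobody in the thread has said what count actually triggers the problem.

```python
import os
import sys
from typing import List, Tuple


def crowded_dirs(root: str, threshold: int = 10000) -> List[Tuple[str, int]]:
    """Return (directory, entry_count) pairs for every directory under
    root that holds at least `threshold` entries, largest first.

    Directories that cannot be listed are silently skipped, which is
    the default behaviour of os.walk.
    """
    found: List[Tuple[str, int]] = []
    for dirpath, dirnames, filenames in os.walk(root):
        count = len(dirnames) + len(filenames)
        if count >= threshold:
            found.append((dirpath, count))
    found.sort(key=lambda pair: pair[1], reverse=True)
    return found


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python crowded_dirs.py DIRECTORY
    for path, count in crowded_dirs(sys.argv[1]):
        print(f"{count:>10}  {path}")
```

Running it over the local copy of the CitEc export area would show whether any single directory has grown out of proportion, without touching the failing disk on nebka.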