  Subject: Re: finding out what went wrong with linux server

Posted on 08-12-2003 02:13 a.m. ET

Original Poster: Mark Krentel

> My Linux server was up and running non-stop for a few months and then
> seemed to be non-responsive all of a sudden - no file sharing, no print
> server, no SSH.

There's not much to go on here. A hang or freeze is hard to diagnose
after the fact unless you already had system monitoring scripts running
before it happened.

It could be an overloaded machine. Maybe one of the server processes
has a memory leak and keeps growing until it consumes all available
memory and freezes the machine. Samba works at the level of the file
system, so an inconsistency between server and client may cause a crash.
(I don't run Samba, so I apologize if I'm spreading Samba FUD.)
And you can never rule out hardware.
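
To test the memory-leak theory once the machine is back up, something
like the rough sketch below can list the processes with the largest
resident memory. Run it every so often and see whether one process keeps
growing. It assumes a Linux /proc file system that reports a VmRSS line
in /proc/<pid>/status; the function names are my own, not part of any
standard tool.

```python
import os
import re


def parse_vmrss(status_text):
    """Return resident memory in kB from the text of /proc/<pid>/status, or None."""
    match = re.search(r"^VmRSS:\s+(\d+)\s+kB", status_text, re.MULTILINE)
    return int(match.group(1)) if match else None


def parse_name(status_text):
    """Return the process name from the text of /proc/<pid>/status."""
    match = re.search(r"^Name:\s+(.*)$", status_text, re.MULTILINE)
    return match.group(1).strip() if match else "?"


def top_memory_processes(proc_root="/proc", count=5):
    """Return up to `count` tuples (rss_kb, pid, name), largest first."""
    results = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # not a process directory
        try:
            with open(os.path.join(proc_root, entry, "status")) as f:
                text = f.read()
        except OSError:
            continue  # process exited while we were looking
        rss = parse_vmrss(text)
        if rss is not None:  # kernel threads report no VmRSS
            results.append((rss, int(entry), parse_name(text)))
    results.sort(reverse=True)
    return results[:count]
```

Calling top_memory_processes() and printing the result from cron every
few minutes gives a crude growth record for each daemon.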

System logs are kept in /var/log (hopefully you already know that).
Maybe one of the logs gives a clue, but probably not. Try the "sar"
program (see "man sar") and the /var/log/sa logs; they might tell you
how busy the machine was in the hours before it froze.
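
As a starting point for reading the logs, here is a minimal sketch that
pulls out the lines of a log file containing any of a few suspicious
keywords. The keyword list is a guess on my part and far from
exhaustive, and the log file name varies by distribution, so adjust
both.

```python
# Assumed keywords; extend the tuple to suit your system.
KEYWORDS = ("out of memory", "oops", "panic", "i/o error")


def suspicious_lines(lines, keywords=KEYWORDS):
    """Return the lines that contain any keyword, ignoring case."""
    hits = []
    for line in lines:
        lowered = line.lower()
        if any(keyword in lowered for keyword in keywords):
            hits.append(line.rstrip("\n"))
    return hits


def scan_log(path, keywords=KEYWORDS):
    """Scan one log file and return its suspicious lines."""
    with open(path, errors="replace") as f:
        return suspicious_lines(f, keywords)
```

For example, scan_log("/var/log/messages") would return the matching
lines from that file, assuming your syslog writes there.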

Basically, reboot and don't worry about it too much until it happens a
second time. Then write some scripts to monitor system activity and
try to track down the problem.
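
A monitoring script of that kind could be as simple as the sketch below,
which appends the one-minute load average and free memory to a file at a
fixed interval. It assumes /proc/loadavg and /proc/meminfo exist and
that meminfo has MemFree and SwapFree lines; the function names, log
format, and default interval are all my own invention.

```python
import time


def format_snapshot(loadavg_text, meminfo_text, timestamp):
    """Build one log line from the raw text of /proc/loadavg and /proc/meminfo."""
    load1 = loadavg_text.split()[0]  # first field is the 1-minute load average
    fields = {}
    for line in meminfo_text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[key.strip()] = parts[0]
    return "%s load1=%s MemFree=%skB SwapFree=%skB" % (
        timestamp,
        load1,
        fields.get("MemFree", "?"),
        fields.get("SwapFree", "?"),
    )


def log_snapshots(log_path, interval_seconds=60, count=None):
    """Append a snapshot every interval; run forever when count is None."""
    taken = 0
    while count is None or taken < count:
        with open("/proc/loadavg") as f:
            loadavg = f.read()
        with open("/proc/meminfo") as f:
            meminfo = f.read()
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        # Reopen and close the file each time so every line is handed to
        # the kernel before the next sleep.
        with open(log_path, "a") as out:
            out.write(format_snapshot(loadavg, meminfo, stamp) + "\n")
        taken += 1
        if count is None or taken < count:
            time.sleep(interval_seconds)
```

After the next freeze, the last few lines of that file show whether load
was climbing or memory was running out just before the machine stopped
responding.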


