Hola tod@s,
A donde un cliente instalamos un servidor OpenSuse 11.1 con Samba,
para que comparten los archivos de manera centralizada, y no guardan
nada en sus PC.
Parte de la red esta inhalambrica.
En ciertos momento, el proceso smbd ocupa 100% del CPU, y deja el
servidor suuuuuuper lento, o indisponible.
Lo raro es que se hace imposible matar al proceso que ocupa el CPU. Ni
un kill -9 del proceso lo mata :-(((
Solo cortando la luz permite reiniciar el servidor.
Cambiamos la versión de de la versión 3.3.0 a la versión 3.2.6,
esperando que resuelva el problema. Nada, igual en algun momento
(varios minutos o horas) regresa el problema.
Sospechamos que algun(os?) usuario cierra mal su conneccion (o pierde
un intante la conexion wifi (?), y eso vuelve el Samba loco.
En Internet, no encontré respuesta satisfactoria a mi problema.
Alguien ya tuve este problema y lo pudo resolver?
smb.conf
[global]
workgroup = MEDLAB
server string = Servidor de archivos de Medlab
map to guest = Bad User
null passwords = Yes
guest account = samba
printcap name = cups
ldap ssl = no
create mask = 0777
force create mode = 0777
force security mode = 0777
directory mask = 0777
force directory mode = 0777
force directory security mode = 0777
cups options = raw
[users]
comment = All users
path = /shared
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/
[admon]
comment = Administracion
path = /shared/admon
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/
[clientes]
comment = Clientes
path = /shared/clientes
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/
[gerencia]
comment = Gerencia
path = /shared/gerencia
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/
[medicos]
comment = Medicos
path = /shared/medicos/
inherit acls = yes
veto files = /aquota.user/groups/shares/
guest ok = yes
read only = no
[compartido]
comment = All groups
path = /shared/compartido/
username = samba
read only = No
acl check permissions = No
force unknown acl user = Yes
guest ok = Yes
hosts allow = 192.168.1.
===========================================================
Abajo un extracto de smbd.log para el dia de hoy. Espero que
encontraras la razon.
El proceso que ocupa 100% es el 11878, iniciado segun el log por el IP
192.168.1.67 (los IP son fijos) a las 8h51, y este IP cerro las
conexiones a las 11h57, pero son las 12h35 y el pid 11878 ocupa 100%
del CPU y no logro matarlo ;-(((
Talvez hay algo otro que no he visto...
Algo raro: el pid 11878 esta (ahora) a nombre de root y no de admon...
Tambien el TIME dice 37:24 que es igual al tiempo entre las 11h57 y
las 12h35. Algo sale mal al momento de cerrar la conexion y parar el
proceso...
TOP:
top - 12:35:00 up 19:40, 1 user, load average: 3.01, 2.92, 2.33
Tasks: 132 total, 4 running, 128 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 25.0%sy, 0.0%ni, 74.5%id, 0.3%wa, 0.0%hi, 0.2%si, 0.0%st
Mem: 2048884k total, 1996896k used, 51988k free, 98664k buffers
Swap: 2104504k total, 28k used, 2104476k free, 1506752k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
11878 root 20 0 16984 5188 3932 R 100 0.3 37:24.13 smbd
14763 root 20 0 2432 1132 848 R 1 0.1 0:00.04 top
1 root 20 0 1008 380 332 S 0 0.0 0:02.00 init
2 root 15 -5 0 0 0 S 0 0.0 0:00.00 kthreadd
3 root RT -5 0 0 0 S 0 0.0 0:00.00 migration/0
4 root 15 -5 0 0 0 S 0 0.0 0:00.84 ksoftirqd/0
5 root RT -5 0 0 0 S 0 0.0 0:00.00 migration/1
6 root 15 -5 0 0 0 S 0 0.0 0:00.50 ksoftirqd/1
7 root RT -5 0 0 0 S 0 0.0 0:00.00 migration/2
8 root 15 -5 0 0 0 S 0 0.0 0:00.24 ksoftirqd/2
9 root RT -5 0 0 0 S 0 0.0 0:00.00 migration/3
10 root 15 -5 0 0 0 S 0 0.0 0:00.28 ksoftirqd/3
11 root 15 -5 0 0 0 S 0 0.0 0:00.82 events/0
log.smbd:
[2009/02/14 03:45:15, 0] smbd/server.c:main(1208)
smbd version 3.2.6-0.3.1-2042-SUSE-CODE11 started.
Copyright Andrew Tridgell and the Samba Team 1992-2008
[2009/02/14 07:25:36, 1] smbd/service.c:make_connection_snum(1194)
nadia (::ffff:192.168.1.104) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11622)
[2009/02/14 07:34:17, 1] smbd/service.c:make_connection_snum(1194)
lenovo_medicos (::ffff:192.168.1.80) connect to service medicos
initially as user medicos (uid=1004, gid=100) (pid 11651)
[2009/02/14 07:34:17, 1] smbd/service.c:make_connection_snum(1194)
lenovo_medicos (::ffff:192.168.1.80) connect to service compartido
initially as user medicos (uid=1004, gid=100) (pid 11651)
[2009/02/14 07:42:03, 1] smbd/service.c:make_connection_snum(1194)
recepcion (::ffff:192.168.1.112) connect to service compartido
initially as user admon (uid=1002, gid=100) (pid 11661)
[2009/02/14 07:49:43, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11694)
[2009/02/14 07:49:43, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service compartido
initially as user admon (uid=1002, gid=100) (pid 11694)
[2009/02/14 07:49:44, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 07:49:44, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 07:49:44, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 07:53:58, 1] smbd/service.c:make_connection_snum(1194)
direccion_medic (::ffff:192.168.1.70) connect to service medicos
initially as user medicos (uid=1004, gid=100) (pid 11698)
[2009/02/14 07:53:58, 1] smbd/service.c:make_connection_snum(1194)
direccion_medic (::ffff:192.168.1.70) connect to service compartido
initially as user medicos (uid=1004, gid=100) (pid 11698)
[2009/02/14 07:54:06, 1] smbd/service.c:make_connection_snum(1194)
server_medlab (::ffff:192.168.1.71) connect to service clientes
initially as user clientes (uid=1006, gid=100) (pid 11700)
[2009/02/14 07:54:06, 1] smbd/service.c:make_connection_snum(1194)
server_medlab (::ffff:192.168.1.71) connect to service users
initially as user clientes (uid=1006, gid=100) (pid 11700)
[2009/02/14 08:05:12, 1] smbd/service.c:make_connection_snum(1194)
emma (::ffff:192.168.1.162) connect to service medicos initially as
user medicos (uid=1004, gid=100) (pid 11739)
[2009/02/14 08:05:12, 1] smbd/service.c:make_connection_snum(1194)
emma (::ffff:192.168.1.162) connect to service compartido initially
as user medicos (uid=1004, gid=100) (pid 11739)
[2009/02/14 08:05:15, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 08:05:15, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:05:15, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:09:42, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:09:42, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:09:42, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service compartido
[2009/02/14 08:09:42, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:15:07, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11772)
[2009/02/14 08:20:06, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:20:06, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:20:06, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:22:57, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11787)
[2009/02/14 08:23:04, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11787)
[2009/02/14 08:24:08, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11791)
[2009/02/14 08:24:11, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:24:11, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:30:35, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:30:35, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:30:35, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:32:15, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11819)
[2009/02/14 08:38:48, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service admon
[2009/02/14 08:38:48, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service compartido
[2009/02/14 08:44:47, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 08:44:47, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:44:47, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:46:28, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11868)
[2009/02/14 08:46:28, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11868)
[2009/02/14 08:47:33, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:47:33, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:47:33, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:47:46, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:47:46, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:47:46, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11873)
[2009/02/14 08:50:18, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service admon
[2009/02/14 08:50:18, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service compartido
****************************
[2009/02/14 08:51:20, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11878)
[2009/02/14 08:51:20, 1] smbd/service.c:make_connection_snum(1194)
medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11878)
****************************
[2009/02/14 09:37:46, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 09:37:46, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 09:37:46, 1] smbd/service.c:close_cnum(1405)
contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 10:06:23, 1] smbd/service.c:make_connection_snum(1194)
contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 14293)
[2009/02/14 10:06:26, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 10:06:26, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 10:06:26, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 11:47:29, 1] smbd/service.c:close_cnum(1405)
nadia (::ffff:192.168.1.104) closed connection to service admon
[2009/02/14 11:53:07, 1] smbd/notify_inotify.c:watch_destructor(351)
inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:11, 1] smbd/notify_inotify.c:watch_destructor(351)
inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:24, 1] smbd/notify_inotify.c:watch_destructor(351)
inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:27, 1] smbd/notify_inotify.c:watch_destructor(351)
inotify_rm_watch returned Argumento inválido
[2009/02/14 11:55:58, 1] smbd/service.c:close_cnum(1405)
recepcion (::ffff:192.168.1.112) closed connection to service compartido
[2009/02/14 11:57:54, 1] smbd/notify_inotify.c:watch_destructor(351)
inotify_rm_watch returned Argumento inválido
********************
[2009/02/14 11:57:58, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service compartido
[2009/02/14 11:57:58, 1] smbd/service.c:close_cnum(1405)
medlab (::ffff:192.168.1.67) closed connection to service admon
********************
[2009/02/14 12:01:14, 1] smbd/service.c:close_cnum(1405)
lenovo_medicos (::ffff:192.168.1.80) closed connection to service medicos
[2009/02/14 12:01:14, 1] smbd/service.c:close_cnum(1405)
lenovo_medicos (::ffff:192.168.1.80) closed connection to service compartido
[2009/02/14 12:03:56, 1] smbd/service.c:close_cnum(1405)
emma (::ffff:192.168.1.162) closed connection to service medicos
[2009/02/14 12:03:56, 1] smbd/service.c:close_cnum(1405)
emma (::ffff:192.168.1.162) closed connection to service compartido
[2009/02/14 12:05:15, 1] smbd/service.c:make_connection_snum(1194)
ar (::ffff:192.168.1.54) connect to service medicos initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15, 1] smbd/service.c:make_connection_snum(1194)
ar (::ffff:192.168.1.54) connect to service admon initially as user
gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15, 1] smbd/service.c:make_connection_snum(1194)
ar (::ffff:192.168.1.54) connect to service gerencia initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15, 1] smbd/service.c:make_connection_snum(1194)
ar (::ffff:192.168.1.54) connect to service clientes initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15, 1] smbd/service.c:make_connection_snum(1194)
ar (::ffff:192.168.1.54) connect to service compartido initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:18, 0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 12:05:18, 0] lib/util_sock.c:get_peer_addr_internal(1607)
getpeername failed. Error was El otro extremo de la conexión no está conectado
read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 12:08:26, 1] smbd/service.c:close_cnum(1405)
server_medlab (::ffff:192.168.1.71) closed connection to service clientes
[2009/02/14 12:08:26, 1] smbd/service.c:close_cnum(1405)
server_medlab (::ffff:192.168.1.71) closed connection to service users
--
Ing. Alejandro Rodriguez || @LeX
Usuario Linux # 379802
openSUSE 11.1
--
Para dar de baja la suscripción, mande un mensaje a:
opensuse-es+unsubscribe(a)opensuse.org
Para obtener el resto de direcciones-comando, mande
un mensaje a:
opensuse-es+help(a)opensuse.org