New subject: [opensuse-es] Un proceso de samba satura el procesador en openSUSE 11.1

14 Feb 2009

      Hola tod@s,

A donde un cliente instalamos un servidor OpenSuse 11.1 con Samba,
para que comparten los archivos de manera centralizada, y no guardan
nada en sus PC.

Parte de la red esta inhalambrica.

En ciertos momento, el proceso smbd ocupa 100% del CPU, y deja el
servidor suuuuuuper lento, o indisponible.

Lo raro es que se hace imposible matar al proceso que ocupa el CPU. Ni
un kill -9 del proceso lo mata :-(((

Solo cortando la luz permite reiniciar el servidor.

Cambiamos la versión de de la versión 3.3.0 a la versión 3.2.6,
esperando que resuelva el problema. Nada, igual en algun momento
(varios minutos o horas) regresa el problema.

Sospechamos que algun(os?) usuario cierra mal su conneccion (o pierde
un intante la conexion wifi (?), y eso vuelve el Samba loco.

En Internet, no encontré respuesta satisfactoria a mi problema.

Alguien ya tuve este problema y lo pudo resolver?

smb.conf

[global]
workgroup = MEDLAB
server string = Servidor de archivos de Medlab
map to guest = Bad User
null passwords = Yes
guest account = samba
printcap name = cups
ldap ssl = no
create mask = 0777
force create mode = 0777
force security mode = 0777
directory mask = 0777
force directory mode = 0777
force directory security mode = 0777
cups options = raw

[users]
comment = All users
path = /shared
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/

[admon]
comment = Administracion
path = /shared/admon
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/

[clientes]
comment = Clientes
path = /shared/clientes
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/

[gerencia]
comment = Gerencia
path = /shared/gerencia
read only = No
inherit acls = Yes
veto files = /aquota.user/groups/shares/

[medicos]
comment = Medicos
path = /shared/medicos/
inherit acls = yes
veto files = /aquota.user/groups/shares/
guest ok = yes
read only = no

[compartido]
comment = All groups
path = /shared/compartido/
username = samba
read only = No
acl check permissions = No
force unknown acl user = Yes
guest ok = Yes
hosts allow = 192.168.1.

===========================================================

Abajo un extracto de smbd.log  para el dia de hoy. Espero que
encontraras la razon.

El proceso que ocupa 100% es el 11878, iniciado segun el log por el IP
192.168.1.67 (los IP son fijos) a las 8h51,  y este IP cerro las
conexiones a las 11h57, pero son las 12h35 y el pid 11878 ocupa 100%
del CPU y no logro matarlo ;-(((

Talvez hay algo otro que no he visto...

Algo raro: el pid 11878 esta (ahora) a nombre de root y no de admon...
Tambien el TIME dice 37:24 que es igual al tiempo entre las 11h57 y
las 12h35. Algo sale mal al momento de cerrar la conexion y parar el
proceso...

TOP:

top - 12:35:00 up 19:40,  1 user,  load average: 3.01, 2.92, 2.33
Tasks: 132 total,   4 running, 128 sleeping,   0 stopped,   0 zombie
Cpu(s):  0.0%us, 25.0%sy,  0.0%ni, 74.5%id,  0.3%wa,  0.0%hi,  0.2%si,  0.0%st
Mem:   2048884k total,  1996896k used,    51988k free,    98664k buffers
Swap:  2104504k total,       28k used,  2104476k free,  1506752k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
11878 root      20   0 16984 5188 3932 R  100  0.3  37:24.13 smbd
14763 root      20   0  2432 1132  848 R    1  0.1   0:00.04 top
    1 root      20   0  1008  380  332 S    0  0.0   0:02.00 init
    2 root      15  -5     0    0    0 S    0  0.0   0:00.00 kthreadd
    3 root      RT  -5     0    0    0 S    0  0.0   0:00.00 migration/0
    4 root      15  -5     0    0    0 S    0  0.0   0:00.84 ksoftirqd/0
    5 root      RT  -5     0    0    0 S    0  0.0   0:00.00 migration/1
    6 root      15  -5     0    0    0 S    0  0.0   0:00.50 ksoftirqd/1
    7 root      RT  -5     0    0    0 S    0  0.0   0:00.00 migration/2
    8 root      15  -5     0    0    0 S    0  0.0   0:00.24 ksoftirqd/2
    9 root      RT  -5     0    0    0 S    0  0.0   0:00.00 migration/3
   10 root      15  -5     0    0    0 S    0  0.0   0:00.28 ksoftirqd/3
   11 root      15  -5     0    0    0 S    0  0.0   0:00.82 events/0

log.smbd:

[2009/02/14 03:45:15,  0] smbd/server.c:main(1208)
  smbd version 3.2.6-0.3.1-2042-SUSE-CODE11 started.
  Copyright Andrew Tridgell and the Samba Team 1992-2008
[2009/02/14 07:25:36,  1] smbd/service.c:make_connection_snum(1194)
  nadia (::ffff:192.168.1.104) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11622)
[2009/02/14 07:34:17,  1] smbd/service.c:make_connection_snum(1194)
  lenovo_medicos (::ffff:192.168.1.80) connect to service medicos
initially as user medicos (uid=1004, gid=100) (pid 11651)
[2009/02/14 07:34:17,  1] smbd/service.c:make_connection_snum(1194)
  lenovo_medicos (::ffff:192.168.1.80) connect to service compartido
initially as user medicos (uid=1004, gid=100) (pid 11651)
[2009/02/14 07:42:03,  1] smbd/service.c:make_connection_snum(1194)
  recepcion (::ffff:192.168.1.112) connect to service compartido
initially as user admon (uid=1002, gid=100) (pid 11661)
[2009/02/14 07:49:43,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11694)
[2009/02/14 07:49:43,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service compartido
initially as user admon (uid=1002, gid=100) (pid 11694)
[2009/02/14 07:49:44,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 07:49:44,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 07:49:44,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 07:53:58,  1] smbd/service.c:make_connection_snum(1194)
  direccion_medic (::ffff:192.168.1.70) connect to service medicos
initially as user medicos (uid=1004, gid=100) (pid 11698)
[2009/02/14 07:53:58,  1] smbd/service.c:make_connection_snum(1194)
  direccion_medic (::ffff:192.168.1.70) connect to service compartido
initially as user medicos (uid=1004, gid=100) (pid 11698)
[2009/02/14 07:54:06,  1] smbd/service.c:make_connection_snum(1194)
  server_medlab (::ffff:192.168.1.71) connect to service clientes
initially as user clientes (uid=1006, gid=100) (pid 11700)
[2009/02/14 07:54:06,  1] smbd/service.c:make_connection_snum(1194)
  server_medlab (::ffff:192.168.1.71) connect to service users
initially as user clientes (uid=1006, gid=100) (pid 11700)
[2009/02/14 08:05:12,  1] smbd/service.c:make_connection_snum(1194)
  emma (::ffff:192.168.1.162) connect to service medicos initially as
user medicos (uid=1004, gid=100) (pid 11739)
[2009/02/14 08:05:12,  1] smbd/service.c:make_connection_snum(1194)
  emma (::ffff:192.168.1.162) connect to service compartido initially
as user medicos (uid=1004, gid=100) (pid 11739)
[2009/02/14 08:05:15,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 08:05:15,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:05:15,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:09:42,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:09:42,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:09:42,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service compartido
[2009/02/14 08:09:42,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:15:07,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11772)
[2009/02/14 08:20:06,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:20:06,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:20:06,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:22:57,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11787)
[2009/02/14 08:23:04,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11787)
[2009/02/14 08:24:08,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11791)
[2009/02/14 08:24:11,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:24:11,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:30:35,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:30:35,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:30:35,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:32:15,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11819)
[2009/02/14 08:38:48,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service admon
[2009/02/14 08:38:48,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service compartido
[2009/02/14 08:44:47,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 08:44:47,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:44:47,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:46:28,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11868)
[2009/02/14 08:46:28,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11868)
[2009/02/14 08:47:33,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:47:33,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:47:33,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 08:47:46,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 08:47:46,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 08:47:46,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 11873)
[2009/02/14 08:50:18,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service admon
[2009/02/14 08:50:18,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service compartido

****************************
[2009/02/14 08:51:20,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service admon initially as
user admon (uid=1002, gid=100) (pid 11878)
[2009/02/14 08:51:20,  1] smbd/service.c:make_connection_snum(1194)
  medlab (::ffff:192.168.1.67) connect to service compartido initially
as user admon (uid=1002, gid=100) (pid 11878)
****************************

[2009/02/14 09:37:46,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 09:37:46,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 09:37:46,  1] smbd/service.c:close_cnum(1405)
  contabilidad (::ffff:192.168.1.68) closed connection to service admon
[2009/02/14 10:06:23,  1] smbd/service.c:make_connection_snum(1194)
  contabilidad (::ffff:192.168.1.68) connect to service admon
initially as user admon (uid=1002, gid=100) (pid 14293)
[2009/02/14 10:06:26,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
[2009/02/14 10:06:26,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 10:06:26,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 11:47:29,  1] smbd/service.c:close_cnum(1405)
  nadia (::ffff:192.168.1.104) closed connection to service admon
[2009/02/14 11:53:07,  1] smbd/notify_inotify.c:watch_destructor(351)
  inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:11,  1] smbd/notify_inotify.c:watch_destructor(351)
  inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:24,  1] smbd/notify_inotify.c:watch_destructor(351)
  inotify_rm_watch returned Argumento inválido
[2009/02/14 11:53:27,  1] smbd/notify_inotify.c:watch_destructor(351)
  inotify_rm_watch returned Argumento inválido
[2009/02/14 11:55:58,  1] smbd/service.c:close_cnum(1405)
  recepcion (::ffff:192.168.1.112) closed connection to service compartido
[2009/02/14 11:57:54,  1] smbd/notify_inotify.c:watch_destructor(351)
  inotify_rm_watch returned Argumento inválido

********************
[2009/02/14 11:57:58,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service compartido
[2009/02/14 11:57:58,  1] smbd/service.c:close_cnum(1405)
  medlab (::ffff:192.168.1.67) closed connection to service admon
********************

[2009/02/14 12:01:14,  1] smbd/service.c:close_cnum(1405)
  lenovo_medicos (::ffff:192.168.1.80) closed connection to service medicos
[2009/02/14 12:01:14,  1] smbd/service.c:close_cnum(1405)
  lenovo_medicos (::ffff:192.168.1.80) closed connection to service compartido
[2009/02/14 12:03:56,  1] smbd/service.c:close_cnum(1405)
  emma (::ffff:192.168.1.162) closed connection to service medicos
[2009/02/14 12:03:56,  1] smbd/service.c:close_cnum(1405)
  emma (::ffff:192.168.1.162) closed connection to service compartido
[2009/02/14 12:05:15,  1] smbd/service.c:make_connection_snum(1194)
  ar (::ffff:192.168.1.54) connect to service medicos initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15,  1] smbd/service.c:make_connection_snum(1194)
  ar (::ffff:192.168.1.54) connect to service admon initially as user
gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15,  1] smbd/service.c:make_connection_snum(1194)
  ar (::ffff:192.168.1.54) connect to service gerencia initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15,  1] smbd/service.c:make_connection_snum(1194)
  ar (::ffff:192.168.1.54) connect to service clientes initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:15,  1] smbd/service.c:make_connection_snum(1194)
  ar (::ffff:192.168.1.54) connect to service compartido initially as
user gerencia (uid=1003, gid=100) (pid 14639)
[2009/02/14 12:05:18,  0] lib/util_sock.c:read_socket_with_timeout(939)
[2009/02/14 12:05:18,  0] lib/util_sock.c:get_peer_addr_internal(1607)
  getpeername failed. Error was El otro extremo de la conexión no está conectado
  read_socket_with_timeout: client 0.0.0.0 read error = Conexión
reinicializada por la máquina remota.
[2009/02/14 12:08:26,  1] smbd/service.c:close_cnum(1405)
  server_medlab (::ffff:192.168.1.71) closed connection to service clientes
[2009/02/14 12:08:26,  1] smbd/service.c:close_cnum(1405)
  server_medlab (::ffff:192.168.1.71) closed connection to service users

-- 
Ing. Alejandro Rodriguez  || @LeX
Usuario Linux # 379802
openSUSE 11.1
-- 
Para dar de baja la suscripciÃ³n, mande un mensaje a:
   opensuse-es+unsubscribe@opensuse.org
Para obtener el resto de direcciones-comando, mande
un mensaje a:
   opensuse-es+help@opensuse.org

[opensuse-es] Un proceso de samba satura el procesador en openSUSE 11.1

Alex Rodriguez

Carlos E. R.

Alex Rodriguez

Camaleón

admin-listas

Rafa Grimán

Roberto José Blandino Cisneros

Camaleón

Alex Rodriguez

Camaleón

tags

participants (6)