Project

General

Profile

Anomalie #3928

Le site https://listes.chapril.org/ n'est plus accessible

Added by Christian P. Momon over 1 year ago. Updated 19 days ago.

Status:
Attente d'information
Priority:
Élevée
Category:
Listes de diffusion
Target version:
Start date:
10/14/2019
Due date:
% Done:

0%

Estimated time:

Description

Notre merveilleuse sonde Icinga nous informe que :

17:05 < chatonic[3]> sympa.cluster.chapril.org!WWSympa: => CRITICAL - Le socket n'a pas répondu dans les 10 secondes

Et effectivement, le site ne répond plus.


Related issues

Related to Admins - Anomalie #1702: Problème d'arrêt d'apache2 lors du logrotate sur sympa.cluster.april.orgFermé04/21/2016

Actions

History

#1

Updated by Christian P. Momon over 1 year ago

  • Status changed from Nouveau to Confirmé
#2

Updated by Christian P. Momon over 1 year ago

  • Assignee set to François Poulain

Sur Freenode#april-chapril, le 14/10/2019 :

18:06 < QGuLL> cpm_screen: ya des apache en zombie sur symap
18:06 < QGuLL> root       825  0.0  0.2   5732  2088 ?        Ss   oct.13   0:01 /usr/sbin/apache2 -k start
18:06 < QGuLL> www-data   831  0.0  0.2   7260  2556 ?        S    oct.13   0:00  \_ /usr/sbin/apache2 -k start
18:06 < QGuLL> www-data   837  0.0  0.0      0     0 ?        Z    oct.13   0:01  \_ [apache2] <defunct>
18:06 < QGuLL> www-data   838  0.0  0.0      0     0 ?        Z    oct.13   0:00  \_ [apache2] <defunct>

Redémarrer Apache suffit à décoincer le problème. Par contre, ça n'explique pas le problème…

#3

Updated by Christian P. Momon over 1 year ago

  • Assignee deleted (François Poulain)
#4

Updated by Christian P. Momon over 1 year ago

  • Status changed from Confirmé to Attente d'information
  • Assignee set to Christian P. Momon

C'est la deuxième fois en quelques jours. En l'absence d'autres explications, difficile d'aller plus loin.

Si le problème ne se reproduit pas d'ici 7 jours alors fermer le ticket.

#5

Updated by Christian P. Momon over 1 year ago

Ça n'a pas mis longtemps à recommencer :

=(^-^)=root@sympa:~# ps auxwww|grep apache2
root     16396  0.0  0.1   5732  1976 ?        Ss   oct.14   0:00 /usr/sbin/apache2 -k start
www-data 16397  0.0  0.2   7260  2808 ?        S    oct.14   0:00 /usr/sbin/apache2 -k start
www-data 16398  0.0  0.0      0     0 ?        Z    oct.14   0:01 [apache2] <defunct>
www-data 16399  0.0  0.0      0     0 ?        Z    oct.14   0:00 [apache2] <defunct>
root     24749  0.0  0.0   6224   876 pts/2    S+   00:15   0:00 grep --color=auto apache2

#6

Updated by Christian P. Momon about 1 year ago

Apache n'est pas le seul à avoir des zombies :

=(^-^)=root@sympa:/var/log#  ps auxwww|grep Z
USER       PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
root       580  0.0  0.0      0     0 ?        ZNs  oct.19   0:00 [sudo] <defunct>
root       918  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root      1097  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root      1311  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root      1862  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root      2250  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
root      2767  0.0  0.0      0     0 ?        ZNs  oct.20   0:00 [sudo] <defunct>
root      2846  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
www-data  3049  0.0  0.0      0     0 ?        Z    oct.28   0:00 [apache2] <defunct>
www-data  3112  0.0  0.0      0     0 ?        Z    oct.28   0:00 [apache2] <defunct>
www-data  3142  0.0  0.0      0     0 ?        Z    oct.28   0:01 [apache2] <defunct>
root      3231  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root      4997  0.0  0.0      0     0 ?        ZNs  oct.15   0:00 [sudo] <defunct>
root      8475  0.0  0.0   6092   844 pts/0    S+   00:55   0:00 grep --color=auto Z
root      8998  0.0  0.0      0     0 ?        ZNs  oct.20   0:00 [sudo] <defunct>
root     11268  0.0  0.0      0     0 ?        ZNs  oct.13   0:00 [sudo] <defunct>
root     12918  0.0  0.0      0     0 ?        ZNs  oct.19   0:00 [sudo] <defunct>
root     14424  0.0  0.0      0     0 ?        ZNs  oct.20   0:00 [sudo] <defunct>
root     16130  0.0  0.0      0     0 ?        ZNs  oct.24   0:00 [sudo] <defunct>
root     17417  0.0  0.0      0     0 ?        ZNs  oct.14   0:00 [sudo] <defunct>
root     17603  0.0  0.0      0     0 ?        ZNs  oct.14   0:00 [sudo] <defunct>
root     17762  0.0  0.0      0     0 ?        ZNs  oct.14   0:00 [sudo] <defunct>
root     19785  0.0  0.0      0     0 ?        ZNs  oct.23   0:00 [sudo] <defunct>
root     19823  0.0  0.0      0     0 ?        ZNs  oct.18   0:00 [sudo] <defunct>
root     20675  0.0  0.0      0     0 ?        ZNs  oct.28   0:00 [sudo] <defunct>
root     21496  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
root     22291  0.0  0.0      0     0 ?        ZNs  oct.29   0:00 [sudo] <defunct>
root     22889  0.0  0.0      0     0 ?        ZNs  oct.23   0:00 [sudo] <defunct>
root     27995  0.0  0.0      0     0 ?        ZNs  oct.22   0:00 [sudo] <defunct>
root     28120  0.0  0.0      0     0 ?        ZNs  oct.27   0:00 [sudo] <defunct>
root     28374  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
root     28529  0.0  0.0      0     0 ?        ZNs  oct.25   0:00 [sudo] <defunct>
root     28597  0.0  0.0      0     0 ?        ZNs  oct.22   0:00 [sudo] <defunct>
root     29006  0.0  0.0      0     0 ?        ZNs  oct.22   0:00 [sudo] <defunct>
root     29781  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     29940  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     30503  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
root     30975  0.0  0.0      0     0 ?        ZNs  oct.26   0:00 [sudo] <defunct>
root     31336  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     31499  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     31659  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     31728  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     31781  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     32030  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
root     32196  0.0  0.0      0     0 ?        ZNs  oct.21   0:00 [sudo] <defunct>
root     32277  0.0  0.0      0     0 ?        ZNs  oct.16   0:00 [sudo] <defunct>
=(^-^)=root@sympa:/var/log#

Du coup, une idée ?

#7

Updated by Laurent POUJOULAT about 1 year ago

Que raconte le syslog ?

#8

Updated by Christian P. Momon about 1 year ago

  • Related to Anomalie #1702: Problème d'arrêt d'apache2 lors du logrotate sur sympa.cluster.april.org added
#9

Updated by François Poulain about 1 year ago

Marre. J'ai installé nginx.

- installation de nginx et spawn-fcgi
- reprise mutatis mutandis de la conf du nginx sympa april:

=(^-^)=root@sympa:/etc/nginx# cat snippets/sympa.conf 
location = / {
    return 301 /sympa/;
}

# shoot down rss for perfs issues
location = /sympa/rss {
    return 301 /sympa/;
}

location ~ ^/sympa/.*/chaton {
    return 301 /sympa/info/chapril;
}

location /sympa {
    include fastcgi_params;
    fastcgi_pass unix:/run/sympa/wwsympa.socket;
    fastcgi_split_path_info ^(/sympa)(.*)$;
        fastcgi_param SCRIPT_FILENAME /usr/lib/cgi-bin/sympa/wwsympa.fcgi;
        fastcgi_param PATH_INFO $fastcgi_path_info;
}

location /static-sympa {
    alias /usr/share/sympa/static_content;
    access_log off;
}

location /css-sympa/chapril.org {
    alias /var/lib/sympa/css/chapril.org;
    access_log off;
}

- installation du launcher:
=(^-^)=root@sympa:/etc/sympa# cat > /etc/systemd/system/wwsympa.service
[Unit]
Description=WWSympa - Web interface for Sympa mailing list manager
After=syslog.target sympa.service
BindsTo=sympa.service

[Service]
Type=forking
PIDFile=/var/run/sympa/wwsympa.pid
ExecStart=/usr/bin/spawn-fcgi -F $FCGI_CHILDREN \
    -P /var/run/sympa/wwsympa.pid \
    -u $FCGI_USER -g $FCGI_GROUP $FCGI_OPTS -- \
    /usr/lib/cgi-bin/sympa/wwsympa.fcgi
Environment="FCGI_CHILDREN=5" 
Environment="FCGI_USER=sympa" 
Environment="FCGI_GROUP=sympa" 
Environment="FCGI_OPTS=-s /var/run/sympa/wwsympa.socket -M 0600 -U www-data" 
EnvironmentFile=-/etc/default/sympa
Restart=always

[Install]
WantedBy=multi-user.target

#10

Updated by François Poulain about 1 year ago

On a un comportement curieux. Le truc passe en ipv6 mais pas en ipv4.

Comme on a proxy_pass http://sympa.cluster.chapril.org dans le bastion, des fois ça marche et des fois pas.

=> forçage en ipv6 dans le bastion.

#11

Updated by Christian P. Momon about 1 year ago

  • Project changed from Chapril to Infra Chapril
#12

Updated by Christian P. Momon 10 months ago

  • Target version set to Backlog
#13

Updated by Christian P. Momon 3 months ago

  • Assignee changed from Christian P. Momon to François Poulain
#14

Updated by Christian P. Momon 19 days ago

  • Category set to Listes de diffusion

Also available in: Atom PDF