[smokeping-users] Problem after reboot apache and smokeping
William Vidal
wrcvidal at gmail.com
Fri Nov 18 02:08:09 CET 2016
Hi folks,
I'm use a master/slave architecture in my envarioment.
My master is Centos 7, and my slaves are Ubuntu/Debian. Version smokepin is
2.7
In my architecture I have around 40 slaves and in each slave I have ~50 to
~90 objects to monitor.
So far so good. All works fine.
Today, I restarted smokeping and after this I cant UP the service anymore.
After try start smokeping, I see this logs :
tail -f /var/log/httpd/error_log
[Thu Nov 17 22:46:29.718257 2016] [core:error] [pid 22288] [client
172.18.101.214:40875] End of script output before headers: smokeping.fcgi
[Thu Nov 17 22:46:32.847477 2016] [fcgid:warn] [pid 22287] [client
172.18.25.98:35090] mod_fcgid: read data timeout in 400 seconds
[Thu Nov 17 22:46:32.847599 2016] [core:error] [pid 22287] [client
172.18.25.98:35090] End of script output before headers: smokeping.fcgi
[Thu Nov 17 22:46:34.302075 2016] [fcgid:warn] [pid 22533] [client
172.18.25.66:49311] mod_fcgid: can't apply process slot for
/usr/share/smokeping/cgi/smokeping.fcgi
[Thu Nov 17 22:46:34.716011 2016] [fcgid:warn] [pid 22290] [client
172.18.26.46:52243] mod_fcgid: read data timeout in 400 seconds
[Thu Nov 17 22:46:34.716255 2016] [core:error] [pid 22290] [client
172.18.26.46:52243] End of script output before headers: smokeping.fcgi
[Thu Nov 17 22:46:35.110750 2016] [fcgid:warn] [pid 22291] [client
172.18.23.110:45341] mod_fcgid: read data timeout in 400 seconds
[Thu Nov 17 22:46:35.110933 2016] [core:error] [pid 22291] [client
172.18.23.110:45341] End of script output before headers: smokeping.fcgi
[Thu Nov 17 22:46:35.746571 2016] [fcgid:warn] [pid 22285] [client
172.18.23.174:46649] mod_fcgid: read data timeout in 400 seconds
[Thu Nov 17 22:46:35.746674 2016] [core:error] [pid 22285] [client
172.18.23.174:46649] End of script output before headers: smokeping.fcgi
My config of fcgi:
/etc/httpd/conf.d/fcgid.conf
# This is the Apache server configuration file for providing FastCGI support
# through mod_fcgid
#
# Documentation is available at
# http://httpd.apache.org/mod_fcgid/mod/mod_fcgid.html
# Use FastCGI to process .fcg .fcgi & .fpl scripts
AddHandler fcgid-script fcg fcgi fpl
# fix for: mod_fcgid: read data timeout in 40 seconds
IdleTimeout 600
FcgidBusyTimeout 600
DefaultMinClassProcessCount 100
FcgidConnectTimeout 120
IPCCommTimeout 600
IPCConnectTimeout 300
# to get around upload errors when uploading images increase the
MaxRequestLen size to 35MB
MaxRequestLen 35728640
# Sane place to put sockets and shared memory file
FcgidIPCDir /run/mod_fcgid
FcgidProcessTableFile /run/mod_fcgid/fcgid_shm
In my access log I have this:
tail -f /var/log/httpd/access_log
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:30:40 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
127.0.0.1 - - [17/Nov/2016:22:32:45 -0200] "GET
/smokeping/smokeping.cgi?target=Sites HTTP/1.1" 500 527 "-" "Wget/1.14
(linux-gnu)"
::1 - - [17/Nov/2016:22:39:45 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
::1 - - [17/Nov/2016:22:41:45 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.26.122 - - [17/Nov/2016:22:44:26 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.26.78 - - [17/Nov/2016:22:44:29 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.26.62 - - [17/Nov/2016:22:44:28 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.23.174 - - [17/Nov/2016:22:44:27 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.25.5.1 - - [17/Nov/2016:22:44:36 -0200] "POST /smokeping/smokeping.cgi
HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.23.138 - - [17/Nov/2016:22:44:35 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.26.34 - - [17/Nov/2016:22:44:36 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.25.126 - - [17/Nov/2016:22:44:35 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.25.22 - - [17/Nov/2016:22:44:36 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.25.5.2 - - [17/Nov/2016:22:44:51 -0200] "POST /smokeping/smokeping.cgi
HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.26.90 - - [17/Nov/2016:22:44:44 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.101.214 - - [17/Nov/2016:22:44:52 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.101.150 - - [17/Nov/2016:22:44:41 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.25.126 - - [17/Nov/2016:22:44:52 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:01 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.25.58 - - [17/Nov/2016:22:44:55 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:02 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.23.70 - - [17/Nov/2016:22:39:27 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.25.118 - - [17/Nov/2016:22:39:28 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.26.50 - - [17/Nov/2016:22:39:31 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.23.226 - - [17/Nov/2016:22:39:25 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.26.78 - - [17/Nov/2016:22:39:28 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.26.26 - - [17/Nov/2016:22:45:13 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.12.1 - - [17/Nov/2016:22:39:28 -0200] "POST /smokeping/smokeping.cgi
HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:20 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.25.110 - - [17/Nov/2016:22:45:11 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:21 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.101.218 - - [17/Nov/2016:22:39:38 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:22 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.25.5.2 - - [17/Nov/2016:22:39:41 -0200] "POST /smokeping/smokeping.cgi
HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.26.50 - - [17/Nov/2016:22:45:15 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:23 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.22.118 - - [17/Nov/2016:22:39:41 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.22.18 - - [17/Nov/2016:22:39:40 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:24 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.26.26 - - [17/Nov/2016:22:39:41 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.26.122 - - [17/Nov/2016:22:39:43 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:25 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.22.38 - - [17/Nov/2016:22:39:41 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:26 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.25.118 - - [17/Nov/2016:22:45:20 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
172.18.26.78 - - [17/Nov/2016:22:45:19 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 503 299 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:27 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.23.162 - - [17/Nov/2016:22:39:36 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
172.18.101.150 - - [17/Nov/2016:22:39:40 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:28 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
172.18.101.214 - - [17/Nov/2016:22:39:45 -0200] "POST
/smokeping/smokeping.cgi HTTP/1.1" 500 527 "-" "smokeping-slave/1.0"
::1 - - [17/Nov/2016:22:46:29 -0200] "OPTIONS * HTTP/1.0" 200 - "-"
"Apache/2.4.6 (CentOS) mod_fcgid/2.3.9 (internal dummy connection)"
In my message logs:
Nov 17 22:31:29 HOL-MONITORIA-02 smokeping: FPingCS-POA-CCS: WARNING:
smokeping took 61 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:29 HOL-MONITORIA-02 smokeping: FPingCS-SP-PARRA-BT: NOTE:
smokeping took 60 seconds to complete 1 round of polling. This is over 80%
of the max time available for a polling cycle (60 seconds).
Nov 17 22:31:31 HOL-MONITORIA-02 smokeping: FPingCS-Mdeo-Onix: WARNING:
smokeping took 61 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:32 HOL-MONITORIA-02 smokeping: FPingCS-MDEO-DN-TERRAZAS: NOTE:
smokeping took 60 seconds to complete 1 round of polling. This is over 80%
of the max time available for a polling cycle (60 seconds).
Nov 17 22:31:37 HOL-MONITORIA-02 smokeping: FPingCS-POA-ECS: WARNING:
smokeping took 63 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:37 HOL-MONITORIA-02 smokeping: FPingSF-USA-1: WARNING:
smokeping took 64 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:38 HOL-MONITORIA-02 smokeping: FPingESTUDIO-PIAZZA: WARNING:
smokeping took 63 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:38 HOL-MONITORIA-02 smokeping: FPingCS-MDEO-LCS-Res01:
WARNING: smokeping took 64 seconds to complete 1 round of polling. It
should complete polling in 60 seconds. You may have unresponsive devices in
your setup.
Nov 17 22:31:38 HOL-MONITORIA-02 smokeping: FPingCA-MDEO-HORMIGON: WARNING:
smokeping took 62 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:31:38 HOL-MONITORIA-02 smokeping: FPingCS-PUNTA-ECS: WARNING:
smokeping took 64 seconds to complete 1 round of polling. It should
complete polling in 60 seconds. You may have unresponsive devices in your
setup.
Nov 17 22:32:04 HOL-MONITORIA-02 smokeping: FPingCS-LASCANO-OFICINA: NOTE:
smokeping took 60 seconds to complete 1 round of polling. This is over 80%
of the max time available for a polling cycle (60 seconds).
Nov 17 22:32:05 HOL-MONITORIA-02 smokeping: FPingCS-BAGE-SEVERO: NOTE:
smokeping took 60 seconds to complete 1 round of polling. This is over 80%
of the max time available for a polling cycle (60 seconds).
Nov 17 22:32:06 HOL-MONITORIA-02 smokeping: FPingCS-Mdeo-RCS-Res02:
WARNING: smokeping took 61 seconds to complete 1 round of polling. It
should complete polling in 60 seconds. You may have unresponsive devices in
your setup.
Nov 17 22:32:07 HOL-MONITORIA-02 smokeping: FPingHOL-OPENVPN-02: NOTE:
smokeping took 60 seconds to complete 1 round of polling. This is over 80%
of the max time available for a polling cycle (60 seconds).
.... and more logs like this.
The problem is that some many data receve by master.
Any sugestion?
Thanks
--
William Vidal
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.oetiker.ch/pipermail/smokeping-users/attachments/20161117/78283421/attachment-0001.html>
More information about the smokeping-users
mailing list