Status of the connection #4

daviddetorres · 2019-11-14T21:26:00Z

In some cases it can happen that the TCP connection with the device is ok, but the modbus server is not running, the client is connecting to another open port or that the modbus ID configured is not correct.

It also can happen that in the configuration shown in the README with a modbus TCP/RTU bridge, that the bridge is online (so the connection is established) but the connection with the RTU devices (usually a RS232 or RS485 bus) is not ok (bus disconnected, incorrect serial configuration, etc).

It could be interesting to add a metric to inform about the connection status (connection_up?) if the socket is correctly established, independently of the result of the query of the modbus registers. This can help to detect failures in the devices or configuration issues.

If you think any of this ideas would be worth to work in, I could work in a PR.

RichiH · 2019-12-08T21:06:29Z

Sounds interesting; the normal pattern would be to dump this on STDOUT/STDERR, but a /metrics endpoint for stats about the exporter itself would be nice and that could carry a counter for failures. That way, operators would know to check the logs.

daviddetorres · 2019-12-09T20:02:30Z

I'll work in a PR to add these error metrics.

Maybe it would be interesting, as you pointed, to give information of the number of failures, even adding a label with the type of failure (connection_error, timeout, incorrect_function?, bad_address?). This way operator would have more information on the type of problem they are addressing.

RichiH · 2019-12-27T13:36:58Z

Sounds good. If you touch that part, moving the exporter metrics from :0911/metrics to :9602/metrics and the target metrics from :9602/metrics to :9602/modbus/target=1.2.3.4 would be nice.

Basically https://github.com/prometheus/snmp_exporter#usage

daviddetorres · 2019-12-28T10:47:12Z

Seems similar to how the blackbox exporter also works. I'll try to work in a PR these days.

RichiH · 2019-12-28T11:10:32Z

Yes, we made blackbox and snmp exporters behave the same so we have a bit of a standard already

daviddetorres · 2019-12-29T09:13:29Z

I'm already working in adding information about the error rate and codes to the exporter. I think it would be also important to add information about time and number of the requests, so it is possible to visualize latency, traffic and error rate in a dashboard. Similar to what the blackbox exporter does, but with the modbus queries.

I'll include them in the metrics of the exporter with a label for the target, to be able to visualize these metrics in total and per target.

RichiH · 2019-12-29T11:55:09Z

Query runtime should be part of the target metric data. That way, you can pin down specific PLCs becoming slower, etc.

daviddetorres · 2019-12-29T12:38:58Z

In the PR #7 I added those metrics with the label "target", so error rates, latency, number of queries, etc. can be treated in total and per target. (I had to deal with specific PLCs with problems and that's why I added the label with the target).

The number of possible values of the label if bounded by the number of modbus TCP devices, and keeping in account that usually modbus IP PLCs act as aggregators of modbus RTU devices, there shouldn't be a great number of different targets. This way, the high cardinality problem should be contained.

For further information about the cause of a specific PLC failure or malfunction, there are logs, but at least, the alarm can be configured to note that something is happening and with information about which target is with problems and what kind of error.

dssantana-zz · 2020-01-14T01:08:59Z

@daviddetorres We are having some issues with some connections to the modbus server. some sessions are left as close_wait in the server side. in the same request we are querying around 600 devices, should we split the request?

dshatokhin · 2020-05-06T21:01:48Z

Modbus has interesting function code "0x11 - Report Slave ID". You can send request to modbus device and get unit id back, if device is up and running.
In other words - modbus ping.
Unfortunately, I'm only in halfway of my online golang course, so my skill and experience is way beyond useful PR at this point.

daviddetorres mentioned this issue Dec 28, 2019

Add counter for errors and change the entry point of exporter metrics #7

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Status of the connection #4

Status of the connection #4

daviddetorres commented Nov 14, 2019 •

edited

Loading

RichiH commented Dec 8, 2019 •

edited

Loading

daviddetorres commented Dec 9, 2019

RichiH commented Dec 27, 2019

daviddetorres commented Dec 28, 2019

RichiH commented Dec 28, 2019

daviddetorres commented Dec 29, 2019

RichiH commented Dec 29, 2019

daviddetorres commented Dec 29, 2019

dssantana-zz commented Jan 14, 2020

dshatokhin commented May 6, 2020 •

edited

Loading

Status of the connection #4

Status of the connection #4

Comments

daviddetorres commented Nov 14, 2019 • edited Loading

RichiH commented Dec 8, 2019 • edited Loading

daviddetorres commented Dec 9, 2019

RichiH commented Dec 27, 2019

daviddetorres commented Dec 28, 2019

RichiH commented Dec 28, 2019

daviddetorres commented Dec 29, 2019

RichiH commented Dec 29, 2019

daviddetorres commented Dec 29, 2019

dssantana-zz commented Jan 14, 2020

dshatokhin commented May 6, 2020 • edited Loading

daviddetorres commented Nov 14, 2019 •

edited

Loading

RichiH commented Dec 8, 2019 •

edited

Loading

dshatokhin commented May 6, 2020 •

edited

Loading