This environment:
CentOS 7.x WP 5.x PHP 7.x Fail2ban 0.11.x WP/PHP running in Docker
In /etc/fail2ban/filter.d/wplogin.conf:
[Definition] failregex = .*php7:notice.*\[client (<HOST>).*wp-admin.*Password Mismatch ignoreregex = Then in the WP home, we want to log errors to SYSLOG.
Getting copies of annual returns and company information from Companies House is easy. Searching the data in those returns isn’t quite so easy.
CH use a PDF format (PDF/A, akin to fax) that ensures maximum compatability.
I had a small project to display some simple stats for, for some static content sitting in an AWS S3 bucket. I could have forwarded everything to Elastic+Kibana and showed some fancy graphs and charts, but I was only being asked for what I could easily produce via AWStats.
This is more written as an aide-memoire to myself than anything. It’s a process I’m currently using for bulk-processing a set of documents of various forms (MS Word, PPT, PDF, LibreOffice etc), converting them all to PDF, running OCR on any embedded images and then sticking the end-result into Elasticsearch via Tika (not documented, plenty documentation elsewhere re this final step).