Dev

CMake ExternalProject options

CMake manpages are about the most verbose manpages on the planet. All I wanted to do was import an external project and customise the build options without having to go and add files into the child project / patch them / whatever.

AWS S3 log format for awstats

For S3 logging, awstats needs its LogFormat set up in the following manner: %other %extra1 %time1 %host %logname %other %method %url %methodurl %code %other %extra2 %bytesd %other %extra3 %refererquot %uaquot %other %other %other %other %other %virtualname %other Amazon’s documentation is available here

Bulk OCRing mixed content and exporting as PDF

This is more written as an aide-memoire to myself than anything. It’s a process I’m currently using for bulk-processing a set of documents of various forms (MS Word, PPT, PDF, LibreOffice etc), converting them all to PDF, running OCR on any embedded images and then sticking the end-result into Elasticsearch via Tika (not documented, plenty documentation elsewhere re this final step).