TITLE: Compressing images for this website
DATE: 2019-07-05
AUTHOR: John L. Godlee
====================================================================
I wanted my webpages to load faster, and I was somewhat inspired by
this website which runs entirely on a solar powered raspberry pi.
They needed to minimize the amount of data transfer to keep energy
usage down. I was also inspired a bit by this blog post on their
image dithering procedure which converts images to black and white
while preserving the essence of what the image is trying to
portray. Unfortunately they do their image conversion in Adobe
Photoshop, which I didn't want to use. Instead I decided to see if
I could replicate their conversion process and achieve highly
compressed dithered images using ImageMagick, which could be
written into a reproducible script that I could run on a set of
images. I also wanted to see if there was a better method of
compressing the images.
[this website]:
https://solar.lowtechmagazine.com/2018/09/how-to-build-a-lowtech-web
site/
[blog post]:
http://gauthierroussilhe.com/en/posts/convert-low-tech
[ImageMagick]:
https://imagemagick.org/index.php
ImageMagick runs on the command line and the command I use is
convert which is designed to take an image, alter it in a multitude
of ways, then save to a new file. mogrify is better for overwriting
the same image.
I have an example image here which is currently 6.8 MB.
[example image here]: /img_full/spot_autumn/bike.png
y first pass at the image is simply to reduce the -quality of the
image, which takes a percentage value that pertains to the
compression level.
convert bike.jpg -quality 90% output_1.jpg
This produces an image of 3.3 MB.
Here is a plot of the file size as the -quality percentage is
reduced. and a table of file sizes. Note that 100% compression on
the original image actually made the original image of 6.8 MB
larger, to 7.8 MB.
![Plot of comperession percentage and image
size](
https://johngodlee.xyz/img_full/img_convert/plot.png)
Compression (%) Size (MB)
------------------- --------------
100 7.799
95 4.479
90 3.229
85 2.6
80 2.24
75 1.973
70 1.808
65 1.658
60 1.531
55 1.434
50 1.349
45 1.268
40 1.172
35 1.086
30 0.9793
25 0.8683
20 0.7416
15 0.5998
10 0.4374
5 0.2395
----------------- ------------
It's interesting that even at 5% -quality, the image is still not
bad to look at.
For the next part I'll take an arbitrary -quality value of 50%,
meaning the base image is 1.349 MB.
Converting the image to grayscale makes the file slightly smaller,
but not much
convert bike.jpg -quality 50% -colorspace gray output_21.jpg
The original image was 1,414,879 bytes (1.349 MB) and the grayscale
image is 1,325,856 (1.264 MB), a difference of 89,023 bytes, or 87
KB.
I also experimented with other options for reducing the colors, but
none of these made any positive difference. -posterize 4 = 1.781
MB, -quantize gray = 1.349 MB.
Next I want to dither the image to hopefully save even more file
size. I'll use the grayscale image I generated earlier, which is
1.264 MB. There are multiple options for dithering so here is a
comparison.
convert bike.jpg -quality 50% -colorspace gray -dither
floyd-steinberg output_25.jpg
Both methods produced files of exactly the same size as the
grascale image so I think I'm doing something wrong. I think I
should apply some sort of dithering color reduction flag as well.
Here are examples with different color reduction methods:
convert bike.jpg -quality 50% -colorspace gray -dither
floyd-steinberg -monochrome output_26.jpg
convert bike.jpg -quality 50% -colorspace gray -dither
floyd-steinberg -posterize 10 output_27.jpg
The -monochrome option produced a file of 2.812 MB and the
-posterize 10 option produced a file of 1.428 MB.
Posterize is an interesting one to play with. Here is a comparison
of -posterize with different numbers of colours:
Size (MB) Number of colours
------------- ---------------------
1.428 10
1.459 9
1.487 8
1.532 7
1.578 6
1.631 5
1.674 4
1.793 3
1.437 2
0.06754 1
----------- -------------------
Weirdly, the file size actually increases as the number of colours
decreases, up until the number of colours is two.
There are also predefined dithering threshold maps that can be used
with -ordered-dither:
convert bike.jpg -quality 50% -colorspace gray -ordered-dither
c7x7w output_43.jpg
Here is a comparison of these methods:
![Plot of ordered dither method and output
size](
https://johngodlee.xyz/img_full/img_convert/plot_dith.png)
Code Description Size (MB)
---------- ----------------------------- -------------
checks Checkerboard 2x1 (dither) 2.678
o2x2 Ordered 2x2 (dispersed) 3.702
o3x3 Ordered 3x3 (dispersed) 4.445
o4x4 Ordered 4x4 (dispersed) 4.557
o8x8 Ordered 8x8 (dispersed) 4.675
h6x6a Halftone 6x6 (angled) 4.824
h8x8a Halftone 8x8 (angled) 1.264
h6x6o Halftone 6x6 (orthogonal) 4.877
h8x8o Halftone 8x8 (orthogonal) 4.831
c5x5b Circles 5x5 (black) 4.879
c5x5w Circles 5x5 (white) 4.879
c6x6b Circles 6x6 (black) 4.831
c6x6w Circles 6x6 (white) 4.823
c7x7b Circles 7x7 (black) 4.915
c7x7w Circles 7x7 (white) 4.899
-------- --------------------------- -----------
So the h8x8a method produced the smallest file size of 1.264 MB,
but this is only as small as the undithered grayscale image!
So from my little test so far it seems that the best way of
compressing images is to use a grayscale undithered image an adjust
the -quality of the image to my liking.
Here are a few other options to reduce file size I came across.
Here is a comparison using the undithered grayscale image.
Removing EXIF data, which I would probably want to do regardless of
the file size, just for privacy reasons:
convert bike.jpg -quality 50% -colorspace gray -strip
output_54.jpg
This reduces the file size from 1.264 MB to 1.242 MB, a difference
of 23.2 KB.
Adding some Gaussian blur to reduce the detail level:
convert bike.jpg -quality 50% -colorspace gray -strip
-gaussian-blur 0.05 output_54.jpg
This reduces the stripped grayscale image from 1.242 MB to 1.031
MB, a difference of 215 KB.
Here is a comparison of images converted with different levels of
gaussian blur.
One thing I did find was that the computation time increased with
the radius of the blur.
![Plot of gaussian blur and output
size](
https://johngodlee.xyz/img_full/img_convert/plot_blur.png)
Blur Size (MB)
-------- -------------
0.05 1.031
0.1 1.031
0.2 1.031
0.3 1.031
0.4 1.031
0.5 1.031
0.6 1.031
0.7 1.031
0.8 1.031
0.9 1.031
1 1.031
2 0.9472
3 0.9377
4 0.9373
5 0.9373
10 0.9373
20 0.9373
30 0.9373
40 0.9373
50 0.9373
------ -----------
So, finally, the optimal way of compressing my JPEG images seems to
be:
convert bike.jpg -quality 20% -colorspace gray -strip
-gaussian-blur 3.5 output.jpg
This leads to an image of 538 KB. It looks good, except when
zooming in, then it gets a bit hairy, this is where dithering might
become useful if I can find a way of dithering while maintaining
the small size of the image.
Another thing I learned from reading the comments on this website
was that JPEG is the most appropriate format for reducing image
file size as it is lossy, unlike PNG, so I can convert all my PNG
images to JPEG using:
[this website]:
https://solar.lowtechmagazine.com/2018/09/how-to-build-a-lowtech-web
site/
convert test.png -background white -flatten output.jpg
-background white -flatten is needed to allow the JPEG to properly
display the transparent areas of the JPG as white background.
I have a folder on my website with images sorted into directories
based on what blog post they refer to. I wrote this shell script to
prepare the images to be put onto the website:
#!/bin/bash
shopt -s nullglob
# For each image
for i in img_full/*/*.{jpg,JPG,jpeg,png,PNG}; do
# If the directory in the compressed images directory
doesn't exist, create it
new_dir=$(dirname ${i} | sed "s/img_full/img/g")
if [ ! -d $new_dir ]; then
mkdir -p $new_dir;
fi
echo $dir
# Create new file path for output
new_path=$(echo "${i}" | sed "s/img_full/img/g")
echo $new_path
# Convert image files
if [ ! -f $new_path ]; then
convert $i -quality 20% -colorspace gray -strip
-background white -flatten -gaussian-blur 3.5 $new_path
fi
done
# Maintain full size 404 image
cp img_full/404/404.png img/404/404.png