Use imageio.v3 when reading or writing images #1956

xunmeibuyue · 2023-04-25T03:59:36Z

Use imageio.v3 instead of imageio when reading or writing images. The advantage is that imageio.v3 supports more image formats than imageio, such as .webp.

I have provided code that clearly demonstrates the bug and that only works correctly when applying this fix
I have added suitable tests demonstrating a fixed bug or new/changed feature to the test suite in tests/
I have documented new or changed features in the documentation or in the docstrings
I have properly explained unusual or unexpected code in the comments around it

Fix #1894

mgaitan · 2023-04-25T12:50:25Z

@xunmeibuyue for next time: there is no need to open a new PR for each update after a review comment. You can simply push new commits to the same remote branch. That way the context of the review stays in a single place.

Regarding the PR itself, all CI builds must pass. Could you review what has broken and fix it? You can follow the CONTRIBUTING.md doc to install the dependencies and run the tests locally before pushing.

I look forward to your updates

xunmeibuyue · 2023-04-26T01:11:12Z

OK, I opened a new PR just for a cleanup commit, but I see that keeping the context of the review in a single place is more helpful.

I can't believe a change of just a few lines causes so many errors 😭. I will review and fix it later. Thanks for the patient comment!

FirefoxMetzger · 2023-04-28T12:00:55Z

Looks like the tests are no longer being collected properly because pytest tries to load images from a container that stores multiple, differently-shaped images. The image in question is pigs_in_a_polka.gif which uses an RGB palette as its first frame and RGBA palettes in all subsequent frames. As a result, the first frame has shape (273, 314, 3) whereas all others have shape (273, 314, 4).
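To illustrate why differently-shaped frames are a problem, here is a minimal NumPy-only sketch. The arrays are hypothetical stand-ins for the decoded GIF frames (only the shapes come from the file described above), and this is not ImageIO's internal code, just the kind of stacking that fails:

```python
import numpy as np

# Hypothetical stand-ins for the decoded frames; only the shapes are taken
# from the comment above (first frame RGB, later frames RGBA).
first_frame = np.zeros((273, 314, 3), dtype=np.uint8)
later_frame = np.zeros((273, 314, 4), dtype=np.uint8)

# Packing all frames into one (frames, height, width, channels) array
# requires every frame to have the same shape, so this raises ValueError.
try:
    np.stack([first_frame, later_frame, later_frame])
    stacked_ok = True
except ValueError:
    stacked_ok = False

# Once all frames share a channel count (here: alpha dropped), stacking works.
uniform = np.stack([f[..., :3] for f in (first_frame, later_frame, later_frame)])
```
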

This is one of the changes between ImageIO v2 and ImageIO v3: when reading a GIF, users typically just want all the frames in one go, which can be neatly packed into a (frames, *frame.shape) array for 99.5% of all the GIFs out there. For others, users either don't want to read all frames (in which case you'd use our index=value kwarg to select the image to read) or the image has become corrupt along the way and should be updated.

A couple of ways forward, depending on what the test tries to accomplish:

If you only expect a single (first) frame from the GIF, load it using index=0. This will make the behavior similar to the old imread from v2.
If you expect RGB data (or RGBA), then you can use mode="RGB" to convert all the frames to RGB before getting them back from ImageIO.
If you expect the image to use the same palette throughout, update the first frame to have an RGBA palette just like the other frames.

Also, for this PR it might be worthwhile to have a look at our migration guide: https://imageio.readthedocs.io/en/stable/reference/userapi.html#migrating-to-the-v3-api

On a higher level, MoviePy seems to use ImageIO for reading/writing images and to then rely on FFMPEG (I assume via subprocesses?) for reading/writing of video. ImageIO has supported video reading/writing since forever by calling an FFMPEG executable via a subprocess. Recently (about 1 year ago), we added a second backend exclusive to v3 that can call directly into the underlying FFMPEG API via pyAV, and - as a result - gives performance on par with or exceeding OpenCV.

To me, this sounds like there is an overlap between both projects, so I was wondering if it is worthwhile to investigate if we can combine some of the code, reduce duplication, and share some of the maintenance burdens in the process.

xunmeibuyue · 2023-04-28T13:11:44Z

Sorry, I just tried to commit a change (following the guidance of @FirefoxMetzger) to my fork, but it was mistakenly pushed here. So there is no need to run the actions, @mgaitan; I'll update the PR once I have finished testing locally.

FirefoxMetzger

Some quick comments - and suggestions on where new API functions might be more useful :)

FirefoxMetzger · 2023-04-28T13:06:26Z

moviepy/video/VideoClip.py

@@ -1040,7 +1040,7 @@ def __init__(

        if not isinstance(img, np.ndarray):
            # img is a string or path-like object, so read it in from disk
-            img = imread(img)
+            img = imread(img, index=0)[0]


Suggested change

img = imread(img, index=0)[0]

img = imread(img, index=0)

If you index the result of the call with [0], you will get the first row of the image. I suppose you meant to just get the first frame/image in the file?
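A quick NumPy sketch of the difference, using a hypothetical array with the frame shape mentioned earlier in this thread in place of the value returned by imread(img, index=0):

```python
import numpy as np

# Hypothetical stand-in for a single decoded frame: (height, width, channels).
frame = np.zeros((273, 314, 3), dtype=np.uint8)

# Indexing with [0] selects along the first axis, i.e. the top pixel row,
# not the first frame.
first_row = frame[0]
```

Here first_row has shape (314, 3), a single row of pixels, which is why the trailing [0] should be dropped.
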

FirefoxMetzger · 2023-04-28T13:07:44Z

moviepy/video/VideoClip.py

@@ -11,7 +11,7 @@

 import numpy as np
 import proglog
-from imageio import imread, imsave
+from imageio.v3 import imread, imwrite


While not strictly necessary, it is recommended to import Imageio as import imageio.v3 as iio. This gives you access to the full API and reads nicer than only importing the methods used.

You would then consume it as iio.imwrite and iio.imread respectively.

Thanks for pointing that out. I'll push a new commit.

FirefoxMetzger · 2023-04-28T13:08:47Z