We need one sentence that states that we take either the first or last frame of the image. The last will probably make the most sense in cases of non-repeatable animations.
This was fixed a while back with this text:
For animated raster image formats (such as GIF), the first frame of the animation sequence is used. For SVG images ([SVG11]), the image is rendered without animations applied.