Re: New RIFE filter - 3x faster AI interpolation possible in SVP!!!
Thanks lwk7454 for another test and showing how hardware scheduling in Windows 11 with RIFE works in practice.
FP16 720p: Cuda ~40%, SVP index 1.0
FP16 1080p: Cuda jumps between 35% - 51%, SVP index N/A
The result surprised me a bit. I expected it to reach 100%, which would have made everything clear. That's why such tests are worth doing. It looks like, for some reason, about 50% CUDA utilization is the ceiling. Or maybe CUDA is not the bottleneck at all? It is probably not the amount of VRAM either, because the 3090 has plenty. Maybe it's the VRAM bandwidth, similar to how RAM bandwidth is the bottleneck in the native SVP algorithm: https://www.svp-team.com/forum/viewtopic.php?id=6349
Or maybe, as you wrote earlier, there is still room for optimization? After all, 40% or even 46% GPU utilization at 720p gives hope for 1080p in the future.
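If anyone wants to check the bandwidth idea during playback, here is a rough sketch in Python that reads the counters from nvidia-smi. It assumes nvidia-smi is on the PATH; the function names and the dictionary keys are my own invention, not part of SVP or the RIFE filter. Note that "utilization.memory" is the share of time the memory controller was busy, which is only a hint about bandwidth, and these numbers may not match the Cuda graph in Task Manager.

```python
import subprocess

# Fields requested from nvidia-smi (documented --query-gpu properties).
QUERY_FIELDS = "utilization.gpu,utilization.memory,memory.used,memory.total"


def parse_sample(line):
    """Parse one 'csv,noheader,nounits' line from nvidia-smi into a dict."""
    gpu, mem_ctrl, used, total = (float(v.strip()) for v in line.split(","))
    return {
        "gpu_util_pct": gpu,            # time the GPU was executing kernels
        "mem_ctrl_util_pct": mem_ctrl,  # time VRAM was being read or written
        "vram_used_mib": used,
        "vram_total_mib": total,
    }


def read_sample(gpu_index=0):
    """Query the given GPU once; requires nvidia-smi to be installed."""
    result = subprocess.run(
        [
            "nvidia-smi",
            f"--id={gpu_index}",
            f"--query-gpu={QUERY_FIELDS}",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_sample(result.stdout.strip().splitlines()[0])


# Example use while a video is playing with RIFE enabled:
#     print(read_sample())
```

If the memory controller figure stays high while the GPU figure hovers around 40-50%, that would point towards VRAM bandwidth. If both stay low, the bottleneck is probably somewhere else.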
FP32 did not work at all, images were all frozen and all nodes were at 0%
And here's another surprise. Looks like there is a problem somewhere in the software.
Turns out only Cuda node was used.
This confirms that the RIFE filter for VapourSynth (PyTorch) does use CUDA, while the RIFE filter for VapourSynth (ncnn) does not: https://www.svp-team.com/forum/viewtopi … 96&p=2 There is still hope that the latter filter will also use CUDA one day, as Chainik mentioned: https://github.com/nihui/rife-ncnn-vulkan/issues/22 which points to this unfinished project: https://github.com/atanmarko/ncnn-with-cuda Maybe some talented interpolation enthusiast will implement the one layer (Interp) that is still missing from that project?
Until then, however, it is worth focusing on what currently gives us the most potential for real-time interpolation, namely the RIFE filter for VapourSynth (PyTorch).
I would be very interested to see the results of a few more simple tests, which might show what causes the FP32 problems and how much potential there is for interpolating 1080p files. However, I will be away from the forum for 2 days, and I wouldn't want to ask for something when I won't be able to thank you and comment on the results. I will try to describe it all properly on 3 November.
Thanks again for the tests. I encourage others to check how real-time RIFE interpolation performs on their graphics cards, with other media players, or on other Windows versions.