Re: New RIFE filter - 3x faster AI interpolation possible in SVP!!!
https://www.svp-team.com/forum/viewtopi … 483#p81483
and copy the onnx moel in the correct folder
you need put the *.onnx files here C:\Program Files (x86)\SVP 4\rife\models\rife
You are not logged in. Please login or register.
SmoothVideo Project → Using SVP → New RIFE filter - 3x faster AI interpolation possible in SVP!!!
https://www.svp-team.com/forum/viewtopi … 483#p81483
and copy the onnx moel in the correct folder
you need put the *.onnx files here C:\Program Files (x86)\SVP 4\rife\models\rife
here's a full script you can use for testing
run with
> vspipe test.py .variables:
- frame size on line 16, "length" is the number of source frames
- CPU threads on line 6
- GPU streams on line 27
- SVP's add. functions on/off - comment line 69I thought the problem was in a CPU threads number, which was 4 before, and it's not enough to feed the GPU. Now it's 8, but you're saying there's no difference. Which is strange.
DON'T FORGET TO RESTART SVP after .js/.py file replacing, it may not reload these files automatically in some cases---
BTW use_cuda_graph=True gives only like 2% difference in my case
Better late than never
Output 2000 frames in 15.43 seconds (129.62 fps)
changing clip = core.std.BlankClip(width=1920,height=1080,length=1000,format=vs.YUV420P8)
to
clip = core.std.BlankClip(width=3840,height=2160,length=1000,format=vs.YUV420P8)
Output 2000 frames in 62.34 seconds (32.08 fps)
P.S. i ordered new hardware to test .. Ryzen 7700x, DDR5 7200 Memory etc.
[vapoursynth]
RIFE TRT ~83FPS, easy to watch x3 24 1080p in realtime. Compare with vsrife ~60fps
this screenshot shows software h264 transcoding, if i use the settings of the screenshot my 6core CPU is at 100%, GPU with 2 threads at about 35% utilization and 19fps (4k) or 41 fps (starts above 50 but after some time 41 is stabelized) (1080p)
however, your CPU is a 2CCD Model with higher memory write bandwidth ... let's see what happens when my new hardware arrives, it seems the 4090 is quite limited on the ryzen 5xxx platform
this screenshot shows software h264 transcoding, if i use the settings of the screenshot my 6core CPU is at 100%, GPU with 2 threads at about 35% utilization and 19fps (4k) or 41 fps (starts above 50 but after some time 41 is stabelized) (1080p)
Use the 'ultrafast' preset and let us know if performance has improved:
https://trac.ffmpeg.org/wiki/Encode/H.264
We are now testing RIFE and looking for bottlenecks
with software transcoding ?
UHD
> We are now testing
you are not
UHD
> We are now testingyou are not
I'm testing virtually without a proper graphics card, and this is even more difficult
with software transcoding ?
288 new fps (1080p) for 4090+13900k (TensorRT8.5+vs_threads=4+fp16) (rife46) (num_streams=10) (benchmark was done with vspipe file.py -p . instead of piping into ffmpeg and rendering to avoid cpu bottleneck)
164 new fps (1080p) for 4090+5950x (ncnn+2 threads+4 vs threads+ffmpeg (ultrafast) (rife4.6)
Source: https://github.com/styler00dollar/VSGAN-tensorrt-docker
It is best to check and test all options.
In my opinion, the more options are tested and the more test details are given the better. As for example in this already quite old post:
### Environment ###
Windows 10
DDR4-2933 48GiB
Nvidia RTX2070 8GiB
Nvidia Driver 511.79
CUDA Toolkit 11.3
cuDNN v8.2.1 (June 7th, 2021), for CUDA 11.x### Software ###
Python 3.10.4
VapourSynth R58-RC2
PyTorch 1.11.0 (CUDA 11.3)
vs_rife v2.0.0
VapourSynth-RIFE-ncnn-Vulkan r3 (model: 4.0)### Tools & Seting ###
GPU-Z 2.45.0
VapourSynth Editor r19-mod-5-AC2
VapourSynth threads: core.num_threads = 4
Decoder: lsmas.LWLibavSource(format="yuv420p8", prefer_hw=3)
Video: demo.mp4 [720p]### Result ###
1. RIFE filter for VapourSynth (PyTorch CUDA) - vs_rife v2.0.0
Interpolation: x2
RIFE model: 4.0
scale: 1.0
FP16: False
FPS: 54.115
CUDA: ~50%
PerfCap: VRel, VOp, Pwr2. RIFE filter for VapourSynth (PyTorch CUDA) - vs_rife v2.0.0
Interpolation: x2
RIFE model: 4.0
scale: 0.5
FP16: False
FPS: 69.997
CUDA: ~40%
PerfCap: VRel, VOp3. RIFE filter for VapourSynth (PyTorch CUDA) - vs_rife v2.0.0
Interpolation: x2
RIFE model: 4.0
scale: 0.5
FP16: True
FPS: 70.936
CUDA: ~32%
PerfCap: VRel, VOp4. RIFE filter for VapourSynth (ncnn Vulkan) - VapourSynth-RIFE-ncnn-Vulkan r3
Interpolation: x2
RIFE model: 4.0
GPU thread: 1
tta: False
uhd: False
sc: True
FPS: 27.356
CUDA: ~1%
Compute_1: 30%
PerfCap: Idle
5. RIFE filter for VapourSynth (ncnn Vulkan) - VapourSynth-RIFE-ncnn-Vulkan r3
Interpolation: x2
RIFE model: 4.0
GPU thread: 2
tta: False
uhd: False
sc: True
FPS: 92.956
CUDA: ~15%
Compute_1: ~94%
PerfCap: VRel, VOp, Pwr
6. RIFE filter for VapourSynth (ncnn Vulkan) - VapourSynth-RIFE-ncnn-Vulkan r3
Interpolation: x2
RIFE model: 4.0
GPU thread: 2
tta: False
uhd: True
sc: True
FPS: 92.366
CUDA: ~15%
Compute_1: ~94%
PerfCap: VRel, VOp, Pwr
7. RIFE filter for VapourSynth (ncnn Vulkan) - VapourSynth-RIFE-ncnn-Vulkan r3
Interpolation: x2
RIFE model: 4.0
GPU thread: 2
tta: False
uhd: False
sc: False
FPS: 87.083
CUDA: ~15%
Compute_1: ~94%
PerfCap: VRel, VOp, Pwr
8. RIFE filter for VapourSynth (ncnn Vulkan) - VapourSynth-RIFE-ncnn-Vulkan r3
Interpolation: x2
RIFE model: 4.0
GPU thread: 3
tta: False
uhd: False
sc: True
FPS: 90.645
CUDA: ~15%
Compute_1: ~94%
PerfCap: Idle
https://www.svp-team.com/forum/viewtopi … 483#p81483
and copy the onnx moel in the correct folder
you need put the *.onnx files here C:\Program Files (x86)\SVP 4\rife\models\rife
I followed the steps, think I did everything I was supposed to, but somehow this implementation does not seem as smooth as the mvp-lazy one. I get almost no contour artifacts for sure, but I can still see some sort of "ghost" trails and smoothness seems a bit off too. Maybe something is wrong on my side? As you can see the TensorRT is part of the Automatic profile, not sure if that's right.
Realized that I need to be in the RIFE video profile. So I switched to RIFE, I activate it with the new ON/OFF switch, a window pops up with some processes, after 3 minutes it's done and... it's not working properly. I'm getting 0.75 out of 1 and sound is lagging.
don't do "fixed 60 fps" with RIFE
don't do "fixed 60 fps" with RIFE
Switched to "To Screen" , Movie X2/3/4, still getting 0.75 and sound is lagging.
GOOD NEWS EVERYONE!
updated instructions
should improve FPS on 4080-and-better (probably 4070/3080 too, dunno), when performance is bound by the system's RAM bandwidth, not GPU power
i.e. for 4K playback
grobalt
> P.S. i ordered new hardware to test .. Ryzen 7700x, DDR5 7200 Memory etc.
cancel your order
updated instructions
should improve FPS on 4080-and-better (probably 4070 too, dunno), when performance is bound by the system's RAM bandwidth, not GPU power
i.e. for 4K playback
I have 32 GB DDR4-3200 RAM and trying to do real-time for 1080p. Grobalt said 1080p real-time should be ok on RTX 3070 laptop GPU. I can run things smoothly on mvp-lazy.
> Grobalt said 1080p real-time should be ok on RTX 3070 laptop GPU
not sure what you're doing, but it's OK even on a 2060 laptop now
> Grobalt said 1080p real-time should be ok on RTX 3070 laptop GPU
not sure what you're doing, but it's OK even on a 2060 laptop now
Well, what should I do? Reinstall SVP + MPV + Rife support and redo the installation for the new TRT stuff?
So I reinstalled everything and now the SVP index shows N/A and I did not get any window pop up to wait 3 minutes before it calculates stuff. Not sure what I'm doing wrong.
Chainik wrote:> Grobalt said 1080p real-time should be ok on RTX 3070 laptop GPU
not sure what you're doing, but it's OK even on a 2060 laptop now
Well, what should I do? Reinstall SVP + MPV + Rife support and redo the installation for the new TRT stuff?
maybe you follow VSRIFE guide, and not vsmltrt guide
uninstall SVP and follow the guide
https://www.svp-team.com/forum/viewtopi … 483#p81483
GOOD NEWS EVERYONE!
updated instructions
should improve FPS on 4080-and-better (probably 4070/3080 too, dunno), when performance is bound by the system's RAM bandwidth, not GPU power
i.e. for 4K playback
not sure what you're doing, but it's OK even on a 2060 laptop now
What exactly did you do? I'm very curious to know what solved the memory problems. 1080p real time with RIFE using 2060 laptop is impressive!
Xenocyde wrote:Chainik wrote:> Grobalt said 1080p real-time should be ok on RTX 3070 laptop GPU
not sure what you're doing, but it's OK even on a 2060 laptop now
Well, what should I do? Reinstall SVP + MPV + Rife support and redo the installation for the new TRT stuff?
maybe you follow VSRIFE guide, and not vsmltrt guide
uninstall SVP and follow the guide
https://www.svp-team.com/forum/viewtopi … 483#p81483
I reinstalled SVP, I did the TRT update, and now real-time interpolation does not even start. Well actually I do not know how to do this part: Make a copy of the RIFE video profile, one with TensorRT ON and another with OFF.
Hmm, even after reinstal.
EDIT: I downloaded the new pack and reinstalled it. it works again
basic, redownload all the files.
I deleted SVP and startet from scratch, just to check if the inscructions are complete and everything is working.
There is a step missing.
After replacing generate.js and base.py, start SVP4, add the new option TensorRT etc.
Then the missing step:
Copy the Rife AI profile and select the AI Model "rife"
Enable the new Option TensorRT On
GOOD NEWS EVERYONE!
updated instructions
should improve FPS on 4080-and-better (probably 4070/3080 too, dunno), when performance is bound by the system's RAM bandwidth, not GPU power
i.e. for 4K playbackgrobalt
> P.S. i ordered new hardware to test .. Ryzen 7700x, DDR5 7200 Memory etc.cancel your order
Just tried the updated instructions. Getting around 50fps at 4k with an RTX 4090 now. Between 170-190fps(it kept going up and down for some reason) at 1080p. These improvements are huge. Thanks for updating the instructions.
Edit: Just tested a bit more. 4k at 2x in real time now works for non HDR videos. With HDR at 4k, I only get around 35-40fps, so it is not possible yet.
SmoothVideo Project → Using SVP → New RIFE filter - 3x faster AI interpolation possible in SVP!!!
Powered by PunBB, supported by Informer Technologies, Inc.