dont understand why the LG file is causing issues
You are not logged in. Please login or register.
SmoothVideo Project → Posts by grobalt
dont understand why the LG file is causing issues
What alola wrote the last days has just be to wrong .. impossible the results for his 3070ti with the old script.
DragonicPrime is something odd as well.
I can go MPV HDR on and OFF (Try to recover HDR colors) with frame doubling without dropped frames with a 5600x and RTX4090.
Now i need to figure out how to get that stuff integrated into a MadVR Player
The LG Demo File is lagging (about 2900 dropped frames at output for the full clip), 3 Rife Threads. No performance difference HDR on and off setting.
will do this soon .. currently generating the engine files for 4k/UHD resolution
I deleted SVP and startet from scratch, just to check if the inscructions are complete and everything is working.
There is a step missing.
After replacing generate.js and base.py, start SVP4, add the new option TensorRT etc.
Then the missing step:
Copy the Rife AI profile and select the AI Model "rife"
Enable the new Option TensorRT On
with software transcoding ?
[vapoursynth]
RIFE TRT ~83FPS, easy to watch x3 24 1080p in realtime. Compare with vsrife ~60fps
this screenshot shows software h264 transcoding, if i use the settings of the screenshot my 6core CPU is at 100%, GPU with 2 threads at about 35% utilization and 19fps (4k) or 41 fps (starts above 50 but after some time 41 is stabelized) (1080p)
however, your CPU is a 2CCD Model with higher memory write bandwidth ... let's see what happens when my new hardware arrives, it seems the 4090 is quite limited on the ryzen 5xxx platform
here's a full script you can use for testing
run with
> vspipe test.py .variables:
- frame size on line 16, "length" is the number of source frames
- CPU threads on line 6
- GPU streams on line 27
- SVP's add. functions on/off - comment line 69I thought the problem was in a CPU threads number, which was 4 before, and it's not enough to feed the GPU. Now it's 8, but you're saying there's no difference. Which is strange.
DON'T FORGET TO RESTART SVP after .js/.py file replacing, it may not reload these files automatically in some cases---
BTW use_cuda_graph=True gives only like 2% difference in my case
Better late than never
Output 2000 frames in 15.43 seconds (129.62 fps)
changing clip = core.std.BlankClip(width=1920,height=1080,length=1000,format=vs.YUV420P8)
to
clip = core.std.BlankClip(width=3840,height=2160,length=1000,format=vs.YUV420P8)
Output 2000 frames in 62.34 seconds (32.08 fps)
P.S. i ordered new hardware to test .. Ryzen 7700x, DDR5 7200 Memory etc.
https://www.svp-team.com/forum/viewtopi … 483#p81483
and copy the onnx moel in the correct folder
you need put the *.onnx files here C:\Program Files (x86)\SVP 4\rife\models\rife
Depends on your resolution .. 1080p runs without any problem .. 2k as well ... 4k is still not fast enough
i posted some benchmark data with a sample script: https://github.com/AmusementClub/vs-mlr … nt-4630465
numbers should be valid as someone else with RTX4080 is half performance
the parameter peak_performance = True increases performance by ~60%
thanks for the parameter feedback@github
I see no performance change with the TensorRT setting - so i am still thinking it is not running with Tensorcores
Maybe an indication that TRT is not used in my setup .. are you in discord or TeamViewer etc tomorrow? Going to bed now
It seems you run the SVP basic RIFE and not vs-mlrt as introduced here: https://www.svp-team.com/forum/viewtopi … 483#p81483
correct ?
anyhow, i tried the basic Rife implementation and nothing changed with 3 threads ..
which nvidia driver version are you using ?
i run windows 11 with the latest driver from january 5th, 528.02
Did you rename the profile to the name of the Ride default profile Name ? Generic 4.4 profile is odd as the 4.6 is used in vs-mlrt
80 to 115 is same result for me .. i would be happy to have a discord evening even curious with the rtx3070ti results shown above ... and 3 gpu threads as i cannot select odd numbers
Transcoding is completely different to MPV playback ... Transcoding H265 with profile fast is utilizing my CPU 100% .. playback not.
Chainik wrote:aloola
> I just want to report that I got 30fps on 4k transcodingand you're capped by the encoder speed
try h264 insteadit did 10% better
do you have discord ? please show me your profile setup .. you said 3 gpu threads, SVP allowes 1,2 and 4
i tried to replicate what you did .. advanced encoding with preset fast:
23:53:34.822: ===== Starting mpv ======
23:53:34.822: Command line: C:\Program Files (x86)\SVP 4\mpv64\mpv.exe D:/The Northman (2022).mkv --o=D://The Northman (2022).SVP.temporary.mkv --no-audio --no-sub --no-sub-auto --input-ipc-server=mpvencodepipe --input-media-keys=no --no-msg-color --vf=vapoursynth:[C:\Users\patri\AppData\Roaming\SVP4\scripts\ffff.py]:4:12 --of=matroska --ovc=h264_nvenc --ovcopts=b=10000000,preset=slow,profile=high,rc=vbr,maxrate=13000000,bufsize=20000000,time_base=1000/47952,threads=12
23:53:34.879: (+) Video --vid=1 (hevc 3840x2160 23.976fps)
23:53:34.879: Audio --aid=1 --alang=deu (*) 'Surround 7.1' (truehd 8ch 48000Hz)
23:53:34.879: Audio --aid=2 --alang=eng 'Surround 7.1' (truehd 8ch 48000Hz)
23:53:34.879: Audio --aid=3 --alang=eng 'Stereo' (ac3 2ch 48000Hz)
23:53:34.879: Subs --sid=1 --slang=deu (hdmv_pgs_subtitle)
23:53:34.879: Subs --sid=2 --slang=deu (hdmv_pgs_subtitle)
23:53:34.879: Subs --sid=3 --slang=deu (hdmv_pgs_subtitle)
23:53:34.879: Subs --sid=4 --slang=eng (hdmv_pgs_subtitle)
23:53:34.879: Subs --sid=5 --slang=eng (hdmv_pgs_subtitle)
23:53:34.879: Subs --sid=6 --slang=eng (hdmv_pgs_subtitle)
23:53:34.879: File tags:
23:53:34.879: Title: The Northman (2022)
23:53:35.954: vstrt: TensorRT version mismatch, built with 2135 but loaded with 2136; continue but fingers crossed...
23:53:36.859: CUDA lazy loading is not enabled. Enabling it can significantly reduce device memory usage. See `CUDA_MODULE_LOADING` in https://docs.nvidia.com/cuda/cuda-c-pro … l#env-vars
23:53:36.941: CUDA lazy loading is not enabled. Enabling it can significantly reduce device memory usage. See `CUDA_MODULE_LOADING` in https://docs.nvidia.com/cuda/cuda-c-pro … l#env-vars
23:53:39.511: [autoconvert] Converting yuv420p10 -> p010
23:53:39.521: VO: [lavc] 3840x2160 p010
23:53:39.521: [vo/lavc] Opening encoder: NVIDIA NVENC H.264 encoder [h264_nvenc]
23:53:39.725: [ffmpeg] h264_nvenc: 10 bit encode not supported
23:53:39.749: [ffmpeg] h264_nvenc: No capable devices found
23:53:39.749: [vo/lavc] Could not initialize encoder.
23:53:39.749: Could not initialize video chain.
23:53:39.947: Video: no video
23:53:39.947: [encode] no data written to target file
23:53:39.947: Exiting... (Interrupted by error)
23:53:40.128: (!!!) Intermediate file may be broken: D:\\The Northman (2022).SVP.temporary.mkv
23:53:40.128: ===== mpv exited with code 2 =====
*edit* error found .. movies was 10bit and ... this does not work. so i have to use convert to 8bit or .. use h265 instead, see below
Thanks, will invest later tonight how to apply this do you read the github thread ?
just to understand the workflow .. mpv creates a full script based on input from base.py and generate.js and this script is stored under C:\Users\%user%\AppData\Roaming\SVP4\scripts
correct ?
This file does not contain the parameter use_cuda_graph=True and stream maximum is 4 and github user recommends 5 streams to saturate the Ampere GPU - but RTX is already the successor https://github.com/AmusementClub/vs-mlr … nt-4625003
I deleted all models in the appdata folder to make sure they are created from scratch -> with the new generate.js still 22 fps, nothing changed
i tried to investigate with the nvidia monitoring tool ...
https://github.com/AmusementClub/vs-mlr … nt-4624607
Does this help ? https://github.com/AmusementClub/vs-mlr … nt-4623148
I run the 4090 as a pcie 4 card in a pcie 4 x16 slot..
2 and 4 streams have identical performance on my 4090 where k can confirm 115 fps in 1080p. Will try tomorrow with 4.4
damn .. too difficult to do something useful in this forum as it seems just one posting per hour is doable or something like this ...
1080P with 4x is possible .. currently creating the files for 4k ... 4k lagging.
transcoding results with TensorRT onnx 4.6 gputhreads=2(and 4-> same results)
1080p -> 117.8 fps
2160p -> 22,4 fps
RTX4090:
Rife 4.4 with TensorRT on Scence change threshold 8%, GPU threads 2
Transcoding 1080p with 94fps
Transcoding 2160p with 19.7fps
Rife 4.6 ensembleFalse with TensorRT on Scence change threshold 10%, GPU threads 2
Transcoding 1080p with 100fps
Transcoding 2160p with 20fps
Rife 4.6 ensembleTrue with TensorRT on Scence change threshold 10%, GPU threads 2
Transcoding 2160p with 20.2fps
Rife 4.6 ensembleTrue with TensorRT on Scence change threshold 10%, GPU threads 4
Transcoding 2160p with 21.3fps
realtime mpv crashes or does not even start
the files in AppData\Roaming\SVP4\cache\trt look as:
flownet_v4.6.pkl_NVIDIA GeForce RTX 4090_trt-8.5.2.2_3840x2176_fp32_workspace-1073741824_scale-1.0_ensemble-False
i can offer teamviewer, discord .. whatever
Thanks, now colors but just 19fps
I have no clue how to install RifeRT Version 4.6 ...
SmoothVideo Project → Posts by grobalt
Powered by PunBB, supported by Informer Technologies, Inc.