Re: SVP 4 Linux
deleted
You are not logged in. Please login or register.
SmoothVideo Project → Using SVP → SVP 4 Linux
deleted
SVP doesn´t detect my Nvidia GTX 1060 GPU in Manjaro Linux, I can´t set GPU acceleration but in Windows I can. Why is that? I installed Nvidia Drivers and GPU works fine with games, what should I do in order to get GPU acceleration in SVP?, didn´t it work in Linux? then why is the option there?
should I install both? what is this?, if it is nvidia-utils I already have it, I installed the propietary drivers, but svp still doesn´t detect it
Prerequisites
6. (recommended) Proprietary video drivers including OpenCL ICDFor GPU acceleration support use proprietary video drivers with (probably) some additional modules - search for "OpenCL ICD". For example in case of NV cards "nvidia-modprobe" package is needed for OpenCL to work. Check the installation with utils/clinfo.
Ok, I solved it, thank you.
I installed GPU-viewer and I saw that opencl was not installed, so I installed it throw pacman with the package opencl-nvidia.
Linux noob here, running Xubuntu 18.04.1.
I'm having trouble giving SVP write permission for VLC's video_filter folder.
I run
sudo chmod 777 /usr/lib/x86_64-linux-gnu/vlc/plugins/video_filter
in the console and it appears to work fine - no errors or other messages.
But every time I try to activate "SVP in VLC (replace 'deinterlace' filter)" in SVP's menu, I get this error:
Operation failed. Check that SVP Manager has write access to '/usr/lib/x86_64-linux-gnu/vlc/plugins/video_filter/video_filter'.
What am I doing wrong?
'chmod -R' probably
Still no go.
I also tried "-R a+rwx". Properties for the folder and files inside show Read&Write permissions for everyone, but SVP still gives the same error.
why '/usr/lib/x86_64-linux-gnu/vlc/plugins/video_filter/video_filter' ("video_filter" written twice)?
is it your typo when copying text?
check 'main.setup.vlc.plugins' value in "All settings", it must be equal to "/usr/lib/x86_64-linux-gnu/vlc/plugins"
Damn, I'm stupid. I entered the full path to filters instead of plugins and didn't even notice it in the error message.
Thanks, now it's working.
As a minor follow-up to what I said on the previous page about propietary vs open drivers for getting OpenCL GPU acceleration on AMD and Intel GPUs, I can at least confirm that on Mint 19.1 (and therefore presumably anything based on Ubuntu 18.04) that all I had to do was install "mesa-opencl-icd" and, specifically for my Intel Haswell iGPU, "beignet-opencl-icd" through my OS's package manager.
And that was it - I was then able to use my Intel iGPU to accelerate SVP without seemingly any issue whatsoever and performance was then comparable to what I get in Windows with the same hardware configuration and customized SVP interpolation profile settings (I also confirmed that disabling GPU acceleration absolutely tanked my performance).
----------------------------------------------------------------
You know, the Linux page on the wiki could probably use a bit of a cleanup - an easy example is that one needs to use libass9 nowadays instead of libass5.
Also the process for getting SVP working on at least Linux Mint 19.1 (and therefore presumably any distro based on Ubuntu 18.04) seems to be much less involved than the wiki makes it out to be. From my totally-not-in-depth testing, it seems like all one needs to do is run the following four commands individually before running the SVP installer, and then you'll be good to go:
sudo add-apt-repository ppa:deadsnakes/ppa
sudo add-apt-repository ppa:djcj/vapoursynth
sudo apt-get update
sudo apt-get install libqt5concurrent5 libqt5qml5 libpython3.5 mediainfo mpv libqt5svg5 libass9
Now I don't know what goes into the process of making a .deb, but I do know that other software distrobuted as .deb, like MEGAsync, will automatically install the required dependencies before installing the according program, and I can't help but think that something like that could be useful here.
And of course, like I stated, I was then also able to just use the following command to get GPU acceleration in SVP on my Haswell Intel graphics (this supposedly would work for Intel Ivy Bridge graphics as well and maybe even Intel Sandy Bridge graphics):
sudo apt-get install mesa-opencl-icd beignet-opencl-icd
I believe the "mesa-opencl-icd" would also be used for AMD GPUs, but at this time I don't know what (if anything) would be the AMD GPU equivalent to beignet-opencl-icd.
EDIT: While playing with things a little farther, I did notice something interesting - if I open a video file with SVP, then it'll play in mpv with SVP's interpolation active like usual. If I open a video file in mpv directly, it will play in in mpv without SVP's interpolation being active.
Is this the expected behavior? I mean, I only have a single installation of mpv installed since this was tested on a completely fresh installation of Linux Mint 19.1 Cinnamon 64bit.
mpv.conf is the key for your last question
So similar to what I've mentioned in my "OpenCL minimum requirement" thread, is it accurate to say that, for AMD graphics, the minimum requirement to use SVP's GPU acceleration (without going all the way back to Ubuntu 14.04) would be the Radeon HD7000 series?
I really have no idea. If you post a clinfo's output for a pre-7000 card in Linux, I'll tell why it's not supported in SVP.
Here you go, Radeon HD5850:
htpc@htpc-linux:~$ clinfo
Number of platforms 2
Platform Name Clover
Platform Vendor Mesa
Platform Version OpenCL 1.1 Mesa 18.2.2
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd
Platform Extensions function suffix MESA
Platform Name Portable Computing Language
Platform Vendor The pocl project
Platform Version OpenCL 1.2 pocl 1.1 None+Asserts, LLVM 6.0.0, SPIR, SLEEF, DISTRO, POCL_DEBUG
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd
Platform Extensions function suffix POCL
Platform Name Clover
Number of devices 1
Device Name AMD CYPRESS (DRM 2.50.0 / 4.15.0-20-generic, LLVM 7.0.0)
Device Vendor AMD
Device Vendor ID 0x1002
Device Version OpenCL 1.1 Mesa 18.2.2
Driver Version 18.2.2
Device OpenCL C Version OpenCL C 1.1
Device Type GPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Max compute units 10
Max clock frequency 725MHz
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 64
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 0 / 0 (n/a)
float 4 / 4
double 2 / 2 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 32, Little-Endian
Global memory size 1073741824 (1024MiB)
Error Correction support No
Max memory allocation 751619276 (716.8MiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 32768 bits (4096 bytes)
Global Memory cache type None
Image support No
Local memory type Local
Local memory size 32768 (32KiB)
Max number of constant args 15
Max constant buffer size 751619276 (716.8MiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Profiling timer resolution 0ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64
Platform Name Portable Computing Language
Number of devices 1
Device Name pthread-Intel(R) Xeon(R) CPU X3470 @ 2.93GHz
Device Vendor GenuineIntel
Device Vendor ID 0x8086
Device Version OpenCL 1.2 pocl HSTR: pthread-x86_64-pc-linux-gnu-nehalem
Driver Version 1.1
Device OpenCL C Version OpenCL C 1.2 pocl
Device Type CPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
Max clock frequency 2926MHz
Device Partition (core)
Max number of sub-devices 8
Supported partition types equally, by counts
Max work item dimensions 3
Max work item sizes 4096x4096x4096
Max work group size 4096
Preferred work group size multiple 8
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 0 / 0 (n/a)
float 4 / 4
double 2 / 2 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 4655864832 (4.336GiB)
Error Correction support No
Max memory allocation 2147483648 (2GiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 8388608 (8MiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 8192x8192 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 128
Max number of write image args 128
Local memory type Global
Local memory size 4194304 (4MiB)
Max number of constant args 8
Max constant buffer size 4194304 (4MiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 1ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels Yes
SPIR versions 1.2
printf() buffer size 1048576 (1024KiB)
Built-in kernels
Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_spir cl_khr_fp64 cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) Clover
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [MESA]
clCreateContext(NULL, ...) [default] Success [MESA]
clCreateContext(NULL, ...) [other] Success [POCL]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) Success (1)
Platform Name Clover
Device Name AMD CYPRESS (DRM 2.50.0 / 4.15.0-20-generic, LLVM 7.0.0)
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) Success (1)
Platform Name Clover
Device Name AMD CYPRESS (DRM 2.50.0 / 4.15.0-20-generic, LLVM 7.0.0)
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) Success (1)
Platform Name Clover
Device Name AMD CYPRESS (DRM 2.50.0 / 4.15.0-20-generic, LLVM 7.0.0)
ICD loader properties
ICD loader Name OpenCL ICD Loader
ICD loader Vendor OCL Icd free software
ICD loader Version 2.2.11
ICD loader Profile OpenCL 2.1
htpc@htpc-linux:~$
Image support No
Image support No
Now is that the only thing missing on the list that would be required?
I just want to make sure before I go doing any more research only to discover "but wait, there's more!".
Been using SVP for quite some time but now I moved to Manjaro Linux and I have no clue what to do with the error that the deinterlaced filter for VLC is not available? Am I missing anything or is it just not possible at the time to use SVP and VLC?
it's not error but an information message meaning "deinterlace filter replaced with SVP's filter, you can't use deinterlacing now"
Really? Well, it doesn't sound like it: "SVP support ENABLED but the deinterlaced filter in VLC is not available any more." And it's not working btw. Appreciate the support though!
what exactly it sounds like?
And it's not working btw. Appreciate the support though!
What doesn't work: SVP or deinterlace? Deinterlace shouldn't but SVP should.
Has anyone tested the ROCm OpenCL runtime (https://github.com/RadeonOpenCompute/RO … CL-Runtime) and made it work with SVP? I installed it recently but SVP failed to detect my GPU as a supported device when using the ROCm runtime. I'm running on Manjaro Linux 18 .
12:59:17.662 [i]: Main: starting up SVP 4 Linux [4.3.0.157]...
12:59:17.663 [i]: Main: args: none
12:59:17.663 [i]: Main: working dir is /opt/svp/
12:59:17.664 [i]: Main: data dir set to /home/mikhail/.local/share/SVP4/
12:59:17.664 [i]: Settings: loading reg.cfg OK
12:59:17.664 [i]: Settings: loading main.cfg OK
12:59:17.665 [i]: Settings: loading ui.cfg OK
12:59:17.665 [i]: Settings: loading frc.cfg OK
12:59:17.672 [i]: Settings: loading profiles.cfg OK
12:59:17.672 [i]: Settings: loading custom.cfg OK
12:59:17.673 [i]: Settings: loading lights.cfg OK
12:59:17.673 [i]: Main: using Qt 5.12.1 (x86_64-little_endian-lp64 shared (dynamic) release build; by GCC 8.2.1 20181127)
12:59:17.673 [i]: Main: device scale is 1, user defined scale is 1
12:59:17.674 [i]: Main: system locale is [en]
12:59:17.674 [i]: Main: preferred language is [en-us]
12:59:17.676 [i]: Main: setting language file to en.qm...
12:59:17.680 [i]: Main: module 'plugins/libsvpflow1_vs64.so': 4.3.0.156
12:59:17.680 [i]: Main: module 'plugins/libsvpflow2_vs64.so': 4.3.0.155
12:59:17.682 [i]: Main: VLC filter (64 bit): 0.9.0.155
12:59:17.682 [i]: Main: running OpenCL info...
12:59:17.685 [i]: Main: collecting system information...
12:59:17.694 [i]: OS: Linux 5.0.2-2-MANJARO #1 SMP PREEMPT Thu Mar 14 07:54:42 UTC 2019 x86_64
12:59:17.729 [i]: Desktop environment: /usr/bin/sddm / /usr/share/xsessions/plasma / kde
12:59:17.732 [i]: CPU: AMD Ryzen 3 1200 Quad-Core [base frequency 3024 MHz, 4 threads]
12:59:17.732 [i]: Video: reading OpenCL info...
12:59:18.155 [i]: Video: 1 GPU OpenCL device(s) on AMD Accelerated Parallel Processing [OpenCL 2.1 AMD-APP (2766.4)] (Advanced Micro Devices, Inc.)
12:59:18.155 [i]: Video 1: device name 'Radeon 550 Series' (Advanced Micro Devices, Inc., ver.2766.4) [gpuID=11]: OK
12:59:18.155 [i]: Video: 1 GPU OpenCL device(s) on AMD Accelerated Parallel Processing [OpenCL 2.0 AMD-APP.internal (2814.0)] (Advanced Micro Devices, Inc.)
12:59:18.155 [i]: Video 1: device name 'Lexa PRO [Radeon RX 550/550X]' (Advanced Micro Devices, Inc., ver.2814.0 (HSA1.1,LC)) [gpuID=21]: NOT SUPPORTED
12:59:18.156 [i]: Video: 1 GPU OpenCL device(s) on AMD Accelerated Parallel Processing [OpenCL 2.0 AMD-APP.internal (2814.0)] (Advanced Micro Devices, Inc.)
12:59:18.156 [i]: Video 1: device name 'Lexa PRO [Radeon RX 550/550X]' (Advanced Micro Devices, Inc., ver.2814.0 (HSA1.1,LC)) [gpuID=31]: NOT SUPPORTED
12:59:18.157 [i]: Memory: 7992 MB total, 5662 MB free
12:59:18.157 [i]: System: finding network settings...
12:59:18.218 [i]: Screens: updating information, 1 screen(s) found
12:59:18.218 [i]: Screens: screen 0 (DisplayPort-0) - 1920x1080 @144.000 Hz, x1.0 [91 DPI]
12:59:18.218 [i]: Screens: primary screen is 0
12:59:18.393 [i]: Main: preparing video profiles...
12:59:18.602 [i]: Main: preparing performance graphs...
12:59:18.749 [i]: Main: preparing mpv...
12:59:18.770 [i]: Main: preparing remote control...
12:59:18.771 [i]: RemoteControl: started
12:59:18.771 [i]: Main: preparing main menu...
12:59:18.817 [i]: Main: loading extensions...
12:59:18.831 [i]: Extensions: found svpcast 1.0.0.156 ...
12:59:18.834 [i]: Extensions: found svplight 2.0.0.156 ...
12:59:18.836 [i]: Extensions: found svptube 2.1.0.155 ...
12:59:18.837 [i]: Settings: loading tube.cfg OK
12:59:18.846 [i]: Tube: initializing youtube_dl...
12:59:18.962 [i]: Main: initialization completed in 1278 ms
12:59:19.043 [i]: Updates: checking now...
12:59:19.762 [i]: Tube: youtube_dl ready
12:59:22.120 [i]: Performance: quick estimation = 583 (previous value was 527)
This is the my clinfo output. I have installed 2 OpenCL runtime from AMD:
1) OpenCL driver extracted from the proprietary AMDGPU-PRO driver (https://aur.archlinux.org/packages/opencl-amd/) - The one I can only use
2) OpenCL driver provided by ROCm OpenCL runtime ((https://github.com/RadeonOpenCompute/RO … CL-Runtime) - The one I'm trying to use
[mikhail@MS-7A39 ~]$ clinfo
Number of platforms 3
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.1 AMD-APP (2766.4)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Host timer resolution 1ns
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.0 AMD-APP.internal (2814.0)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_object_metadata cl_amd_event_callback
Platform Max metadata object keys (AMD) 8
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.0 AMD-APP.internal (2814.0)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_object_metadata cl_amd_event_callback
Platform Max metadata object keys (AMD) 8
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name gfx804
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 1.2 AMD-APP (2766.4)
Driver Version 2766.4
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Board Name (AMD) Radeon 550 Series
Device Topology (AMD) PCI-E, 1f:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 1206MHz
Graphics IP (AMD) 8.0
Device Partition (core)
Max number of sub-devices 8
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple 64
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (cl_khr_fp16)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs No
Round to nearest No
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 1817083904 (1.692GiB)
Global free memory (AMD) 1752916 (1.672GiB)
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 16
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 1345937817 (1.254GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 2048 bits (256 bytes)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 2048 images
Base address alignment for 2D image buffers 256 bytes
Pitch alignment for 2D image buffers 256 pixels
Max 2D image size 16384x16384 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 128
Max number of write image args 8
Local memory type Local
Local memory size 32768 (32KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 1345937817 (1.254GiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 1552793083099585780ns (Sun Mar 17 11:24:43 2019)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) Yes
Number of async queues (AMD) 2
Max real-time compute queues (AMD) 0
Max real-time compute units (AMD) 0
SPIR versions 1.2
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_amd_bus_addressable_memory cl_khr_spir cl_khr_gl_event
Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name gfx803
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 1.2
Driver Version 2814.0 (HSA1.1,LC)
Device OpenCL C Version OpenCL C 2.0
Device Type GPU
Device Board Name (AMD) Lexa PRO [Radeon RX 550/550X]
Device Topology (AMD) PCI-E, 1f:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 1206MHz
Graphics IP (AMD) 8.3
Device Partition (core)
Max number of sub-devices 8
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple 64
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (cl_khr_fp16)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs No
Round to nearest No
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 2147483648 (2GiB)
Global free memory (AMD) 2095104 (1.998GiB)
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 4
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 1825361100 (1.7GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support No
Local memory type Local
Local memory size 65536 (64KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 1825361100 (1.7GiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Number of P2P devices (AMD) 0
P2P devices (AMD) (n/a)
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 0ns (Thu Jan 1 07:30:00 1970)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) No
Number of async queues (AMD) 8
Max real-time compute queues (AMD) 8
Max real-time compute units (AMD) 8
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_media_ops cl_amd_media_ops2 cl_khr_subgroups cl_khr_depth_images cl_amd_copy_buffer_p2p cl_amd_assembly_program
Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name gfx803
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 1.2
Driver Version 2814.0 (HSA1.1,LC)
Device OpenCL C Version OpenCL C 2.0
Device Type GPU
Device Board Name (AMD) Lexa PRO [Radeon RX 550/550X]
Device Topology (AMD) PCI-E, 1f:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 1206MHz
Graphics IP (AMD) 8.3
Device Partition (core)
Max number of sub-devices 8
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple 64
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (cl_khr_fp16)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs No
Round to nearest No
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 2147483648 (2GiB)
Global free memory (AMD) 2095104 (1.998GiB)
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 4
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 1825361100 (1.7GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support No
Local memory type Local
Local memory size 65536 (64KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 1825361100 (1.7GiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Number of P2P devices (AMD) 0
P2P devices (AMD) (n/a)
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 0ns (Thu Jan 1 07:30:00 1970)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) No
Number of async queues (AMD) 8
Max real-time compute queues (AMD) 8
Max real-time compute units (AMD) 8
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_media_ops cl_amd_media_ops2 cl_khr_subgroups cl_khr_depth_images cl_amd_copy_buffer_p2p cl_amd_assembly_program
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) AMD Accelerated Parallel Processing
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [AMD]
clCreateContext(NULL, ...) [default] Success [AMD]
clCreateContext(NULL, ...) [other] Success [AMD]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) Success (1)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx804
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) Success (1)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx804
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) Success (1)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx804
ICD loader properties
ICD loader Name OpenCL ICD Loader
ICD loader Vendor OCL Icd free software
ICD loader Version 2.2.12
ICD loader Profile OpenCL 2.2
SVP doesn't want to init or benchmark any of my GPUs. Mostly vanilla Arch. Using intel and amdgpu drivers.
[vapoursynth] Script evaluation failed:
[vapoursynth] Python exception: SVSmoothFps: unable to init GPU-based renderer - code 327674
[vapoursynth]
[vapoursynth] Traceback (most recent call last):
[vapoursynth] File "src/cython/vapoursynth.pyx", line 1927, in vapoursynth.vpy_evaluateScript
[vapoursynth] File "src/cython/vapoursynth.pyx", line 1928, in vapoursynth.vpy_evaluateScript
[vapoursynth] File "/home/eric/.local/share/SVP4/scripts/a5c37452.py", line 52, in <module>
[vapoursynth] smooth = interpolate(clip)
[vapoursynth] File "/home/eric/.local/share/SVP4/scripts/a5c37452.py", line 35, in interpolate
[vapoursynth] smooth = core.svp2.SmoothFps(input_m,super["clip"],super["data"],vectors["clip"],vectors["data"],smoothfps_params,src=input_um,fps=src_fps)
[vapoursynth] File "src/cython/vapoursynth.pyx", line 1833, in vapoursynth.Function.__call__
[vapoursynth] vapoursynth.Error: SVSmoothFps: unable to init GPU-based renderer - code 327674
[vapoursynth]
[vapoursynth] could not init VS
Disabling filter vapoursynth.00 because it has failed.
21:51:56.291 [i]: Main: starting up SVP 4 Linux [4.3.0.157]...
21:51:56.292 [i]: Main: args: none
21:51:56.292 [i]: Main: working dir is /opt/svp/
21:51:56.293 [i]: Main: data dir set to /home/eric/.local/share/SVP4/
21:51:56.293 [i]: Settings: loading reg.cfg OK
21:51:56.294 [i]: Settings: loading main.cfg OK
21:51:56.294 [i]: Settings: loading ui.cfg OK
21:51:56.294 [i]: Settings: loading frc.cfg OK
21:51:56.295 [i]: Settings: loading profiles.cfg OK
21:51:56.295 [i]: Settings: loading custom.cfg OK
21:51:56.295 [i]: Settings: loading lights.cfg OK
21:51:56.296 [i]: Main: using Qt 5.12.2 (x86_64-little_endian-lp64 shared (dynamic) release build; by GCC 8.2.1 20181127)
21:51:56.296 [i]: Main: device scale is 1, user defined scale is 0
21:51:56.297 [i]: Main: system locale is [en]
21:51:56.297 [i]: Main: preferred language is [en-us]
21:51:56.299 [i]: Main: setting language file to en.qm...
21:51:56.303 [i]: Main: module 'plugins/libsvpflow1_vs64.so': 4.3.0.156
21:51:56.304 [i]: Main: module 'plugins/libsvpflow2_vs64.so': 4.3.0.155
21:51:56.306 [i]: Main: VLC filter (64 bit): 0.9.0.155
21:51:56.306 [i]: Main: running OpenCL info...
21:51:56.313 [i]: Main: collecting system information...
21:51:56.318 [i]: OS: Linux 5.0.0-arch1-1-ARCH #1 SMP PREEMPT Mon Mar 4 14:11:43 UTC 2019 x86_64
21:51:56.345 [i]: Desktop environment: /usr/bin/gdm / gnome
21:51:56.351 [i]: CPU: Intel Core i7-3632QM [base frequency 2552 MHz, 8 threads]
21:51:56.352 [i]: Video: reading OpenCL info...
21:51:56.743 [i]: Video: 1 GPU OpenCL device(s) on Intel Gen OCL Driver [OpenCL 2.0 beignet 1.4 (git-fc5f430c)] (Intel)
21:51:56.746 [i]: Video 1: device name 'Intel(R) HD Graphics IvyBridge M GT2' (Intel, ver.1.4) [gpuID=11]: OK
21:51:56.746 [i]: Video: 0 GPU OpenCL device(s) on Intel(R) OpenCL [OpenCL 1.2] (Intel(R) Corporation)
21:51:56.746 [i]: Video: 1 GPU OpenCL device(s) on Clover [OpenCL 1.1 Mesa 19.0.0] (Mesa)
21:51:56.746 [i]: Video 1: device name 'AMD Radeon HD 7700M Series (VERDE, DRM 3.27.0, 5.0.0-arch1-1-ARCH, LLVM 7.0.1)' (AMD, ver.19.0.0) [gpuID=31]: NOT SUPPORTED
21:51:56.746 [i]: Video: 1 GPU OpenCL device(s) on AMD Accelerated Parallel Processing [OpenCL 2.1 AMD-APP (2766.4)] (Advanced Micro Devices, Inc.)
21:51:56.746 [i]: Video 1: device name 'AMD Radeon HD 7700M Series' (Advanced Micro Devices, Inc., ver.2766.4) [gpuID=41]: OK
21:51:56.748 [i]: Memory: 7844 MB total, 4907 MB free
21:51:56.748 [i]: System: finding network settings...
21:51:56.773 [i]: Screens: updating information, 2 screen(s) found
21:51:56.773 [i]: Screens: screen 0 (HDMI1) - 2048x1152 @60.000 Hz, x1.0 [101 DPI]
21:51:56.773 [i]: Screens: screen 1 (LVDS1) - 1920x1080 @60.000 Hz, x1.0 [143 DPI]
21:51:56.773 [i]: Screens: primary screen is 0
21:51:56.807 [i]: Main: preparing video profiles...
21:51:56.875 [i]: Main: preparing performance graphs...
21:51:56.928 [i]: Main: preparing mpv...
21:51:56.950 [i]: Main: preparing remote control...
21:51:56.950 [i]: RemoteControl: started
21:51:56.951 [i]: Main: preparing main menu...
21:51:56.981 [i]: Main: loading extensions...
21:51:56.987 [i]: Extensions: found svpcast 1.0.0.156 ...
21:51:56.988 [i]: Settings: loading cast.cfg OK
21:51:56.996 [i]: Streaming: Web UI started at http://192.168.1.180:9900
21:51:57.013 [i]: Extensions: found svplight 2.0.0.156 ...
21:51:57.014 [i]: Settings: loading leds.cfg OK
21:51:57.055 [i]: Streaming: found Chromecast [Eureka Dongle] at 192.168.1.175
21:51:57.075 [i]: Main: initialization completed in 764 ms
21:51:57.135 [i]: Updates: checking now...
21:51:59.830 [i]: Performance: quick estimation = 468 (previous value was 487)
21:52:13.664 [i]: Performance: motion vectors estimation = 1876
21:52:24.179 [i]: Performance: CPU-based frame rendering = 2836
21:52:24.270 [W]: Video [gpuID=11] initialization failed
21:52:24.270 [W]: Performance: GPU-based frame rendering [gpuID=11] FAILED
21:52:26.034 [W]: Video [gpuID=41] initialization failed
21:52:26.034 [W]: Performance: GPU-based frame rendering [gpuID=41] FAILED
Number of platforms 4
Platform Name Intel Gen OCL Driver
Platform Vendor Intel
Platform Version OpenCL 2.0 beignet 1.4 (git-fc5f430c)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_intel_required_subgroup_size cl_intel_media_block_io cl_intel_planar_yuv cl_khr_gl_sharing
Platform Extensions function suffix Intel
Platform Name Intel(R) OpenCL
Platform Vendor Intel(R) Corporation
Platform Version OpenCL 1.2
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_3d_image_writes cl_intel_exec_by_local_thread cl_khr_spir cl_khr_fp64
Platform Extensions function suffix INTEL
Platform Name Clover
Platform Vendor Mesa
Platform Version OpenCL 1.1 Mesa 19.0.0
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd
Platform Extensions function suffix MESA
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.1 AMD-APP (2766.4)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Host timer resolution 1ns
Platform Extensions function suffix AMD
Platform Name Intel Gen OCL Driver
Number of devices 1
Device Name Intel(R) HD Graphics IvyBridge M GT2
Device Vendor Intel
Device Vendor ID 0x8086
Device Version OpenCL 1.2 beignet 1.4 (git-fc5f430c)
Driver Version 1.4
Device OpenCL C Version OpenCL C 1.2 beignet 1.4 (git-fc5f430c)
Device Type GPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 16
Max clock frequency 1000MHz
Device Partition (core)
Max number of sub-devices 1
Supported partition types None, None, None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 512x512x512
Max work group size 512
Preferred work group size multiple 16
Sub-group sizes (Intel) 8, 16
Preferred / native vector sizes
char 16 / 8
short 8 / 8
int 4 / 4
long 2 / 2
half 0 / 8 (n/a)
float 4 / 4
double 0 / 2 (n/a)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (n/a)
Address bits 32, Little-Endian
Global memory size 2147483648 (2GiB)
Error Correction support No
Max memory allocation 1610612736 (1.5GiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 8192 (8KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 65536 pixels
Max 1D or 2D image array size 2048 images
Base address alignment for 2D image buffers 4096 bytes
Pitch alignment for 2D image buffers 1 pixels
Max 2D image size 8192x8192 pixels
Max planar YUV image size 8192x8192 pixels
Max 3D image size 8192x8192x2048 pixels
Max number of read image args 128
Max number of write image args 8
Local memory type Local
Local memory size 65536 (64KiB)
Max number of constant args 8
Max constant buffer size 134217728 (128MiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 80ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels Yes
SPIR versions 1.2
printf() buffer size 1048576 (1024KiB)
Built-in kernels __cl_copy_region_align4;__cl_copy_region_align16;__cl_copy_region_unalign_same_offset;__cl_copy_region_unalign_dst_offset;__cl_copy_region_unalign_src_offset;__cl_copy_buffer_rect;__cl_copy_buffer_rect_align4;__cl_copy_image_1d_to_1d;__cl_copy_image_2d_to_2d;__cl_copy_image_3d_to_2d;__cl_copy_image_2d_to_3d;__cl_copy_image_3d_to_3d;__cl_copy_image_2d_to_buffer;__cl_copy_image_2d_to_buffer_align4;__cl_copy_image_2d_to_buffer_align16;__cl_copy_image_3d_to_buffer;__cl_copy_image_3d_to_buffer_align4;__cl_copy_image_3d_to_buffer_align16;__cl_copy_buffer_to_image_2d;__cl_copy_buffer_to_image_2d_align4;__cl_copy_buffer_to_image_2d_align16;__cl_copy_buffer_to_image_3d;__cl_copy_buffer_to_image_3d_align4;__cl_copy_buffer_to_image_3d_align16;__cl_copy_image_1d_array_to_1d_array;__cl_copy_image_2d_array_to_2d_array;__cl_copy_image_2d_array_to_2d;__cl_copy_image_2d_array_to_3d;__cl_copy_image_2d_to_2d_array;__cl_copy_image_3d_to_2d_array;__cl_fill_region_unalign;__cl_fill_region_align2;__cl_fill_region_align4;__cl_fill_region_align8_2;__cl_fill_region_align8_4;__cl_fill_region_align8_8;__cl_fill_region_align8_16;__cl_fill_region_align128;__cl_fill_image_1d;__cl_fill_image_1d_array;__cl_fill_image_2d;__cl_fill_image_2d_array;__cl_fill_image_3d;block_motion_estimate_intel;
Device Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_3d_image_writes cl_khr_image2d_from_buffer cl_khr_depth_images cl_khr_spir cl_khr_icd cl_intel_accelerator cl_intel_subgroups cl_intel_subgroups_short cl_intel_required_subgroup_size cl_intel_media_block_io cl_intel_planar_yuv cl_khr_gl_sharing cl_intel_motion_estimation
Platform Name Intel(R) OpenCL
Number of devices 1
Device Name Intel(R) Core(TM) i7-3632QM CPU @ 2.20GHz
Device Vendor Intel(R) Corporation
Device Vendor ID 0x8086
Device Version OpenCL 1.2 (Build 475)
Driver Version 1.2.0.475
Device OpenCL C Version OpenCL C 1.2
Device Type CPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
Max clock frequency 2200MHz
Device Partition (core)
Max number of sub-devices 8
Supported partition types by counts, equally, by names (Intel)
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 8192x8192x8192
Max work group size 8192
Preferred work group size multiple 128
Preferred / native vector sizes
char 1 / 16
short 1 / 8
int 1 / 4
long 1 / 2
half 0 / 0 (n/a)
float 1 / 8
double 1 / 4 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 8225779712 (7.661GiB)
Error Correction support No
Max memory allocation 2056444928 (1.915GiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 262144 (256KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 480
Max size for 1D images from buffer 128527808 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 16384x16384 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 480
Max number of write image args 480
Local memory type Global
Local memory size 32768 (32KiB)
Max number of constant args 480
Max constant buffer size 131072 (128KiB)
Max size of kernel argument 3840 (3.75KiB)
Queue properties
Out-of-order execution Yes
Profiling Yes
Local thread execution (Intel) Yes
Prefer user sync for interop No
Profiling timer resolution 1ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels Yes
SPIR versions 1.2
printf() buffer size 1048576 (1024KiB)
Built-in kernels (n/a)
Device Extensions cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_3d_image_writes cl_intel_exec_by_local_thread cl_khr_spir cl_khr_fp64
Platform Name Clover
Number of devices 1
Device Name AMD Radeon HD 7700M Series (VERDE, DRM 3.27.0, 5.0.0-arch1-1-ARCH, LLVM 7.0.1)
Device Vendor AMD
Device Vendor ID 0x1002
Device Version OpenCL 1.1 Mesa 19.0.0
Driver Version 19.0.0
Device OpenCL C Version OpenCL C 1.1
Device Type GPU
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Max compute units 8
Max clock frequency 575MHz
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 64
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 8 / 8 (cl_khr_fp16)
float 4 / 4
double 2 / 2 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 3221225472 (3GiB)
Error Correction support No
Max memory allocation 1717986918 (1.6GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 32768 bits (4096 bytes)
Global Memory cache type None
Image support No
Local memory type Local
Local memory size 32768 (32KiB)
Max number of constant args 16
Max constant buffer size 1717986918 (1.6GiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Profiling timer resolution 0ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Device Extensions cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_fp64 cl_khr_fp16
Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name Capeverde
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 1.2 AMD-APP (2766.4)
Driver Version 2766.4
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Board Name (AMD) AMD Radeon HD 7700M Series
Device Topology (AMD) PCI-E, 01:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 4
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 575MHz
Graphics IP (AMD) 6.0
Device Partition (core)
Max number of sub-devices 4
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple 64
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (n/a)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 32, Little-Endian
Global memory size 2142801920 (1.996GiB)
Global free memory (AMD) <printDeviceInfo:78: get number of CL_DEVICE_GLOBAL_FREE_MEMORY_AMD : error -33>
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 8
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 1596787097 (1.487GiB)
Unified memory for Host and Device No
Minimum alignment for any data type 128 bytes
Alignment of base address 2048 bits (256 bytes)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 2048 images
Base address alignment for 2D image buffers 256 bytes
Pitch alignment for 2D image buffers 256 pixels
Max 2D image size 16384x16384 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 128
Max number of write image args 8
Local memory type Local
Local memory size 32768 (32KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 65536 (64KiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 1553301795496722366ns (Fri Mar 22 20:43:15 2019)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) No
Number of async queues (AMD) 2
Max real-time compute queues (AMD) 0
Max real-time compute units (AMD) 0
SPIR versions 1.2
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_amd_bus_addressable_memory cl_khr_spir cl_khr_gl_event
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
clCreateContext(NULL, ...) [default] No platform
clCreateContext(NULL, ...) [other] Success [Intel]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform
SmoothVideo Project → Using SVP → SVP 4 Linux
Powered by PunBB, supported by Informer Technologies, Inc.