Home

Welcome to the vs-mlrt wiki!

The goal of the project is to provide highly optimized AI inference runtimes for VapourSynth.

Runtimes

vs-ov: OpenVINO Pure CPU AI Inference Runtime
vs-ort: ONNX Runtime based CPU/CUDA AI Inference Runtime
vs-trt: TensorRT based CUDA AI Inference Runtime

Models

The following models are available:

waifu2x
DPIR

Benchmarking

DPIR

Model	Input Size	Speed (fps)	VRAM Usage (MB)	Backend
drunet_gray	1920x1080	2.45	5188	ort-cuda[1]
drunet_gray	1920x1080	5.20	3018	ort-cuda[1], fp16
drunet_gray	1920x1080	2.59	3979	trt[1]
drunet_gray	1920x1080	8.09	2831	trt[1], fp16
drunet_gray	1920x1080	2.27	11552	pytorch[2]
drunet_gray	1920x1080	5.10	11024	pytorch[2], fp16
drunet_gray	1920x1080	2.45	14853	pytorch[2], trt
drunet_gray	1920x1080	6.90	13565	pytorch[2], trt, fp16
drunet_gray	1920x1080	2.34	5791	ort-cuda[9]
drunet_gray	1920x1080	3.73	3621	ort-cuda[9], fp16
drunet_gray	1920x1080	2.75	4015	trt[9]
drunet_gray	1920x1080	6.67	3437	trt[9], fp16
drunet_gray	1920x1080	2.20	11837	pytorch[10]
drunet_gray	1920x1080	3.72	11583	pytorch[10], fp16
drunet_gray	1920x1080	2.67	4189	pytorch[10], trt
drunet_gray	1920x1080	6.17	4079	pytorch[10], trt, fp16
drunet_color	1920x1080	2.39	5220	ort-cuda[1]
drunet_color	1920x1080	4.95	3058	ort-cuda[1], fp16
drunet_color	1920x1080	2.51	4011	trt[1]
drunet_color	1920x1080	7.54	2863	trt[1], fp16
drunet_color	1920x1080	2.56	7187	trt[1], 2 streams
drunet_color	1920x1080	7.70	4135	trt[1], fp16, 2 streams
drunet_color	1920x1080	2.12	11558	pytorch[2]
drunet_color	1920x1080	4.29	11302	pytorch[2], fp16
drunet_color	1920x1080	2.26	14879	pytorch[2], trt
drunet_color	1920x1080	5.60	13575	pytorch[2], trt, fp16
drunet_color	1920x1080	2.29	5823	ort-cuda[9]
drunet_color	1920x1080	3.65	3661	ort-cuda[9], fp16
drunet_color	1920x1080	2.73	4075	trt[9]
drunet_color	1920x1080	6.26	3423	trt[9], fp16
drunet_color	1920x1080	2.12	11853	pytorch[10]
drunet_color	1920x1080	3.45	11597	pytorch[10], fp16
drunet_color	1920x1080	2.54	4209	pytorch[10], trt
drunet_color	1920x1080	5.25	4103	pytorch[10], trt, fp16

Waifu2x

Model	Input Size	Speed (fps)	Device RAM Usage (MB)	Backend
upconv7	1920x1080	5.98	5065	ort-cuda[1]
upconv7	1920x1080	10.4	5189	ort-cuda[1], fp16
upconv7	1920x1080	6.60	5033	trt[1]
upconv7	1920x1080	13.8	3041	trt[1], fp16
upconv7	1920x1080	8.43	9253	trt[1], 2 streams
upconv7	1920x1080	26.2	5253	trt[1], fp16, 2 streams
upconv7	1920x1080	5.66	3355	ort-cuda[1], 540p patch
upconv7	1920x1080	1.63	3248	caffe[3], 540p patch
upconv7	1920x1080	1.14	15547	ov-cpu[4]
upconv7	1920x1080	0.37	8612	ov-cpu[5]
upconv7	1920x1080	6.94	9765	ort-cuda[9]
upconv7	1920x1080	9.66	6049	ort-cuda[9], fp16
upconv7	1920x1080	7.83	5511	trt[9]
upconv7	1920x1080	16.1	3501	trt[9], fp16
upconv7	1920x1080	8.61	9731	trt[9], 2 streams
upconv7	1920x1080	19.9	5701	trt[9], fp16, 2 streams
upresnet10	1920x1080	4.36	5061	ort-cuda[1]
upresnet10	1920x1080	6.43	5059	ort-cuda[1], fp16
upresnet10	1920x1080	4.27	1879	ort-cuda[1], 540p patch
upresnet10	1920x1080	1.54	7232	caffe[3], 540p patch
upresnet10	1920x1080	1.27	7245	ov-cpu[4]
upresnet10	1920x1080	0.44	7143	ov-cpu[5]
upresnet10	1920x1080	3.90	5665	ort-cuda[9]
upresnet10	1920x1080	6.53	5663	ort-cuda[9], fp16
cunet	1920x1080	2.58	9155	ort-cuda[1]
cunet	1920x1080	4.10	9535	ort-cuda[1], fp16
cunet	1920x1080	2.48	4955	ort-cuda[1], 540p patch
cunet	1920x1080	1.11	11657	caffe[3], 540p patch
cunet	1920x1080	0.57	10943	ov-cpu[4]
cunet	1920x1080	0.23	10943	ov-cpu[5]
cunet	1920x1080	2.17	18469	ort-cuda[9]
cunet	1920x1080	3.22	10017	ort-cuda[9], fp16
anime rgb	1920x1080	0.62	15578	ov-cpu[4]
anime rgb	1920x1080	0.21	15439	ov-cpu[5]
anime rgb	1920x1080	0.048	1145	w2xc[6]
anime rgb	1920x1080	0.039	1183	w2xc[7]

RealESRGANv2

Model	Input Size	Speed (fps)	Device RAM Usage (MB)	Backend
animevideo-xsx2	1920x1080	5.27	2213	ort-cuda[1]
animevideo-xsx2	1920x1080	6.07	1835	trt[1]
animevideo-xsx2	1920x1080	11.8	1697	trt[1], fp16
animevideo-xsx2	1920x1080	6.86	2999	trt[1], 2 streams
animevideo-xsx2	1920x1080	15.5	2539	trt[1], fp16, 2 streams
animevideo-xsx2	1920x1080	3.64	6799	pytorch[2]
animevideo-xsx2	1920x1080	4.72	4291	pytorch[2], fp16
animevideo-xsx2	1920x1080	1.48	5239	ov-cpu[4]
animevideo-xsx2	1920x1080	0.42	5201	ov-cpu[5]
animevideo-xsx2	1920x1080	0.064	2883	pytorch[8]
animevideo-xsx2	1920x1080	4.15	2817	ort-cuda[9]
animevideo-xsx2	1920x1080	4.97	1965	trt[9]
animevideo-xsx2	1920x1080	11.5	1881	trt[9], fp16
animevideo-xsx2	1920x1080	3.47	7075	pytorch[10]
animevideo-xsx2	1920x1080	4.90	4585	pytorch[10], fp16

[1] VapourSynth R57, Tesla V100, Windows Server 2019, Graphics Driver 511.23, vs-mlrt v6
[2] VapourSynth R57, Tesla V100, Windows Server 2019, Graphics Driver 511.23, vs-dpir v1.7.1, vs-realesrgan v2.0.0, PyTorch 1.10.1+cu113, TensorRT 8.2.2, torch2trt 2732b35
[3] VapourSynth R57, Tesla V100, Windows Server 2019, Graphics Driver 511.23, VapourSynth-Waifu2x-caffe r14
[4] VapourSynth R57, Icelake Server 32C64T @2.90 GHz, Windows Server 2019, vs-mlrt v5
[5] VapourSynth R57, EPYC Milan 16C32T @2.55 GHz, Windows Server 2019, vs-mlrt v5
[6] VapourSynth R57, Icelake Server 32C64T @2.90 GHz, Windows Server 2019, VapourSynth-Waifu2x-w2xc r8
[7] VapourSynth R57, EPYC Milan 16C32T @2.55 GHz, Windows Server 2019, VapourSynth-Waifu2x-w2xc r8
[8] VapourSynth R57, Icelake Server 32C64T @2.90 GHz, Windows Server 2019, vs-dpir v1.7.1, PyTorch 1.10.1, NumPy 1.21.5+mkl
[9] VapourSynth R57, Tesla A10, Windows Server 2019, Graphics Driver 511.23, vs-mlrt v6
[10] VapourSynth R57, Tesla A10, Windows Server 2019, Graphics Driver 511.23, vs-dpir v1.7.1, vs-realesrgan v2.0.0, PyTorch 1.10.1+cu113, TensorRT 8.2.2, torch2trt 2732b35
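
When comparing backends, it can help to reduce pairs of rows to a single ratio. The following is a minimal sketch, not part of vs-mlrt, that computes the fp16 speedup and the relative memory use for a few rows copied from the DPIR table above. The helper name `fp16_ratios` and the row layout are assumptions made for this illustration only.

```python
# Rows copied from the DPIR table: (model, backend, fps, memory in MB).
rows = [
    ("drunet_gray", "ort-cuda[1]", 2.45, 5188),
    ("drunet_gray", "ort-cuda[1], fp16", 5.20, 3018),
    ("drunet_gray", "trt[1]", 2.59, 3979),
    ("drunet_gray", "trt[1], fp16", 8.09, 2831),
]


def fp16_ratios(rows):
    """Hypothetical helper: pair each fp32 row with its ", fp16" row.

    Returns {(model, backend): (speedup, memory_ratio)} where
    speedup = fps_fp16 / fps_fp32 and memory_ratio = mb_fp16 / mb_fp32.
    """
    table = {(model, backend): (fps, mb) for model, backend, fps, mb in rows}
    ratios = {}
    for (model, backend), (fps32, mb32) in table.items():
        fp16_key = (model, backend + ", fp16")
        if fp16_key in table:
            fps16, mb16 = table[fp16_key]
            ratios[(model, backend)] = (fps16 / fps32, mb16 / mb32)
    return ratios


ratios = fp16_ratios(rows)
for (model, backend), (speedup, mem) in ratios.items():
    print(f"{model} {backend}: fp16 speedup {speedup:.2f}x, memory {mem:.0%} of fp32")
```

Only rows measured on the same machine (the same bracketed reference number) should be paired this way, since the references denote different hardware and software versions.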
