add support for nvidia/llama-3.2-nv-embedqa-1b-v2 and nvidia/llama-3.2-nv-rerankqa-1b-v2 #511
| Job | Run time |
|---|---|
| | 4s |
| | 11s |
| | 10s |
| | 13s |
| | 19s |
| | 16s |
| | 13s |
| | 15s |
| | 10s |
| | 21s |
| | 16s |
| | 1s |
| Total | 2m 29s |