Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mistralrs-bench #166

Merged
merged 20 commits into from
Apr 18, 2024
Merged

mistralrs-bench #166

merged 20 commits into from
Apr 18, 2024

Conversation

lucasavila00
Copy link
Contributor

@lucasavila00 lucasavila00 commented Apr 17, 2024

Closes #164

llama.cpp

$ ./llama-bench -m /home/lucas/.cache/huggingface/hub/models--TheBloke--Mistral-7B-Instruct-v0.1-GGUF/snapshots/731a9fc8f06f5f5e2db8a0cf9d256197eb6e05d1/mistral-7b-instruct-v0.1.Q4_K_M.gguf 
| model                          |       size |     params | backend    | ngl | test       |              t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ---------- | ---------------: |
| llama 7B Q4_K - Medium         |   4.07 GiB |     7.24 B | CUDA       |  99 | pp 512     |  1682.64 ± 58.17 |
| llama 7B Q4_K - Medium         |   4.07 GiB |     7.24 B | CUDA       |  99 | tg 128     |     64.46 ± 1.25 |

mistral.rs

$ cargo run --release --features cuda
+----------------------------------------+---------+--------+----------+-----------+-------------+--------------+
| model                                  | backend | test   | t/s      | ms/t      | concurrency | throughput/s |
+----------------------------------------+---------+--------+----------+-----------+-------------+--------------+
| TheBloke/Mistral-7B-Instruct-v0.1-GGUF | CUDA    | tg 128 | 59.78263 | 16.727266 |           1 |     59.78263 |
+----------------------------------------+---------+--------+----------+-----------+-------------+--------------+
| TheBloke/Mistral-7B-Instruct-v0.1-GGUF | CUDA    | pp 512 | 654.6095 | 1.5276283 |           1 |     654.6095 |
+----------------------------------------+---------+--------+----------+-----------+-------------+--------------+

Copy link

github-actions bot commented Apr 17, 2024

Code Metrics Report
  ───────────────────────────────────────────────────────────────────────────────
Language                 Files     Lines   Blanks  Comments     Code Complexity
───────────────────────────────────────────────────────────────────────────────
Rust                        61     20225     1446       860    17919       1157
───────────────────────────────────────────────────────────────────────────────
Total                       61     20225     1446       860    17919       1157
───────────────────────────────────────────────────────────────────────────────
Estimated Cost to Develop 59,209
Estimated Schedule Effort 11.027891 months
Estimated People Required 4.505036
───────────────────────────────────────────────────────────────────────────────
Processed 685030 bytes, 0.685 megabytes (SI)
───────────────────────────────────────────────────────────────────────────────
  

@lucasavila00 lucasavila00 marked this pull request as ready for review April 17, 2024 03:38
mistralrs-bench/src/main.rs Outdated Show resolved Hide resolved
@EricLBuehler EricLBuehler added the new feature New feature or request label Apr 17, 2024
@lucasavila00
Copy link
Contributor Author

@EricLBuehler I made model an argument of the benchmark, by moving that part to core.

I would like to merge this ASAP.

The PR is too big already.

There's a lot to improve, but I'd prefer to make the improvements in further smaller PRs

@EricLBuehler EricLBuehler merged commit 4143f77 into EricLBuehler:master Apr 18, 2024
11 checks passed
@EricLBuehler
Copy link
Owner

Great, thanks for adding this,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new feature New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Infra: Create mistralrs-bench
2 participants