A tool for sending concurrent requests to a server and observing streamed outputs.
```
Usage: main [OPTIONS] <URL> <PAYLOAD_FILE>

Arguments:
  <URL>           URL of the server
  <PAYLOAD_FILE>  Path to the payload file

Options:
  -u, --vus <VUS>                Number of virtual users [default: 1]
  -d, --duration <DURATION>      Duration of the test [default: 30]
  -s, --spawn-rate <SPAWN_RATE>  Spawn rate of virtual users [default: 1]
  -h, --help                     Print help
  -V, --version                  Print version
```
Example: send requests to a llama.cpp server with 5 virtual users:

```shell
cargo run -r -- http://localhost:8080/completion examples/llamacpp.jsonl -u 5
```
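The `.jsonl` extension suggests one JSON request body per line, each sent as a request to the target URL. A minimal sketch of what such a payload file could look like for llama.cpp's `/completion` endpoint (the `prompt`, `n_predict`, and `stream` fields follow that endpoint's API; the actual contents of the bundled `examples/llamacpp.jsonl` are an assumption here):

```jsonl
{"prompt": "Hello, my name is", "n_predict": 64, "stream": true}
{"prompt": "The capital of France is", "n_predict": 64, "stream": true}
```

With `"stream": true`, the server returns tokens incrementally, which is what lets the tool observe streamed outputs under load.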
- llama.cpp server configuration 1:

  ```shell
  ./server -m tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf
  ```

  a1.mov
- llama.cpp server configuration 2 (`-np 10` allows 10 parallel slots):

  ```shell
  ./server -m tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf -np 10
  ```

  a2.mov
- llama.cpp server configuration 3 (`-cb` additionally enables continuous batching):

  ```shell
  ./server -m tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf -np 10 -cb
  ```

  a3.mov