Skip to content

Commit

Permalink
fix broken llm deployment link (#3214)
Browse files Browse the repository at this point in the history
wasn't rendering correctly

Co-authored-by: Ankith Gunapal <agunapal@ischool.Berkeley.edu>
  • Loading branch information
msaroufim and agunapal committed Jul 2, 2024
1 parent f80e906 commit b4bbc29
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -67,7 +67,7 @@ docker run --rm -ti --gpus all -e HUGGING_FACE_HUB_TOKEN=$token -p 8080:8080 -v
curl -X POST -d '{"prompt":"Hello, my name is", "max_new_tokens": 50}' --header "Content-Type: application/json" "http://localhost:8080/predictions/model"
```

Refer to [LLM deployment][docs/llm_deployment.md] for details and other methods.
Refer to [LLM deployment](docs/llm_deployment.md) for details and other methods.

## ⚡ Why TorchServe
* Write once, run anywhere, on-prem, on-cloud, supports inference on CPUs, GPUs, AWS Inf1/Inf2/Trn1, Google Cloud TPUs, [Nvidia MPS](docs/nvidia_mps.md)
Expand Down

0 comments on commit b4bbc29

Please sign in to comment.