Skip to content

Weight Prefetch

Weight prefetch streams model weights to Parel storage before a deployment exists. When you create the deployment later, cached weights can shorten cold start time.

Terminal window
curl https://api.parel.cloud/v1/deployments/prefetch \
-H "Authorization: Bearer pk-dev-YOUR_KEY" \
-H "Content-Type: application/json" \
-d '{"huggingface_id": "Qwen/Qwen2.5-7B-Instruct"}'
Terminal window
curl https://api.parel.cloud/v1/deployments/prefetch/Qwen%2FQwen2.5-7B-Instruct \
-H "Authorization: Bearer pk-dev-YOUR_KEY"
Terminal window
curl -X POST https://api.parel.cloud/v1/deployments/prefetch/Qwen%2FQwen2.5-7B-Instruct/cancel \
-H "Authorization: Bearer pk-dev-YOUR_KEY"
ConstraintValue
Per-model size limit50 GB
Unused prefetch TTL10 minutes
Storage lifecycle30 days