SageMaker inference endpoint deployment
#8, opened by tnhlaing
Has anyone successfully deployed Wan-AI/Wan2.2-I2V-A14B-Diffusers to a SageMaker inference endpoint? I'm running into issues with model loading when using this base image: 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.6.0-transformers4.51.3-gpu-py312-cu124-ubuntu22.04