We're teaming up with @cerebras to build the fastest possible inference.
Coming soon to Amazon Bedrock, we're delivering inference performance an order of magnitude faster than what's available today by connecting AWS Trainium3 for compute-intensive prefill with Cerebras CS-3 to power decode.
Learn more about the partnership. https://t.co/O7piA1GLz1