1 min read


In-function concurrency is now part of Fluid compute
Fluid compute combines the efficiency of servers and the fleixbility of serverless, enabling real-time, dynamic workloads like APIs, streaming, and AI.
Learn about Fluid
In-function concurrency is now in public beta, and allows a single function instance to handle multiple invocations concurrently, improving resource utilization by taking advantage of idle time in existing function instances.
Traditionally, serverless architecture maps one function instance to a single invocation. With in-function concurrency, overlapping invocations can increase efficiency by 20%-50%, reducing gigabyte-hours and lowering costs.
As part of the beta, we’re limiting the number of concurrent invocations per instance, and will be gradually increasing the limit based on feedback. Note, this capability may increase latency for purely CPU-bound workloads.
In-function concurrency public beta is available for all Pro and Enterprise customers using Standard or Performance Function CPU, you can enable it through your dashboard and track resource savings in real time.
Read our blog post and documentation for more information.





