Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads
Briefly

At Google Cloud Next 2025, Google unveiled Rapid Storage, a zonal bucket designed for latency-sensitive applications, offering sub-millisecond latency and 6 TB/s of throughput. It co-locates storage with GPUs and TPUs, addressing the latency typical of traditional cloud storage. The system supports AI frameworks and improves efficiency through technologies such as gRPC streaming. The launch positions Google as a competitor to services such as Amazon S3 Express and marks a significant step toward more responsive cloud storage.
Google's Rapid Storage zonal bucket delivers data access latencies under 1 ms and optimizes performance for AI workloads by placing storage close to GPUs and TPUs.
For peak efficiency in AI model training and serving, minimizing latency is critical; Rapid Storage achieves this by colocating storage with processing units.
A stateful gRPC-based streaming protocol underpins Rapid Storage, allowing it to integrate with major AI frameworks while maintaining high throughput.
Google's entry into low-latency storage positions it against Amazon S3 Express, with advantages tailored to latency-sensitive applications.
Read at InfoQ