Optimizing Wellhub Autocomplete Service Latency: A Multi-Region Architecture
Briefly

Wellhub leveraged a multi-region architecture for its autocomplete service, utilizing Elasticsearch for user input predictions. This approach significantly improved search relevance with geo queries based on user locations.
We implemented AWS Global Accelerator to route traffic efficiently, which involved using static IPs and TCP optimizations to ensure that users experienced low-latency connections to the nearest service instance.
Data replication was carried out via AWS S3 Cross-Region Replication, which aligned with our non-real-time update requirements and enabled backups to be restored across different regions.
To enhance perceived performance, we introduced a pre-fetch endpoint after deploying our multi-region architecture, which further reduced latency and improved user experience for services across regions.
Read at InfoQ
[
|
]