Optimizely Search & Navigation (formerly Find) is a Cloud based Enterprise Search solution that delivers both enhanced relevance and powerful search functionality to websites. On Friday November 25, 2022, we experienced an event which impacted service functionality in the AP Digital Experience Cloud region. The following report describes additional details around the event.
Between November 25, 2022, 08:14 and 08:45 UTC the Search & Navigation cluster APAC01 experienced an intermittent service degradation.
The issue was triggered by a sudden increase of bulking requests from a tenant of shared resource, causing constant CPU spike on multiple data nodes. The affected nodes were gracefully restarted to clear the memory hooks and service was fully operational by 08:45 UTC.
November 25, 2022
08:14 UTC – First alert triggered and troubleshooting initiated.
08:19 UTC – Unhealthy nodes were identified and restarted by automation.
08:26 UTC – STATUSPAGE updated.
08:32 UTC – Service started recovering and was able to handle the requests.
08:45 UTC – Service was fully operational.
The cause of this incident was due to a large number of bulking requests from a tenant of shared resources which resulted in a sudden CPU spike on multiple data nodes.
During this series of events, a subset of requests to the Search & Navigation cluster may have experienced network timeouts (5xx-errors), or slower than normal response times while attempting to connect.
Short-term mitigation
Long-term mitigation
We are sorry! We recognize the negative impact to affected customers and regret the disruption that you sustained. We have a strong commitment to our customers, and to deliver high availability services including Search & Navigation. We will continue to prioritize our efforts to overcome these recent difficulties, and diligently apply lessons-learned to avoid future events of similar nature to ensure that we continue to develop Raving Fans!