As mentioned in a few recent incidents, we've made some significant changes to our system architecture and deploy process. Unfortunately today a hiccup in a deploy resulted in about 24 minutes of downtime to our app. Our ops team is working hard to add more fault tolerance to every level of our deploy pipeline to prevent corner cases that could cause issues like we've seen today. Early next week we'll be providing a public postmortem to provide insight into these issues and the steps we're taking to prevent them.
Feb 24, 11:31 PST
A fix has been implemented and we are monitoring the results.
Feb 24, 08:44 PST
The app is back up and running and we are closely monitoring our systems. We'll continue our investigation and post more details here shortly.
Feb 24, 08:20 PST
We are working to restore connectivity to our app.
Feb 24, 08:02 PST
Users are currently seeing failed requests on UserVoice. We are investigating.
Feb 24, 08:00 PST