in reply to The Matrix.org Foundation

So: the matrix.org database secondary lost its FS due to a RAID failure earlier today (11:17 UTC). Then, we lost the primary at 17:26. We're trying to restore the primary DB FS (which could be fastish), while also doing a point-in-time backup restore from last night (which takes >10h). We believe the incremental DB traffic since last night is intact however. Apologies for the downtime; folks on their own homeserver are of course not impacted.
in reply to The Matrix.org Foundation

Sorry, but it's bad news: we haven't been able to restore the DB primary filesystem to a state we're confident in running as a primary (especially given our experiences with slow-burning postgres db corruption). So we're having to do a full 55TB DB snapshot restore from last night, which will take >10h to recover the data, and then >4h to actually restore, and then >3h to catch up on missing traffic. Huge apologies for the outage. Again, folks using their own homeservers are not impacted.

reshared this

in reply to The Matrix.org Foundation

better run it ur own, i have very good experience with #conduit server conduit.rs
This entry was edited (20 hours ago)
in reply to The Matrix.org Foundation

weirdly this feels like actually a positive example reinforcing the idea of a decentral fediverse, as other instances are unaffected. Also we had been discussing running an own instance at the @chaotikumev just before the outage.
I just wish there were such an easy, neat account migration feature like @Mastodon has. (And I guess I can't just ex- and import chats + keys and use SRV records to have a seamless migration?)
in reply to The Matrix.org Foundation

Right, matrix.org is back online as of 17:00 UTC. The server is struggling a bit as it catches up. Huge apologies again for the outage; postmortem + ways to avoid a repeat will be forthcoming. See also theregister.com/2025/09/03/mat… & heise.de/en/news/Matrix-main-s…. Thanks all for your patience.

Bubu reshared this.

in reply to The Matrix.org Foundation

I don't think that this true, folks from other servers are not effected.
As result of the 18years policy I moved my community to other servers and now partially having issues with not shared security keys and lots of unencrytable messages.

When you restore and old save state every key generated and shared in the meantime is gone.

Right?