Hey everyone, as I’m sure you’re all aware, Lemmy.World has experienced several outages over the last few days.

We are still investigating the root cause of these outages, but we believe they are related to our current hosting provider (Hetzner) blocking access from Cloudflare. We think they have mistaken our CDN for a DDoS source, which is causing the disconnects to our backend server. Problematic, for sure.

We’ve opened support tickets with our current provider and are awaiting a response. We have no issue with being as transparent as possible about downtime. Anyone who is curious can check https://status.lemmy.world and https://dash.lemmy.world for up-to-the-minute outage information. We are also looking into other fediverse-friendly methods of posting status and outage updates.

In the meantime, we are evaluating alternative hosting options and solutions to provide a high level of reliability to you, our users. Really, we want to say thanks to everyone for soldiering through all our technical growing pains.

Cheers

  • LW Infra Team
  • Flying Squid
    11
    2 years ago

    Thanks for the transparency. I think you lost some trust with the Discord debacle, but this makes me feel a lot better about things.

  • Machefi
    81
    2 years ago

    I don’t blame you for this, but the uptime records are incomplete at best. I’ve experienced the site being down (and confirmed with Down for Everyone or Just Me), yet status.lemmy.world showed all systems operational. As I’m writing this, status.lemmy.world is missing most data up to yesterday and dash.lemmy.world shows 16 days uptime.

    I have a lot of respect for you for even having these. I also remember status.lemmy.world working mostly fine some time ago. But as of right now, both uptime monitors fail to serve their purpose.

    • @[email protected]OPM
      English
      81
      2 years ago

      You need to hover over the status bar to see if there was any downtime for that day. We can enable it to log an incident every time there is a burp, but we are still tuning alerts, as it currently only creates an incident when we ACK it in PagerDuty. You can always check the dashboard for up-to-the-minute stats, as well as https://lemmy-status.org/endpoints/_lemmy-world. We’ll add this info to make things clearer <3

      EDIT: Added more info to our status page, thanks for the feedback Machefi!

      EDIT2: Also the missing data is due to us removing and adding more specific monitors for the different infra services.

      • Obinice
        26
        2 years ago

        Excuse me stop being so cool, you’re raising the bar too high for everyone else thank you

  • @[email protected]
    80
    2 years ago

    Maybe it’s just the times I’m accessing, but it seems better to me this week compared to the last few.

  • @[email protected]
    English
    42
    2 years ago

    On your Cloudflare account, if there was a change in the CNAME/A record being proxied vs. DNS only, that could cause an issue, as Cloudflare would then strip headers off the request that your Apache/Nginx would be looking for.

    If you enabled HTTP DDoS protection in your Security -> WAF tab (I think that’s where it is) that could do this too. Might be worth disabling.

    Also check for any headers your HTTP load balancer might be expecting, that Cloudflare could be stripping.

    Might be worth tailing the webserver logs to see what happens to requests coming in from Cloudflare.
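
As a rough illustration of the log check suggested above, here is a minimal Python sketch that tallies HTTP status codes separately for requests arriving from Cloudflare addresses and from everywhere else. It assumes the web server writes the default common/combined access log format and logs the connecting peer address. The function names are hypothetical, and the IP ranges are only a small illustrative subset of Cloudflare's published list, which may be out of date.

```python
import ipaddress
import re
from collections import Counter

# Assumption: a small illustrative subset of Cloudflare's published IPv4
# ranges. Fetch the current list from Cloudflare before relying on this.
CLOUDFLARE_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in ("173.245.48.0/20", "104.16.0.0/13", "172.64.0.0/13")
]

# Prefix of the common/combined log format:
# client IP, identity, user, [timestamp], "request line", status code
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "[^"]*" (?P<status>\d{3}) '
)


def is_cloudflare(ip):
    """Return True if ip parses and falls inside one of the listed ranges."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        return False
    return any(addr in net for net in CLOUDFLARE_RANGES)


def status_counts(lines):
    """Count status codes per source ("cloudflare" or "other")."""
    counts = {"cloudflare": Counter(), "other": Counter()}
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that are not in the expected format
        source = "cloudflare" if is_cloudflare(match["ip"]) else "other"
        counts[source][match["status"]] += 1
    return counts
```

Passing an open access log file (or any iterable of log lines) to `status_counts` gives a quick view of whether errors such as 5xx responses cluster on requests arriving through Cloudflare. It does not look at request headers, since the default log format does not record them.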

  • @[email protected]
    English
    10
    2 years ago

    Is there anything we can do to help? Donations? Tech volunteers? Visit hosting company with a baseball bat?

  • @[email protected]
    English
    5
    2 years ago

    The dashboard requires a login to view. At least from here it does. Is there a way to view without making an account?

  • @[email protected]
    English
    270
    2 years ago

    As always, the transparency is appreciated. Some growing pains are certainly to be expected.

  • @[email protected]
    20
    2 years ago

    Could look into Dacentec if you need more cheap servers. I use them for my stuff. YMMV since you’re getting a hell of a lot more traffic than I am but they haven’t blocked Cloudflare on mine yet so that’s a plus. :)

  • @[email protected]
    37
    2 years ago

    Everyone is talking about the downtime, and here I am just enjoying Lemmy like never before. I’ve experienced no downtime so far.