Good morning everyone. Just a quick heads up that I’ve banned a good chunk of IP space in China due to abusive traffic.

I’ve tried to restrict this where possible to datacenter blocks from Huawei, Tencent, and Alibaba, but China Telecom / Mobile were also heavy sources of suspicious traffic. I doubt we have many (if any) users in China, but if you are affected please let me know.

This has been ongoing for a while and I ignored it initially since the traffic levels were low, but it wasn’t anymore.

The ban has very visibly cut our traffic levels:

  • Arghblarg@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Would a honeypot community help, where anyone visiting a post there gets blocked or tarpitted?

  • Greg Clarke@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Neat, thanks team! Do you have requests per minute graphs for the same period? It would be interesting to see the scale of these scraps

  • Albbi@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Has this affected posts at all? The Top 6 hour is suspiciously empty right now for me.

  • Jerkface (any/all)@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    2 months ago

    They could scrape us a lot more quietly and with less impact to the network by just setting up their own Lemmy instance. It’s just rude to hit ours.

  • Troy@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Thanks for your hard work. Also it’s very interesting in that the nature of the traffic is unknown. Bots scouring content?

    • Yardy Sardley@lemmy.ca
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      My intuition says it’s probably LLM training. AI companies have been increasingly DDoSing the entire web for a while now.

  • Avid Amoeba@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Haven’t done ops in a while, is there any good automated system that can block IPs on individual basis based on activity patterns? E.g. trying to login with the wrong SSH password too many times, but relevant to our use case?

    • Shadow@lemmy.caOPM
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      Cloudflare tries, but bots do a pretty good job looking like regular users these days. There’s some more advanced “AI” solutions that learn based on existing traffic patterns, but I’ve been out of that space for a while so not sure what the latest tech is.

    • Shadow@lemmy.caOPM
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      Too many IPs, so I did it by ASN at cloudflare.

      • AS4134 Chinanet backbone
      • AS45102 Alibaba cloud
      • AS136907 Huawei cloud
      • AS132203 Tencent
      • AS4812 China telecom
      • AS21859 Zenlayer
      • AS56041 China mobile
      • AS134762 Chinanet
      • AS56048 China mobile
      • AS24444 Shandong
      • AS38019 Tianjin mobile
      • AS134810 China mobile
      • AS56046 China mobile
      • AS56040 China mobile
      • AS24400 Shanghai mobile
      • AS17638 Tianjin provincial net
      • AS132525 Heilngjiang
      • AS24547 Hebei mobile
      • AS4808 Unicom bejing
      • AS17621 Unicom shanghai
      • AS56047 China mobile
      • AS4837 China unicom
      • AS56042 China mobile
      • AS9808 China Mobile

      There’s just no way we have thousands of legitimate users browsing old.lemmy.ca from their phone in China.