• 1 Post
  • 41 Comments
Joined 3 months ago
Cake day: November 5th, 2024




  • Critics debating Nepenthes’ utility on Hacker News suggested that most AI crawlers could easily avoid tarpits like Nepenthes, with one commenter describing the attack as being “very crawler 101.” Aaron said that was his “favorite comment” because if tarpits are considered elementary attacks, he has “2 million lines of access log that show that Google didn’t graduate.”

    You assume incorrectly that bots, scrapers and drive-by malware attacks are made by competent people. I have years’ worth of stories I’m not going to post on the open internet that say otherwise. I also have months’ worth of access logs that say otherwise. AhrefsBot in particular is completely unable to deal with anything you throw at it. It spent weeks in a tarpit I made, very similar to the one in the article, looping links, until I finally put it out of its misery.
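
For anyone wondering what a link tarpit looks like in practice, here is a minimal sketch in Python. It is not the tarpit described in the comment above and not Nepenthes itself. It only shows the general idea, assuming that every requested path gets a slow response full of links to more generated paths, so a crawler that blindly follows links never runs out of pages. The names `child_paths`, `render_page`, `TarpitHandler`, `LINKS_PER_PAGE` and `DELAY_SECONDS`, along with the port and delay values, are all made up for illustration.

```python
import hashlib
import html
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

LINKS_PER_PAGE = 5    # hypothetical value: links emitted on each page
DELAY_SECONDS = 2.0   # hypothetical value: how long each response is stalled


def child_paths(path: str, count: int = LINKS_PER_PAGE) -> list[str]:
    """Derive `count` child paths from `path`, deterministically.

    Hashing the parent path means the same URL always yields the same
    links, so the maze looks like a stable site without storing anything.
    """
    paths = []
    for i in range(count):
        digest = hashlib.sha256(f"{path}#{i}".encode()).hexdigest()[:12]
        paths.append(f"/{digest}")
    return paths


def render_page(path: str) -> str:
    """Build an HTML page whose only content is links deeper into the maze."""
    links = "\n".join(f'<a href="{p}">{p}</a><br>' for p in child_paths(path))
    title = html.escape(path)  # the path is attacker-controlled, so escape it
    return f"<html><body><h1>{title}</h1>\n{links}\n</body></html>"


class TarpitHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        time.sleep(DELAY_SECONDS)  # answer slowly to waste the crawler's time
        body = render_page(self.path).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


if __name__ == "__main__":
    # Bind to localhost only; a real deployment would sit behind a path
    # that robots.txt disallows, so well-behaved crawlers never enter.
    HTTPServer(("127.0.0.1", 8080), TarpitHandler).serve_forever()
```

Running the script and opening http://127.0.0.1:8080/ in a browser shows a page of five links, each leading to another page of five links, and so on indefinitely. Because this `HTTPServer` handles one request at a time, the delay also blocks every other client, which is acceptable for a demo only.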






  • The misunderstanding seems to be between software and hardware. It is good to reboot Windows and some other operating systems because they accumulate errors and quirks. It is not good to power-cycle your hardware, though, because that increases wear.

    I’m not on an OS that needs to be rebooted; I count my uptime in months.

    I don’t want you to pick up a new anxiety about rebooting your PC, though. Components are built to last, generally speaking. Even if you power-cycled your PC 5 times daily, you’d most likely upgrade your hardware long before it wears out.



  • To me, the appeal is that my workflow depends less on my computer and more on my ability to connect to a server that handles everything for me. Workstation, laptop or phone? Doesn’t matter, just connect to the right IPs and get working. Linux is, of course, the holy grail of interoperability, and I’m all Linux. With a little bit of setup, I can make a lot of things talk to each other seamlessly. SMB on Windows is a nightmare, but on Linux, once I set up SSH keys, I can just open a file manager, type sftp://<hostname>, and browse that machine as if it were a local folder. I can do a lot of work from my genuinely trash laptop because it’s the server that’s doing the heavy lifting.

    TL;DR: My workflow becomes “client agnostic” and I value that a lot.