• 1 Post
  • 17 Comments
Joined 2 years ago
cake
Cake day: July 3rd, 2023

help-circle



  • Yeah, it’s not technically impossible to stop web scrapers, but it’s difficult to have a lasting, effective solution. One easy way is to block their user-agent assuming the scraper uses an identifiable user-agent, but that can be easily circumvented. The also easy and somewhat more effective way is to block scrapers’ and caching services’ IP addresses, but that turns into a game of whack-a-mole. You could also have a paywall or login to view content and not approve a certain org, but that only will work for certain use cases, and that also is easy to circumvent. If stopping a single org’s scraping is the hill to die on, good luck.

    That said, I’m all for fighting ICE, even if it’s futile. Just slowing them down and frustrating them is useful.







  • This is why I believe scientists should be required to take liberal arts classes; especially related to written and spoken language.

    And yes, I also think liberal arts students should be required to take some level of hard STEM classes (not watered-down “libarts-compatible” stuff, but actual physics, chemistry, biology, etc) as well.

    Yes to both points! I’m eternally grateful to my high school AP English teachers for teaching me how to write and communicate.

    My somewhat unpopular opinion is that we’d be better off as a society if everyone in college took “real” STEM and liberal arts classes. The STEM folks can understand the why and societal implications of what they study (as well as just communication), and the liberal arts types can learn a bit about how the world actually works in a concrete way.

    Unfortunately, I’ve been continually struck by how incurious people are. I get that everyone has their interests, but that shouldn’t be to the exclusion of all other study. So, I don’t think this will happen. :/





  • The original paper itself, for those who are interested.

    Overall, this is really interesting research and a really good “first step.” I will be interested to see if this can be replicated on other models. One thing that really stood out, though, was that certain details are obfuscated because of Sonnet being proprietary. Hopefully follow-on work is done on one of the open source models to confirm the method.

    One of the notable limitations is quantifying activation’s correlation to text meaning, which will make any sort of controls difficult. Sure, you can just massively increase or decrease a weight, and for some things that will be fine, but for real manual fine tuning, that will prove to be a difficulty.

    I suspect this method is likely generalizable (maybe with some tweaks?), and I’d really be interested to see how this type of analysis could be done on other neural networks.



  • I’m all about this. When I made my personal webpage, this is how I do it. I’m surprised it’s not more popular (at least for certain things) because it looks nice and clean, is fast, and crucially, is easy to put together. Most webpages don’t need a ton of JS to “accomplish the mission.” I get that not everything can do this, but there are soooooo many sites that can strip down to a more minimal site and have better functionality and a better experience. This is a case of less-is-more.


  • This is a much better article. OP’s article just shows the author’s surface understanding of how coding works and how well an LLM can actually code. There’s way more that goes into a programming task than just coding.

    I see LLMs as having the potential of being almost like a super library. I can prompt GPT, Claude, etc. to write me a custom function that I copy, paste, test, scrutinize, and almost certainly change. It’s a tool that will make someone a more productive programmer. It won’t completely subsume a human’s ability to be creative and put the pieces together.

    At the absolute worst over the next decade, I could see programming changing from writing and debugging code to prompting, stitching together, and debugging.