'Don't parse markup languages with Regex' is an annoying trollpost and it should die... right?

ChubakPDP11+TakeWithGrainOfSalt@programming.dev · edit-2 1 year ago

'Don't parse markup languages with Regex' is an annoying trollpost and it should die... right?

some_guy@lemmy.sdf.org · 1 year ago

I’ve been automating tasks with HTML pages for years and years. I built two scripts for harvesting media and progressing through galleries at the start of the pan when we first had all that downtime. I kept refining them over time and they worked very well. If you know enough of a scripting language, you can absolutely use regex to take pages apart and do this. In the end, I was grabbing titles for different files and renaming them when I saved them to disk. It’s just trial and error to get what ultimately works.

I suspect that you’re doing things that are likely a bit more advanced than what I just described. Either way, cheers! I like this theme.