VoiceOfRa comments on The horrifying importance of domain knowledge - Less Wrong Discussion
A lot of programmers believe they can parse HTML with regular expressions.
A lot of programmers believe they can parse HTML at all.
Go read the official W3C parser algorithm; I'll wait. The first thing you'll notice is that there is no formal grammar: the spec describes the actual parser state machine. Then you notice that each past and present HTML version has its own parser algorithm spec, and there is no official documentation of the differences between them, never mind a rationale. Then you realize that HTML5 is now a "living spec", so the parser algorithm at that link occasionally changes, and past versions and changelogs are deliberately not published...
HTML is a parseable format like PHP is a programming language. There is no spec, there is only whatever bugs and quirks a particular browser version happens to contain.
(Oh, you thought browsers actually follow any of those published W3C specs? HAHAHAHAHA sob.)
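For anyone who wants to see the regex problem from the top of this thread concretely, here is a minimal sketch. The sample markup, the naive pattern, and the helper names (`tags_by_regex`, `tags_by_parser`, `TagCollector`) are all made up for illustration. It compares a one-line regex against Python's standard-library `html.parser`, which is a lenient tokenizer and not an implementation of the spec's parsing algorithm, so treat it as a demonstration of the failure mode rather than a recommendation.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample: a ">" inside a quoted attribute value,
# and something tag-shaped inside a comment.
SAMPLE = '<a title="1 > 0" href="/x">link</a><!-- <b>not a tag</b> --><p>text</p>'

# Naive assumption: a start tag is "<", a name, then anything up to the next ">".
NAIVE_TAG = re.compile(r"<([a-zA-Z][a-zA-Z0-9]*)[^>]*>")


def tags_by_regex(markup):
    """Return the start-tag names the naive regex claims to see."""
    return [m.group(1) for m in NAIVE_TAG.finditer(markup)]


class TagCollector(HTMLParser):
    """Record start-tag names as the stdlib tokenizer reports them."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def tags_by_parser(markup):
    """Return the start-tag names reported by html.parser."""
    collector = TagCollector()
    collector.feed(markup)
    collector.close()
    return collector.tags


if __name__ == "__main__":
    print("regex: ", tags_by_regex(SAMPLE))
    print("parser:", tags_by_parser(SAMPLE))
```

The regex cuts the `<a>` tag short at the `>` inside the `title` value (losing `href` entirely) and reports the commented-out `<b>` as a real element. The tokenizer tracks state, so it does neither. Patching the pattern for these two cases just moves the problem to the next quirk.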
HTML is indeed a turd of a standard.
Hey, can we try not invoking the blasphemous things that dangle from the staked corpses o̭̙̥͚̘͍̠f dead universes?
If I were feeling snarky I'd add, "a lot of people believe they are programmers."
(To adapt a quotation, "Anyone can code. That doesn't mean anyone should.")