You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

DanArmak comments on The horrifying importance of domain knowledge - Less Wrong Discussion

15 Post author: NancyLebovitz 30 July 2015 03:28PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (236)

You are viewing a single comment's thread. Show more comments above.

Comment author: PhilGoetz 31 July 2015 06:30:18PM 3 points [-]

Weirdly, there is no list of falsehoods programmers believe about html (or at least a fast search didn't turn anything up).

A lot of programmers believe they can parse HTML with regular expressions.

Comment author: DanArmak 01 August 2015 10:38:55AM *  5 points [-]

A lot of programmers believe they can parse HTML at all.

Go read the official W3C parser algorithm, I'll wait. First thing you'll notice is that there is no formal grammar - the spec is of the actual parser state machine. Then you notice each past-and-present HTML version has its own parser algorithm spec, and there is no official documentation on the differences between them, never mind rationale. Then you realize that HTML5 is now a "living spec", so the parser algorithm at that link occasionally changes, and past versions and changelogs are deliberately not published...

HTML is a parseable format like PHP is a programming language. There is no spec, there is only whatever bugs and quirks a particular browser version happens to contain.

(Oh, you thought browsers actually follow any of those published W3C specs? HAHAHAHAHA sob.)

Comment author: eternal_neophyte 02 August 2015 01:32:38AM 0 points [-]

HTML is indeed a turd of a standard.