JavaScript: Add new query `InvalidEntityTranscoding`. #556

xiemaisi · 2018-11-28T10:29:43Z

A simple, lightweight query that spots a common mistake people make when writing HTML entity encoders/decoders: when encoding, & has to be encoded first to avoid double-encoding ampersands introduced by the encoding of other characters; conversely, when decoding it has to be decoded last to avoid the decoded ampersand being interpreted as part of an entity reference later on.

Finds good results on LGTM.com, here is the full report (internal link). One of the results is in a moderately popular (but not very actively maintained) utility-belt library.

@esben-semmle suggested looking for a similar problem with URL encoding and %, but a quick exploratory query seems to indicate that people don't often implement URL transcoding by hand.

xiemaisi · 2018-11-28T10:29:57Z

No particular rush to review this, it's not going into 1.19.

asger-semmle

LGTM overall, just a clarification and maybe a comment.

asger-semmle · 2018-11-28T12:52:52Z

javascript/ql/src/Security/CWE-116/InvalidEntityTranscoding.ql

+ * A call to `String.prototype.replace` that replaces all instances of a pattern.
+ */
+class Replacement extends DataFlow::Node {
+  RegExpLiteral pattern;


I take it string patterns here will always be reported by IncompleteSanitization, and that's why we don't include them here? Maybe worth leaving a comment about this.

Correct; I'll add a note.

xiemaisi · 2018-11-28T13:21:22Z

Ping @mc-semmle for doc review (but ~~#555~~ #559 is more urgent).

esbena · 2018-11-28T22:10:09Z

LGTM too, but I see two features that could go in a later PR.

How about escapes with backslashes and the escaping of the backslash it self?

getPreviousReplacement seems relevant for:

https://github.com/Semmle/ql/blob/d31c9950f9a9dec68d06c2d7af78761520879871/javascript/ql/src/Security/CWE-116/IncompleteSanitization.ql#L88-L94

xiemaisi · 2018-11-29T08:18:13Z

How about escapes with backslashes and the escaping of the backslash it self?

That's a very interesting idea; let me play with that a little bit.

mchammer01

@xiemaisi - this LGTM. I only made a very minor suggestion, which you can reject if you think it's OTT. Just let me know and I'll approve instead of requesting changes. Thanks!

mchammer01 · 2018-11-29T10:54:22Z

javascript/ql/src/Security/CWE-116/InvalidEntityTranscoding.qhelp

+</p>
+
+<p>
+Instead, the decoding function should decode ampersand last:


Tiny suggestion:
Instead, the decoding function should decode THE ampersand last:

xiemaisi · 2018-11-29T11:59:58Z

I have significantly rewritten the query based on @esben-semmle's suggestion, generalising it beyond HTML transcoding to a few other kinds of (un-)escaping.

I like the new results a lot, for instance this one from an old version of underscore. Full comparison is running.

ghost

The generalization LGTM, thank you for supporting the suggestion.

javascript/ql/src/Security/CWE-116/DoubleEscaping.ql

xiemaisi · 2018-11-30T08:57:21Z

Results from full run look good as well: internal link.

xiemaisi · 2018-11-30T09:41:23Z

Rebased to resolve conflict on change notes.

Perhaps @mc-semmle wants to take another look at the query help, which has been rewritten to reflect the expanded scope of the query (but no rush; it's not relevant for 1.19).

mchammer01 · 2018-11-30T16:46:54Z

Documentation LGTM, I've approved the changes :)

xiemaisi · 2018-11-30T16:50:25Z

Thanks, @mc-semmle! I've resolved the conflict on the change notes, so this should be good to go in once it's green again.

Update CODEOWNERS

xiemaisi added the JS label Nov 28, 2018

xiemaisi requested a review from a team as a code owner November 28, 2018 10:29

asger-semmle previously approved these changes Nov 28, 2018

View reviewed changes

xiemaisi dismissed asger-semmle’s stale review via 6ddc333 November 28, 2018 12:59

asger-semmle previously approved these changes Nov 28, 2018

View reviewed changes

xiemaisi requested a review from mchammer01 November 28, 2018 13:20

mchammer01 requested changes Nov 29, 2018

View reviewed changes

xiemaisi dismissed asger-semmle’s stale review via 713a27d November 29, 2018 11:52

xiemaisi force-pushed the js/invalid-entity-transcoding branch 2 times, most recently from 713a27d to 9fab6a9 Compare November 29, 2018 11:56

ghost reviewed Nov 30, 2018

View reviewed changes

javascript/ql/src/Security/CWE-116/DoubleEscaping.ql Show resolved Hide resolved

JavaScript: Add new query DoubleEscaping.

10166be

xiemaisi force-pushed the js/invalid-entity-transcoding branch from cf0020f to 10166be Compare November 30, 2018 09:39

mchammer01 previously approved these changes Nov 30, 2018

View reviewed changes

Merge branch 'master' into js/invalid-entity-transcoding

52b8a6b

xiemaisi dismissed mchammer01’s stale review via 52b8a6b November 30, 2018 16:49

ghost approved these changes Dec 3, 2018

View reviewed changes

ghost merged commit 2cc235d into github:master Dec 3, 2018

xiemaisi deleted the js/invalid-entity-transcoding branch December 3, 2018 09:40

cklin pushed a commit that referenced this pull request May 23, 2022

Merge pull request #556 from github/shati-patel-patch-1

65e9262

Update CODEOWNERS

aliscco mentioned this pull request Aug 30, 2022

[Snyk] Security upgrade sanitize-html from 1.27.5 to 2.7.1 aliscco/codeql#28

Open

aliscco mentioned this pull request Sep 24, 2022

[Snyk] Fix for 8 vulnerabilities aliscco/codeql#34

Open

verdinjoshua1982 mentioned this pull request Oct 14, 2022

[Snyk] Upgrade sanitize-html from 1.27.5 to 2.7.2 verdinjoshua1982/codeql#3

Open

This pull request was closed.

JavaScript: Add new query InvalidEntityTranscoding. #556

JavaScript: Add new query InvalidEntityTranscoding. #556

Uh oh!

Conversation

xiemaisi commented Nov 28, 2018

Uh oh!

xiemaisi commented Nov 28, 2018

Uh oh!

asger-semmle left a comment

Choose a reason for hiding this comment

Uh oh!

asger-semmle Nov 28, 2018

Choose a reason for hiding this comment

Uh oh!

xiemaisi Nov 28, 2018

Choose a reason for hiding this comment

Uh oh!

xiemaisi commented Nov 28, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

esbena commented Nov 28, 2018

Uh oh!

xiemaisi commented Nov 29, 2018

Uh oh!

mchammer01 left a comment

Choose a reason for hiding this comment

Uh oh!

mchammer01 Nov 29, 2018

Choose a reason for hiding this comment

Uh oh!

xiemaisi commented Nov 29, 2018

Uh oh!

ghost left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

xiemaisi commented Nov 30, 2018

Uh oh!

xiemaisi commented Nov 30, 2018

Uh oh!

mchammer01 commented Nov 30, 2018

Uh oh!

xiemaisi commented Nov 30, 2018

Uh oh!

Uh oh!

JavaScript: Add new query `InvalidEntityTranscoding`. #556

JavaScript: Add new query `InvalidEntityTranscoding`. #556

xiemaisi commented Nov 28, 2018 •

edited

Loading