Can I escape HTML special chars in JavaScript?

Question

I want to display text to HTML by a JavaScript function. How can I escape HTML special characters in JavaScript? Is there an API?

This is not a duplicate, since this question does not ask about jQuery. I am interested only in this one, since I do not use jQuery... — lvella
– lvella, Commented Aug 7, 2013 at 16:08
possible duplicate of HtmlSpecialChars equivalent in Javascript? — Bergi
– Bergi, Commented Aug 7, 2013 at 16:19
Note that the browsers are working on a new HTML Sanitizer API. — Flimm
– Flimm, Commented Jan 26, 2022 at 20:05
@AdrianWiik It looks like the HTML Sanitizer API was deprecated: developer.chrome.com/blog/sanitizer-api-deprecation — Flimm
– Flimm, Commented Feb 16 at 14:20

ggorlen · Accepted Answer · 2025-04-05 22:27:30Z

557

Here's a solution that will work in practically every web browser:

function escapeHtml(unsafe) {
  return unsafe
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#039;");
}

If you only support modern web browsers (2020+), then you can use the new replaceAll function:

const escapeHtml = unsafe => {
  return unsafe
    .replaceAll("&", "&amp;")
    .replaceAll("<", "&lt;")
    .replaceAll(">", "&gt;")
    .replaceAll('"', "&quot;")
    .replaceAll("'", "&#039;");
};
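
As a quick check of the functions above, the following sketch runs the same five replacements on a sample string and also illustrates why the ampersand has to be replaced first. The name `escapeAmpersandLast`, the `String()` coercion, and the sample input are illustrative assumptions, not part of the original answer.

```javascript
// Same replacement chain as above; String() is added here only so that
// non-string input does not throw.
function escapeHtml(unsafe) {
  return String(unsafe)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#039;");
}

// Hypothetical broken variant: & is replaced last, so the "&" of the
// entities produced by the earlier steps gets escaped a second time.
function escapeAmpersandLast(unsafe) {
  return String(unsafe)
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/&/g, "&amp;");
}

console.log(escapeHtml(`<a href="x">T&C's</a>`));
// &lt;a href=&quot;x&quot;&gt;T&amp;C&#039;s&lt;/a&gt;

console.log(escapeAmpersandLast("<b>"));
// &amp;lt;b&amp;gt;
```

The second call shows the double-escaping that appears when the order is changed, which is why `&` comes first in both versions above.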

edited Apr 5 at 22:27

ggorlen

59.2k8 gold badges118 silver badges173 bronze badges

answered Jun 4, 2011 at 5:00

bjornd

23k4 gold badges59 silver badges74 bronze badges


18 Comments

sereda Over a year ago

Why "&#039;" and not "&apos;" ?

Shreyans Over a year ago

because: stackoverflow.com/questions/2083754/…

jamix Over a year ago

I think regular expressions in replace() calls are unnecessary. Plain old single-character strings would do just as well.

Code Guru Over a year ago

is there any standard API or this is the only way?

Ishrak M Over a year ago

&apos; is valid in HTML5 but not in HTML4


vsync · Accepted Answer · 2018-06-13 13:29:29Z

83

function escapeHtml(html){
  var text = document.createTextNode(html);
  var p = document.createElement('p');
  p.appendChild(text);
  return p.innerHTML;
}

// Escape while typing & print result
document.querySelector('input').addEventListener('input', e => {
  console.clear();
  console.log( escapeHtml(e.target.value) );
});

<input style='width:90%; padding:6px;' placeholder='&lt;b&gt;cool&lt;/b&gt;'>

edited Jun 13, 2018 at 13:29

vsync

133k59 gold badges344 silver badges430 bronze badges

answered Aug 20, 2014 at 2:50

spiderlama

1,70317 silver badges10 bronze badges

2 Comments

user8850199 Over a year ago

Working Here but Not working for me offline in browser

jdgregson Over a year ago

Note that this doesn't escape quotes (" or ') so strings from this function can still do damage if they are used in HTML tag attributes.

fgb · Accepted Answer · 2018-01-24 17:33:52Z

59

You can use jQuery's .text() function.

For example:

http://jsfiddle.net/9H6Ch/

From the jQuery documentation regarding the .text() function:

We need to be aware that this method escapes the string provided as necessary so that it will render correctly in HTML. To do so, it calls the DOM method .createTextNode(), does not interpret the string as HTML.

Previous Versions of the jQuery Documentation worded it this way (emphasis added):

We need to be aware that this method escapes the string provided as necessary so that it will render correctly in HTML. To do so, it calls the DOM method .createTextNode(), which replaces special characters with their HTML entity equivalents (such as &lt; for <).

edited Jan 24, 2018 at 17:33

fgb

18.6k3 gold badges41 silver badges52 bronze badges

answered Jun 4, 2011 at 5:01

jeremysawesome

7,2745 gold badges35 silver badges38 bronze badges

2 Comments

amoebe Over a year ago

You can even use it on a fresh element if you just want to convert like this: const str = "foo<>'\"&"; $('<div>').text(str).html() yields foo&lt;&gt;'"&amp;

Ben Philipp Over a year ago

Note that this leaves quotes ' and " unescaped, which may trip you up

Peter Mortensen · Accepted Answer · 2021-06-15 21:44:17Z

58

Using Lodash:

_.escape('fred, barney, & pebbles');
// => 'fred, barney, &amp; pebbles'

Source code
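
For projects that do not want to pull in Lodash, a rough dependency-free approximation of this behaviour could look like the sketch below. It is not Lodash's source: `escapeLike` and `unescapeLike` are hypothetical names, and the entity table (including `&#39;` for the apostrophe) is an assumption based on the documented behaviour of `_.escape` and `_.unescape`.

```javascript
// Assumed entity table for the five characters that get escaped.
const ENTITY_BY_CHAR = {
  "&": "&amp;",
  "<": "&lt;",
  ">": "&gt;",
  '"': "&quot;",
  "'": "&#39;",
};

// Reverse table, built from the one above.
const CHAR_BY_ENTITY = Object.fromEntries(
  Object.entries(ENTITY_BY_CHAR).map(([ch, entity]) => [entity, ch])
);

// Single pass over the input; String() coerces numbers and other values.
function escapeLike(value) {
  return String(value).replace(/[&<>"']/g, ch => ENTITY_BY_CHAR[ch]);
}

// Also a single pass, so "&amp;lt;" becomes "&lt;" and is not decoded twice.
function unescapeLike(value) {
  return String(value).replace(
    /&(?:amp|lt|gt|quot|#39);/g,
    entity => CHAR_BY_ENTITY[entity]
  );
}

console.log(escapeLike("fred, barney, & pebbles"));
// fred, barney, &amp; pebbles
console.log(unescapeLike("fred, barney, &amp; pebbles"));
// fred, barney, & pebbles
```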

edited Jun 15, 2021 at 21:44

Peter Mortensen

31.4k22 gold badges110 silver badges134 bronze badges

answered Oct 30, 2016 at 19:41

cs01

5,8612 gold badges33 silver badges29 bronze badges

3 Comments

Code Guru Over a year ago

what is the opposite of this? name of the function that does the opposite of this?

juanmirocks Over a year ago

Same functions in underscore: underscorejs.org/#escape & underscorejs.org/#unescape

Kreidol Over a year ago

Doesn't seem to work for IP addresses when you try _.escape(192.168.1.1), but if I add quotes, then it works: _.escape('52.60.62.147') even though I'm referencing a variable where the value is not a string. LoDash is so great!

user2226755 · Accepted Answer · 2024-09-17 13:24:23Z

52

This is, by far, the fastest way I have seen it done. Plus, it does it all without adding, removing, or changing elements on the page.

function escapeHTML(unsafeText) {
    let div = document.createElement('div');
    div.innerText = unsafeText;
    return div.innerHTML;
}

⚠️ Warning: it does not escape quotes so you can't use the output inside attribute values in HTML code. E.g. var divCode = '<div data-title="' + escapeHTML('Jerry "Bull" Winston') + '">Div content</div>' will yield invalid HTML!
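
The attribute problem described in the warning can be reproduced without a browser. The sketch below uses a hypothetical stand-in, `escapeTextOnly`, that escapes only `&`, `<` and `>` and leaves quotes alone (an assumption meant to mirror the behaviour the warning describes), and compares it with a variant, `escapeForAttribute`, that also escapes quotes.

```javascript
// Hypothetical stand-in for the DOM-based escapeHTML above: it escapes
// &, < and > but leaves quotes untouched.
function escapeTextOnly(text) {
  return String(text)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;");
}

// Variant that can also be used inside quoted attribute values.
function escapeForAttribute(text) {
  return escapeTextOnly(text)
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#039;");
}

const title = 'Jerry "Bull" Winston';

// The inner quotes end the attribute value early.
const broken = '<div data-title="' + escapeTextOnly(title) + '">Div content</div>';
console.log(broken);
// <div data-title="Jerry "Bull" Winston">Div content</div>

// With quotes escaped the attribute value stays intact.
const fixed = '<div data-title="' + escapeForAttribute(title) + '">Div content</div>';
console.log(fixed);
// <div data-title="Jerry &quot;Bull&quot; Winston">Div content</div>
```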

edited Sep 17, 2024 at 13:24

user2226755

13.3k6 gold badges55 silver badges77 bronze badges

answered Jan 2, 2018 at 0:11

arjunpat

7435 silver badges10 bronze badges

5 Comments

izogfif Over a year ago

Warning: it does not escape quotes so you can't use the output inside attribute values in HTML code. E.g. var divCode = '<div data-title="' + escapeHTML('Jerry "Bull" Winston') + '">Div content</div>' will yield invalid HTML!

Klesun Over a year ago

Using div.textContent instead of div.innerText would probably be more idiomatic.

Magnus Over a year ago

Just wondering, would repeatedly calling this eventually leave document full of extra div elements? Or does it get garbage collected?

Michael T Over a year ago

@Magnus The div isn't attached to the DOM, so it will eventually be garbage collected. So no, this will not fill the document with useless elements.

Torben Feb 13 at 10:03

Using div.innerText = unsafeText causes the function to return all line-breaks (\n) as <br>. This unwanted side-effect can be avoided by using div.textContent = unsafeText instead. Would be great, if the code in the answer could be updated to reflect this.

Brad · Accepted Answer · 2025-04-05 22:41:31Z

51

I think I found the proper way to do it...

// Create a DOM Text node:
var text_node = document.createTextNode(unescaped_text);

// Get the HTML element where you want to insert the text into:
var elem = document.getElementById('msg_span');

// Optional: clear its old contents
//elem.innerHTML = '';

// Append the text node into it:
elem.appendChild(text_node);

edited Apr 5 at 22:41

Brad

164k57 gold badges380 silver badges558 bronze badges

answered Aug 7, 2013 at 16:16

lvella

13.6k13 gold badges61 silver badges121 bronze badges

4 Comments

Sellorio Over a year ago

I learnt something new about HTML today. w3schools.com/jsref/met_document_createtextnode.asp.

maechler Over a year ago

Be aware that the content of the text node is not escaped if you try to access it like this: document.createTextNode("<script>alert('Attack!')</script>").textContent

jgmjgm Over a year ago

This is the correct way if all you're doing is setting text. That's also textContent but apparently it's not well supported. This won't work however if you're building up a string with some parts text some html, then you need to still escape.

TRiG Over a year ago

I really like this, because it's using the DOM properly. It feels less "hacky" than most of the other options.

ADJenks · Accepted Answer · 2024-10-17 17:25:44Z

By the books

When editing HTML content between `<tags>`, use "HTML Entity Encoding":

For editing the HTML content between an opening and closing tag using JavaScript, OWASP recommends that you "look at the .textContent attribute. It is a Safe Sink and will automatically HTML Entity Encode."

When editing HTML attributes use recommended "HTML Attribute Encoding":

Previously I offered the function below to do the encoding yourself, but there are "Safe Sinks" for HTML Attribute Encoding as well. For example, the second argument of the setAttribute function is allowed to be "dangerous" because it will be encoded automatically. Please refer to this page for more safe places to put potentially dangerous content: https://cheatsheetseries.owasp.org/cheatsheets/Cross_Site_Scripting_Prevention_Cheat_Sheet.html#safe-sinks

Manual method for HTML Attributes:

It's better to not do it yourself, but if you so desire, OWASP recommends that "[e]xcept for alphanumeric characters, [you should] escape all characters with ASCII values less than 256 with the &#xHH; format (or a named entity if available) to prevent switching out of [an] attribute."

So here's a function that does that, with a usage example:

function escapeHTML(unsafe) {
  return unsafe.replace(
    /[\u0000-\u002F\u003A-\u0040\u005B-\u0060\u007B-\u00FF]/g,
    c => '&#' + ('000' + c.charCodeAt(0)).slice(-4) + ';'
  )
}

document.querySelector('div').innerHTML =
  '<span class=' +
  escapeHTML('"fakeclass" onclick="alert("test")') +
  '>' +
  escapeHTML('<script>alert("inspect the attributes")\u003C/script>') +
  '</span>'

<div></div>

You should verify the entity ranges I have provided to validate the safety of the function yourself. You could also use this regular expression which has better readability and should cover the same character codes, but is about 10% less performant in my browser:

/(?![0-9A-Za-z])[\u0000-\u00FF]/g
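
One way to follow the advice about verifying the ranges is to compare the two patterns over every code point from 0 to 255. The sketch below does that; the variable names are illustrative, and the `g` flag is dropped so that `test()` does not carry state between calls.

```javascript
// Both patterns from the answer, without the g flag so test() is stateless.
const explicitRanges = /[\u0000-\u002F\u003A-\u0040\u005B-\u0060\u007B-\u00FF]/;
const lookaheadForm = /(?![0-9A-Za-z])[\u0000-\u00FF]/;

// Collect every code point in 0..255 on which the two patterns disagree.
const mismatches = [];
let matched = 0;
for (let code = 0; code <= 0xff; code++) {
  const ch = String.fromCharCode(code);
  const a = explicitRanges.test(ch);
  const b = lookaheadForm.test(ch);
  if (a !== b) mismatches.push(code);
  if (a) matched++;
}

console.log(mismatches.length === 0 ? "patterns agree" : mismatches);
console.log(matched, "of 256 code points would be escaped");
```

An empty `mismatches` array means the more readable pattern can be swapped in without changing which characters get encoded.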

76484 · Accepted Answer · 2018-08-03 04:00:48Z

22

It was interesting to find a better solution:

var escapeHTML = function(unsafe) {
  return unsafe.replace(/[&<"']/g, function(m) {
    switch (m) {
      case '&':
        return '&amp;';
      case '<':
        return '&lt;';
      case '"':
        return '&quot;';
      default:
        return '&#039;';
    }
  });
};

I do not parse > because it does not break XML/HTML code in the result.

Here are the benchmarks: http://jsperf.com/regexpairs Also, I created a universal escape function: http://jsperf.com/regexpairs2

edited Aug 3, 2018 at 4:00

76484

9,0013 gold badges23 silver badges33 bronze badges

answered Feb 11, 2015 at 15:41

iegik

1,4932 gold badges18 silver badges30 bronze badges

7 Comments

Peter T. Over a year ago

It's interesting to see that using the switch is significantly faster than the map. I didn't expect this! Thanks for sharing!

vsync Over a year ago

There are many, many more Unicode characters than you could possibly code and take into account. I wouldn't recommend this manual method at all.

Neonit Over a year ago

Why would you escape multi-byte characters at all? Just use UTF-8 everywhere.

jgmjgm Over a year ago

Skipping > can potentially break code. You must keep in mind that inside the <> is also html. In that case skipping > will break. If you're only escaping for between tags then you probably only need escape < and &.

Finesse Over a year ago

It can be simplified to unsafe.replace(/[&<>"']/g, c => `&#${c.charCodeAt(0)}`)


user · Accepted Answer · 2017-11-29 03:12:51Z

18

The most concise and performant way to display unencoded text is to use the textContent property.

Faster than using innerHTML. And that's without taking into account escaping overhead.

document.body.textContent = 'a <b> c </b>';

edited Nov 29, 2017 at 3:12

answered Nov 29, 2017 at 2:57

user

26.2k13 gold badges118 silver badges104 bronze badges

Comments

teknopaul · Accepted Answer · 2017-08-21 10:27:56Z

8

DOM Elements support converting text to HTML by assigning to innerText. innerText is not a function but assigning to it works as if the text were escaped.

document.querySelectorAll('#id')[0].innerText = 'unsafe " String >><>';

answered Aug 21, 2017 at 10:27

teknopaul

6,8152 gold badges33 silver badges26 bronze badges

2 Comments

ZzZombo Over a year ago

At least in Chrome assigning multiline text adds <br> elements in place of newlines, that can break certain elements, like styles or scripts. The createTextNode is not prone to this problem.

Roy Tinker Over a year ago

innerText has some legacy/spec issues. Better to use textContent.

Dave Brown · Accepted Answer · 2015-07-26 13:54:10Z

You can encode every character in your string:

function encode(e){return e.replace(/[^]/g,function(e){return"&#"+e.charCodeAt(0)+";"})}

Or just target the main characters to worry about (&, linebreaks, <, >, " and ') like:

function encode(r){
return r.replace(/[\x26\x0A\<>'"]/g,function(r){return"&#"+r.charCodeAt(0)+";"})
}

test.value=encode('How to encode\nonly html tags &<>\'" nice & fast!');

/*************
* \x26 is the ampersand (it has to be first),
* \x0A is newline,
*************/

<textarea id=test rows="9" cols="55">&#119;&#119;&#119;&#46;&#87;&#72;&#65;&#75;&#46;&#99;&#111;&#109;</textarea>
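
One caveat with the encode-everything variant: `charCodeAt` works on UTF-16 code units, so a character outside the Basic Multilingual Plane (most emoji, for instance) would come out as two surrogate references rather than one. A possible sketch that iterates by code point instead is shown below; `encodeAll` is a hypothetical name and not part of the original answer.

```javascript
// Array.from iterates a string by code point, so astral characters stay
// whole and codePointAt(0) returns their full code point.
function encodeAll(text) {
  return Array.from(String(text), ch => `&#${ch.codePointAt(0)};`).join("");
}

console.log(encodeAll("a<\u{1F600}"));
// &#97;&#60;&#128512;
```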

Writing your own escape function is generally a bad idea. Other answers are better in this regard.

Finesse · Accepted Answer · 2024-05-14 09:46:36Z

4

A universal one-liner, working in browsers and Node.js:

const html = unsafe.replace(/[&<>"']/g, c => `&#${c.charCodeAt(0)};`)
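
To make the output format concrete, here is the same one-liner wrapped in a function and applied to a sample string. The wrapper name `escapeNumeric`, the `String()` coercion, and the input are illustrative assumptions.

```javascript
// The one-liner above as a reusable function; each special character is
// replaced by its decimal numeric character reference.
const escapeNumeric = unsafe =>
  String(unsafe).replace(/[&<>"']/g, c => `&#${c.charCodeAt(0)};`);

console.log(escapeNumeric(`<b>"Tom" & 'Jerry'</b>`));
// &#60;b&#62;&#34;Tom&#34; &#38; &#39;Jerry&#39;&#60;/b&#62;
```

Because every replacement is computed from the character code, no lookup table is needed, at the cost of less readable output than named entities.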

edited May 14, 2024 at 9:46

answered May 3, 2024 at 1:21

Finesse

11k8 gold badges72 silver badges98 bronze badges

1 Comment

Wilco Over a year ago

Small remark to the code above, the html entity should be closed with a semicolon, so it should be &#${c.charCodeAt(0)};

Peter Mortensen · Accepted Answer · 2021-06-15 21:46:56Z

3

If you already use modules in your application, you can use escape-html module.

import escapeHtml from 'escape-html';
const unsafeString = '<script>alert("XSS");</script>';
const safeString = escapeHtml(unsafeString);

edited Jun 15, 2021 at 21:46

Peter Mortensen

31.4k22 gold badges110 silver badges134 bronze badges

answered Mar 11, 2020 at 15:13

Shimon S

4,2252 gold badges33 silver badges35 bronze badges

Comments

M. Justin · Accepted Answer · 2024-03-13 19:24:22Z

1

For a quick one-liner, the following works:

const escaped = new Option(unescaped).innerHTML;

For example:

const unescaped = "<h1>Header</h1>";
const escaped = new Option(unescaped).innerHTML; // "&lt;h1&gt;Header&lt;/h1&gt;"

answered Mar 13, 2024 at 19:24

M. Justin

23k12 gold badges133 silver badges167 bronze badges

2 Comments

Flimm Feb 17 at 8:44

This doesn't escape double quotes and single quotes, which makes it unsafe for use within HTML attributes.

M. Justin Feb 18 at 16:21

@Flimm While good to be aware of, nothing in the way the question is worded makes me think the question asker was intending to use this unescaped HTML within an attribute. If someone does intend to use it in that manner, you're right that further encoding is needed, since attributes have different restrictions.

Peter Mortensen · Accepted Answer · 2021-06-15 21:46:06Z

0

I came across this issue when building a DOM structure. This question helped me solve it. I wanted to use a double chevron as a path separator, but appending a new text node directly resulted in the escaped character code showing, rather than the character itself:

var _div = document.createElement('div');
var _separator = document.createTextNode('&raquo;');
//_div.appendChild(_separator); /* This resulted in '&raquo;' being displayed */
_div.innerHTML = _separator.textContent; /* This was key */

edited Jun 15, 2021 at 21:46

Peter Mortensen

31.4k22 gold badges110 silver badges134 bronze badges

answered Jul 30, 2019 at 8:36

Silas

91 bronze badge

Comments

Soumen Khara · Accepted Answer · 2021-04-14 08:41:38Z

Just write the code between <pre><code class="html-escape">....</code></pre>, and make sure you add the class name to the code tag. The script will then escape every HTML snippet written inside <pre><code class="html-escape">....</code></pre>.

const escape = {
    '"': '&quot;',
    '&': '&amp;',
    '<': '&lt;',
    '>': '&gt;',
}
const codeWrappers = document.querySelectorAll('.html-escape')
if (codeWrappers.length > 0) {
    codeWrappers.forEach(code => {
        const htmlCode = code.innerHTML
        const escapeString = htmlCode.replace(/"|&|<|>/g, function (matched) {
            return escape[matched];
        });
        code.innerHTML = escapeString
    })
}

<pre>
    <code class="language-html html-escape">
        <div class="card">
            <div class="card-header-img" style="background-image: url('/assets/card-sample.png');"></div>
            <div class="card-body">
                <p class="card-title">Card Title</p>
                <p class="card-subtitle">Secondary text</p>
                <p class="card-text">Greyhound divisively hello coldly wonderfully marginally far upon
                    excluding.</p>
                <button class="btn">Go to </button>
                <button class="btn btn-outline">Go to </button>
            </div>
        </div>
    </code>
</pre>

user2226755 · Accepted Answer · 2024-09-17 13:31:41Z

I think you should change the way you do it: don't escape HTML in order to assign it to innerHTML afterwards, that is the wrong approach. Instead, create an element with createElement, assign the untrusted input to innerText, and then insert the element with appendChild, prepend, after, or insertBefore.

Solution for Vanilla JavaScript in a DOM environment

Instead of:

// vulnerable
const html = "<b>Hello World!</b>"
const element = `<div>${html}</div>`

document.body.innerHTML = element

You should do:

// secure
const html = '<b>Hello World!</b>'
const element = document.createElement('div')
element.innerText = html 

document.body.appendChild(element)

⚠️ Warning Never do document.body.innerHTML = element.innerText
