How to remove HTML tags from a string using JavaScript ?
Last Updated :
10 Jan, 2025
Removing HTML tags from a string in JavaScript means stripping out the markup elements, leaving only the plain text content. This is typically done to sanitize user input or to extract readable text from HTML code, ensuring no unwanted tags remain in the string.
HTML tags come in two forms: opening tags and closing tags. Understanding this distinction is crucial when parsing and manipulating HTML content.
- Opening tag: It starts with a '<', followed by an HTML keyword, and ends with a '>'. <html>, <br>, <title> are some examples of HTML opening tags.
- Closing tag: It starts with a '</', followed by an HTML keyword, and ends with a '>'.</html>, </title> are examples of HTML closing tags.
Approach 1: Using replace() function
The replace() function, combined with regular expressions, can identify and remove HTML tags from a string. This method uses patterns to find tags, making it effective for quick, simple tag removal.
Example: In this example, the removeTags function strips HTML tags from a string using a regular expression. It returns the cleaned string, leaving only the text content without any HTML elements.
javascript
function removeTags(str) {
if ((str === null) || (str === ''))
return false;
else
str = str.toString();
// Regular expression to identify HTML tags in
// the input string. Replacing the identified
// HTML tag with a null string.
return str.replace(/(<([^>]+)>)/ig, '');
}
console.log(removeTags(
'<html>Welcome to GeeksforGeeks.</html>'));;
Output:
Welcome to GeeksforGeeks.
Approach 2 : Using .textContent property or .innerText property
Using the .textContent or .innerText properties involves creating a temporary DOM element, setting its .innerHTML to the HTML string, and then accessing .textContent or .innerText to extract plain text, effectively removing all HTML tags from the string.
Example: In this example we extracts and logs text content from an HTML string by setting the innerHTML of a created div element, then retrieving the text using textContent or innerText.
javascript
// HTML tags contain text
let html = "<p>A Computer Science "
+ "Portal for Geeks</p>";
let div = document.createElement("div");
div.innerHTML = html;
let text = div.textContent || div.innerText || "";
console.log(text)
Output:
A Computer Science Portal for Geeks
Approach 3: Using DOMParser to Parse and Extract Text Content
Using the DOMParser approach, you create a DOMParser instance to parse the HTML string into a DOM document. By accessing the parsed document’s `.textContent` from the body, you can extract the plain text, effectively stripping out all HTML tags.
Example: The function removeHTMLTags parses an HTML string into plain text by using DOMParser, extracting the text content, trimming whitespace, and returning it.
JavaScript
function removeHTMLTags(htmlString) {
// Create a new DOMParser instance
const parser = new DOMParser();
// Parse the HTML string
const doc = parser.parseFromString(htmlString, 'text/html');
// Extract text content
const textContent = doc.body.textContent || "";
// Trim whitespace
return textContent.trim();
}
const htmlString = "<p>Welcome to <strong>GeeksforGeeks</strong>.</p>";
const textContent = removeHTMLTags(htmlString);
console.log(textContent);
Output:
Welcome to GeeksforGeeks.
Similar Reads
Non-linear Components In electrical circuits, Non-linear Components are electronic devices that need an external power source to operate actively. Non-Linear Components are those that are changed with respect to the voltage and current. Elements that do not follow ohm's law are called Non-linear Components. Non-linear Co
11 min read
JavaScript Tutorial JavaScript is a programming language used to create dynamic content for websites. It is a lightweight, cross-platform, and single-threaded programming language. It's an interpreted language that executes code line by line, providing more flexibility.JavaScript on Client Side: On the client side, Jav
11 min read
Web Development Web development is the process of creating, building, and maintaining websites and web applications. It involves everything from web design to programming and database management. Web development is generally divided into three core areas: Frontend Development, Backend Development, and Full Stack De
5 min read
Spring Boot Tutorial Spring Boot is a Java framework that makes it easier to create and run Java applications. It simplifies the configuration and setup process, allowing developers to focus more on writing code for their applications. This Spring Boot Tutorial is a comprehensive guide that covers both basic and advance
10 min read
React Interview Questions and Answers React is an efficient, flexible, and open-source JavaScript library that allows developers to create simple, fast, and scalable web applications. Jordan Walke, a software engineer who was working for Facebook, created React. Developers with a JavaScript background can easily develop web applications
15+ min read
React Tutorial React is a powerful JavaScript library for building fast, scalable front-end applications. Created by Facebook, it's known for its component-based structure, single-page applications (SPAs), and virtual DOM,enabling efficient UI updates and a seamless user experience.Note: The latest stable version
7 min read
JavaScript Interview Questions and Answers JavaScript is the most used programming language for developing websites, web servers, mobile applications, and many other platforms. In Both Front-end and Back-end Interviews, JavaScript was asked, and its difficulty depends upon the on your profile and company. Here, we compiled 70+ JS Interview q
15+ min read
Class Diagram | Unified Modeling Language (UML) A UML class diagram is a visual tool that represents the structure of a system by showing its classes, attributes, methods, and the relationships between them. It helps everyone involved in a projectâlike developers and designersâunderstand how the system is organized and how its components interact
12 min read
Backpropagation in Neural Network Back Propagation is also known as "Backward Propagation of Errors" is a method used to train neural network . Its goal is to reduce the difference between the modelâs predicted output and the actual output by adjusting the weights and biases in the network.It works iteratively to adjust weights and
9 min read
3-Phase Inverter An inverter is a fundamental electrical device designed primarily for the conversion of direct current into alternating current . This versatile device , also known as a variable frequency drive , plays a vital role in a wide range of applications , including variable frequency drives and high power
13 min read