How to Remove HTML Tags from String in Python

Last Updated : 26 Nov, 2024

Removing HTML tags from a string in Python can be done with regular expressions or with an HTML parsing library such as Beautiful Soup or lxml. Regular expressions need no extra dependencies and suit short, well-formed fragments; a parser is the safer choice for real-world or malformed markup. Let’s explore how to remove HTML tags with each approach.

Using Regular Expressions

The simplest way to remove HTML tags is the re module from the standard library. It is lightweight and works well for straightforward cases.

```python
import re

# Sample string with HTML tags
s1 = "<h1>Welcome to Python Programming</h1>"

# Remove HTML tags using regex
s2 = re.sub(r"<.*?>", "", s1)
print(s2)
```

Output:

```
Welcome to Python Programming
```

The re.sub() method replaces every match of the pattern <.*?> with an empty string. The non-greedy .*? stops at the first > after each <, so each tag is matched separately and the text between tags is kept. The pattern does not understand HTML, though: by default . does not match a newline, so a tag split across lines is left in place, and character entities such as &amp; are not decoded.

Let's explore other methods of removing HTML tags from a string in Python:

Using Beautiful Soup (For Nested HTML Structures)

For more complex input, especially malformed HTML, Beautiful Soup (installed as the beautifulsoup4 package) is a preferred choice. It builds a parse tree instead of matching text patterns, so it handles a wider range of edge cases.

```python
from bs4 import BeautifulSoup

# Sample string with nested HTML tags
s1 = "<h1>Welcome to <b>Python Programming</b></h1>"

# Remove HTML tags using Beautiful Soup
soup = BeautifulSoup(s1, "html.parser")
s2 = soup.get_text()
print(s2)
```

Output:

```
Welcome to Python Programming
```

Beautiful Soup parses the string as HTML, and get_text() returns only the text content, including the text inside the nested <b> tag.

Using lxml

The lxml library is another efficient option, especially for performance-critical applications. It is built on the C library libxml2, so it parses the HTML and extracts the text content with little overhead.
```python
from lxml.html import fromstring

# Sample string with HTML tags
s1 = "<h1>Welcome to Python Programming</h1>"

# Remove HTML tags using lxml
tree = fromstring(s1)
s2 = tree.text_content()
print(s2)
```

Output:

```
Welcome to Python Programming
```

The text_content() method returns the text of the element and all of its children, leaving out the tags themselves.
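The regex pattern from the first method has two limits that are easier to see in a small run: a tag that spans a newline is not matched, and entities such as &amp; stay encoded. The following is a minimal sketch of one way to handle both using only the standard library. The helper name strip_tags and the sample string are assumptions made for illustration, and the result is still pattern matching rather than real HTML parsing.

```python
import html
import re

# The article's pattern, compiled with re.DOTALL so "." also matches newlines.
TAG_PATTERN = re.compile(r"<.*?>", re.DOTALL)


def strip_tags(text: str) -> str:
    """Hypothetical helper: drop tag-like spans, then decode entities."""
    without_tags = TAG_PATTERN.sub("", text)
    # html.unescape turns "&amp;" into "&", "&lt;" into "<", and so on.
    return html.unescape(without_tags)


# Illustrative input: the opening tag is split across two lines.
sample = '<p class="note"\n   id="intro">Tom &amp; Jerry</p>'

# Without re.DOTALL the opening tag is left in place; only </p> is removed.
naive = re.sub(r"<.*?>", "", sample)
cleaned = strip_tags(sample)

print(repr(naive))
print(repr(cleaned))
```

Entities are decoded after the tags are removed, so an encoded sequence such as &lt;b&gt; ends up as the literal text <b> in the result instead of being stripped as a tag.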