Python - Bigrams Frequency in String
Last Updated :
12 Apr, 2023
When working with text data in Python, we sometimes need to extract bigrams (pairs of adjacent characters) from a string, a common task in NLP. Often we also need the frequency of each unique bigram in the data. Let's discuss several ways in which this task can be performed.
Method #1 : Using Counter() + generator expression
In this method, a generator expression builds each bigram by slicing the string, and Counter() tallies how often each bigram occurs.
Python3
# Python3 code to demonstrate working of
# Bigrams Frequency in String
# Using Counter() + generator expression
from collections import Counter
# initializing string
test_str = 'geeksforgeeks'
# printing original string
print("The original string is : " + str(test_str))
# Bigrams Frequency in String
# Using Counter() + generator expression
res = Counter(test_str[idx : idx + 2] for idx in range(len(test_str) - 1))
# printing result
print("The Bigrams Frequency is : " + str(dict(res)))
Output :
The original string is : geeksforgeeks
The Bigrams Frequency is : {'ge': 2, 'ee': 2, 'ek': 2, 'ks': 2, 'sf': 1, 'fo': 1, 'or': 1, 'rg': 1}
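Because the result is a Counter, it can also be queried directly without converting it to a dict. The following is a minimal sketch under that assumption; the variable name `repeated` and the lookup of `'zz'` are illustrative choices, not part of the original example.

```python
from collections import Counter

test_str = 'geeksforgeeks'
res = Counter(test_str[idx : idx + 2] for idx in range(len(test_str) - 1))

# look up a single bigram; a bigram that never occurs counts as zero
print(res['ge'])   # 2
print(res['zz'])   # 0

# bigrams that occur more than once, sorted alphabetically
repeated = sorted(bigram for bigram, count in res.items() if count > 1)
print(repeated)    # ['ee', 'ek', 'ge', 'ks']
```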
Method #2 : Using Counter() + zip() + map() + join()
In this method, zip() pairs each character with its successor, map() with ''.join turns each pair into a two-character string, and Counter() tallies the resulting bigrams.
Python3
# Python3 code to demonstrate working of
# Bigrams Frequency in String
# Using Counter() + zip() + map() + join
from collections import Counter
# initializing string
test_str = 'geeksforgeeks'
# printing original string
print("The original string is : " + str(test_str))
# Bigrams Frequency in String
# Using Counter() + zip() + map() + join
res = Counter(map(''.join, zip(test_str, test_str[1:])))
# printing result
print("The Bigrams Frequency is : " + str(dict(res)))
Output :
The original string is : geeksforgeeks
The Bigrams Frequency is : {'ge': 2, 'ee': 2, 'ek': 2, 'ks': 2, 'sf': 1, 'fo': 1, 'or': 1, 'rg': 1}
Time Complexity: O(n)
Auxiliary Space: O(n)
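A quick way to see why both methods are linear is that a string of length n contains exactly n - 1 bigrams, and each is visited once. The sketch below checks this and confirms that the two constructions agree; the function names `bigrams_by_slicing` and `bigrams_by_zip` are hypothetical wrappers introduced only for this illustration.

```python
from collections import Counter

def bigrams_by_slicing(text):
    # Method #1: slice a two-character window at every index
    return Counter(text[i:i + 2] for i in range(len(text) - 1))

def bigrams_by_zip(text):
    # Method #2: pair each character with its successor
    return Counter(map(''.join, zip(text, text[1:])))

sample = 'geeksforgeeks'
by_slicing = bigrams_by_slicing(sample)
by_zip = bigrams_by_zip(sample)

print(by_slicing == by_zip)                      # True
print(sum(by_zip.values()) == len(sample) - 1)   # True
print(bigrams_by_zip('g'))                       # Counter()
```

A string shorter than two characters has no bigrams, so both functions return an empty Counter for it.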
Method #3 : Using a loop and a dictionary
- Initialize an empty dictionary to keep track of the bigram frequencies.
- Loop through the characters in the input string, starting from the second character.
- For each character, get the previous character and concatenate them to form a bigram.
- Check if the bigram is already in the dictionary.
- If the bigram is not in the dictionary, add it with a frequency of 1.
- If the bigram is already in the dictionary, increment its frequency by 1.
- Print the bigram frequencies.
Python3
# Python3 code to demonstrate working of
# Bigrams Frequency in String
# Using a loop and dictionary
# initializing string
test_str = 'geeksforgeeks'
# printing original string
print("The original string is : " + str(test_str))
# Bigrams Frequency in String
# Using a loop and dictionary
freq_dict = {}
for i in range(1, len(test_str)):
    bigram = test_str[i-1:i+1]
    if bigram in freq_dict:
        freq_dict[bigram] += 1
    else:
        freq_dict[bigram] = 1
# printing result
print("The Bigrams Frequency is : " + str(freq_dict))
Output :
The original string is : geeksforgeeks
The Bigrams Frequency is : {'ge': 2, 'ee': 2, 'ek': 2, 'ks': 2, 'sf': 1, 'fo': 1, 'or': 1, 'rg': 1}
Time complexity: O(n), where n is the length of the input string.
Auxiliary space: O(k), where k is the number of unique bigrams in the input string.
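The membership check and the two branches can be folded into a single statement with dict.get(), which returns a default when the key is missing. This is a minimal sketch of that variation; the function name `bigram_frequencies` is a hypothetical wrapper and not part of the original code.

```python
def bigram_frequencies(text):
    freq = {}
    for i in range(1, len(text)):
        bigram = text[i - 1:i + 1]
        # get() returns 0 for a bigram seen for the first time
        freq[bigram] = freq.get(bigram, 0) + 1
    return freq

print(bigram_frequencies('geeksforgeeks'))
# {'ge': 2, 'ee': 2, 'ek': 2, 'ks': 2, 'sf': 1, 'fo': 1, 'or': 1, 'rg': 1}
```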
Method #4 : Using count() method
Approach
- Loop over the string and append every bigram (obtained by slicing test_str) to a list x; also create an empty dictionary freq_dict.
- Loop over x and store each bigram as a key in freq_dict, with its number of occurrences in x as the value. The count is taken on the list x rather than on test_str because str.count() skips overlapping matches.
- Display the dictionary.
Python3
# Python3 code to demonstrate working of
# Bigrams Frequency in String
# Using count() method
# initializing string
test_str = 'geeksforgeeks'
# printing original string
print("The original string is : " + str(test_str))
# Bigrams Frequency in String
# Using count() method
freq_dict = {}
x = []
# collect every overlapping bigram
for i in range(1, len(test_str)):
    bigram = test_str[i-1:i+1]
    x.append(bigram)
# count each bigram in the list of bigrams
for i in x:
    freq_dict[i] = x.count(i)
# printing result
print("The Bigrams Frequency is : " + str(freq_dict))
Output :
The original string is : geeksforgeeks
The Bigrams Frequency is : {'ge': 2, 'ee': 2, 'ek': 2, 'ks': 2, 'sf': 1, 'fo': 1, 'or': 1, 'rg': 1}
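Time Complexity : O(N^2) in the worst case, where N is the length of the input string, since count() performs a linear scan once for each of the N - 1 bigrams.
Auxiliary Space : O(N) for the list of bigrams x and the dictionary freq_dict.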
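A caveat worth checking with count()-based approaches: str.count() counts non-overlapping occurrences, so calling it on the original string can undercount bigrams made of a repeated character. The sketch below compares it with a sliding-window count on a made-up input, 'aaaa', chosen only to expose the difference.

```python
from collections import Counter

text = 'aaaa'

# sliding window: 'aa' starts at indices 0, 1 and 2
sliding = Counter(text[i:i + 2] for i in range(len(text) - 1))
print(sliding['aa'])      # 3

# str.count() resumes searching after each match, so it finds only two
print(text.count('aa'))   # 2
```

For an input such as 'geeksforgeeks', where no bigram overlaps with itself, the two counts coincide.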