NLP MTE Syllabus and Practice Problems
Practice Problems
1. Write a Python program to count the number of characters in a given text.
2. Create a function that takes a string as input and returns the reversed version of it.
3. Write a program to check if a given string is a palindrome or not.
4. Implement a function that counts the occurrences of each word in a text.
5. Read a text file and print the number of lines it contains.
6. Write a Python script to copy the contents of one text file to another.
7. Create a program that searches for a specific word in a text file and prints its occurrences.
8. Write a regular expression to match valid email addresses.
12. Create a regex pattern to extract all the phone numbers from a given text.
13. Implement a regex pattern to identify and extract dates from a text.
14. Tokenize a given sentence into individual words using Python.
16. Apply stemming to a list of words using a stemming algorithm of your choice.
17. Use a lemmatization library to lemmatize a set of words.
18. Remove stop words from a given text using Python.
19. Write a program that uses phrase matching to find specific phrases in a text.
20. Create a simple unigram language model for a given corpus.
21. Implement a bigram language model for predicting the next word in a sentence.
22. Use a part-of-speech tagging library to tag the parts of speech in a sentence.
23. Calculate precision, recall, and F1 score for a binary classification model.
24. Visualize the parts of speech distribution in a given text.
25. Calculate the Term Frequency-Inverse Document Frequency (TF-IDF) for a set of documents.
26. Apply Named Entity Recognition to identify entities in a given text.
27. Apply unsupervised text classification techniques to cluster similar documents.
28. Create a confusion matrix for a multi-class text classification problem.
29. Analyze the sentiment of a set of tweets using a sentiment mining approach.
30. Implement a supervised text classification model to classify news articles into categories.
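
Sample sketch for problems 14 and 21. This is one possible starting point, not a model answer: it tokenizes with a simple regular expression and predicts the next word from raw bigram counts, with no smoothing. The function names (tokenize, train_bigram_model, predict_next) and the three-sentence corpus are made up for illustration and use only the Python standard library.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and return runs of letters, digits and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train_bigram_model(sentences):
    """Count, for each word, which words follow it in the corpus."""
    followers = defaultdict(Counter)
    for sentence in sentences:
        tokens = tokenize(sentence)
        # Pair every token with the token that comes right after it.
        for current_word, next_word in zip(tokens, tokens[1:]):
            followers[current_word][next_word] += 1
    return followers


def predict_next(followers, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    counts = followers.get(word.lower())
    if not counts:
        return None
    return counts.most_common(1)[0][0]


# Toy corpus, invented for this example only.
corpus = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "the dog sat on the rug",
]
model = train_bigram_model(corpus)

print(predict_next(model, "the"))
print(predict_next(model, "sat"))
```

A real solution would normally add sentence boundary markers and some form of smoothing so that unseen word pairs do not get zero probability.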
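
Sample sketch for problem 23. The function below computes precision, recall, and F1 from two lists of labels using the usual definitions (precision = TP / (TP + FP), recall = TP / (TP + FN), F1 = harmonic mean of the two). The function name, the `positive` parameter, and the label lists are assumptions chosen for illustration; returning 0.0 when a denominator is zero is one common convention, not the only one.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Return (precision, recall, f1) for the class given by `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Fall back to 0.0 when a denominator would be zero (a convention, see above).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Made-up gold labels and predictions for a binary classifier.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 1, 1]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

With these labels there are 3 true positives, 1 false positive, and 1 false negative, so all three scores come out to 0.75.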