NLP Assignment Questions
NLP Assignment Questions
1. Use requests and beautiful soup to download a page from the internet
a. Follow the example under https://www.digitalocean.com/community/tutorials/how-to-
work-with-web-data-using-requests-and-beautiful-soup-with-python-3
2. Once you have the texts from the webpage, perform some basic text analytics
1. Split the text into sentences
2. Tokenize
3. Person POS tagging
4. Remove the stop words
5. Perform lemmatization
6. Plot a frequency distribution of the most important words in the document before and after
lemmatization
a. What are the top 5 most important words before and after lemmatization?
b. What are the bottom 5 words before and after lemmatization?