
Ongoing FYP Progress: Current Module

The document summarizes the current progress of an ongoing FYP project involving resume parsing. It discusses using pypdf2 instead of pdfminer for resume extraction. It notes some CVs with images are not processed correctly and wide datasets are needed to capture experience and qualifications. Regular expressions have problems extracting phone numbers correctly in some cases. The current workspace and tools being used are listed, along with problems extracting phone numbers from some resumes consistently. Questions about context, self, and as_view functions are also included. Storing applicant resume details in a database and handling one login form for applicants and companies are discussed as other issues.

Uploaded by

haider

Ongoing FYP Progress

Current module
Resume Parser

What I have done so far


I studied PyPDF2 as an alternative to pdfminer and followed its resume text extraction steps.

Some CVs that start with an image, or contain images, are not processed.

Large datasets of experience and qualification terms are required to capture both from resumes.

The regular expression for extracting phone numbers has a problem: international numbers are sometimes truncated.

The project so far is being saved in a zip file.

Current workspace
1. Chrome tabs
a. http://www.nltk.org/book/
b. https://www.nltk.org/book/ch07.html
c. http://www.nltk.org/howto/chunk.html
d. https://m-clark.github.io/text-analysis-with-R/img/POS-Tags.png
e. https://medium.com/@divalicious.priya/information-extraction-from-cv-acec216c3f48
f. https://regexr.com/
g. https://help.libreoffice.org/Common/List_of_Regular_Expressions
h. https://www.onlinegdb.com/online_python_interpreter
i. https://www.youtube.com/watch?v=nxhCyeRR75Q&t=18s
j. https://www.youtube.com/watch?v=yGKTphqxR9Q&list=PLQVvvaa0QuDf2JswnfiGkliBInZnIC4HL&index=3
k. https://www.datacamp.com/community/tutorials/stemming-lemmatization-python
l. https://pythonprogramming.net/lemmatizing-nltk-tutorial/

Problems
1. The phone-extraction regular expression seems correct (regexr.com confirms it as well). It captures the complete phone number in Salman Anjum's CV, but not in haider's CV or in Resume - Rohini Prakash.
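
One possible cause, offered only as an assumption because the actual pattern is not shown in these notes: if the pattern wraps the country code in a capturing group, Python's re.findall returns only the captured group instead of the whole match, whereas regexr.com highlights the whole match. The sketch below uses a hypothetical pattern (PHONE_RE), a hypothetical helper (extract_phones), and made-up sample numbers to show the difference.

```python
import re

# Hypothetical pattern (not the project's actual one): an optional country
# code in a capturing group, then digit groups separated by space, dot or hyphen.
PHONE_RE = re.compile(
    r"(\+?\d{1,3}[\s.-]?)?\(?\d{3,4}\)?[\s.-]?\d{3}[\s.-]?\d{4}"
)

def extract_phones(text):
    """Return the full text of every match, not just the captured group."""
    return [m.group(0).strip() for m in PHONE_RE.finditer(text)]

sample = "Contact: +92 300 1234567 or 0300-1234567"  # made-up numbers

# With one capturing group, findall returns only that group's text,
# so the rest of each number is lost.
print(PHONE_RE.findall(sample))

# finditer with group(0) returns the complete matched number.
print(extract_phones(sample))
```

If this is the cause, making the group non-capturing with (?:...) or reading group(0) from finditer should return the complete number. Differences between the regex flavor used by regexr.com and Python's re module are another thing worth checking.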

Questions
1. What is the difference between context={} and context=[]?
2. What does 'self' do in Python classes?
3. What does the .as_view() function do?
4. What is the difference between path('') and url('')?
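
For questions 1 and 2, a plain-Python sketch may help (no Django needed). {} creates a dict, which maps names to values, while [] creates a list, which only has positions. Django's render() expects its context argument to be a dict so that templates can look values up by name. 'self' is not a reserved keyword but the conventional name of the first method parameter, which receives the instance the method was called on. The class Applicant, its fields, and the sample values are hypothetical.

```python
class Applicant:
    def __init__(self, name, phone):
        # self is the instance being created; attributes are stored on it
        self.name = name
        self.phone = phone

    def to_context(self):
        # self gives the method access to this particular instance's data
        return {"name": self.name, "phone": self.phone}

a = Applicant("Sample Name", "0300-1234567")   # made-up values
b = Applicant("Other Name", "0311-7654321")

context_dict = a.to_context()        # dict: values are looked up by key
context_list = [a.name, a.phone]     # list: values are looked up by position

print(context_dict["name"])          # access by name
print(context_list[0])               # access by index only

# b.to_context() is the same call as Applicant.to_context(b):
# Python passes the instance in as self.
print(Applicant.to_context(b) == b.to_context())
```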
Other issues in project
How to recognize applicant / company from one login / register form

Resume db. Storing every applicant's resume details
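
One way both issues could be approached, sketched here with Python's built-in sqlite3 module rather than the project's actual database layer: keep a role column on a single account table so that one login / register form can tell applicants from companies, and store the parsed resume fields in a separate table keyed by account. All table, column, and function names are hypothetical, and password handling is left out.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # in-memory database, only for this sketch
conn.executescript("""
CREATE TABLE account (
    id    INTEGER PRIMARY KEY,
    email TEXT UNIQUE NOT NULL,
    role  TEXT NOT NULL CHECK (role IN ('applicant', 'company'))
);
CREATE TABLE resume (
    account_id     INTEGER NOT NULL REFERENCES account(id),
    phone          TEXT,
    qualifications TEXT,
    experience     TEXT
);
""")

def register(email, role):
    """Create an account with a role and return its id."""
    cur = conn.execute(
        "INSERT INTO account (email, role) VALUES (?, ?)", (email, role)
    )
    return cur.lastrowid

def save_resume(account_id, phone, qualifications, experience):
    """Store the fields extracted by the resume parser for one applicant."""
    conn.execute(
        "INSERT INTO resume VALUES (?, ?, ?, ?)",
        (account_id, phone, qualifications, experience),
    )

def role_for(email):
    """Return 'applicant' or 'company' for a login, or None if unknown."""
    row = conn.execute(
        "SELECT role FROM account WHERE email = ?", (email,)
    ).fetchone()
    return row[0] if row else None

applicant_id = register("applicant@example.com", "applicant")
register("hr@example.com", "company")
save_resume(applicant_id, "0300-1234567", "BSc", "2 years")

print(role_for("applicant@example.com"))
print(role_for("hr@example.com"))
```

After login, the value returned by role_for could decide which dashboard to show. In a Django project the same idea would normally be expressed with models rather than raw SQL.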
