With the emergence of Job portal's like Linked-In, Indeed, Zip-Recruiter, Hire, etc. , job searching has become more convenient. The portals allow the job seeker to find all relevant jobs at one place. The job portals provide job seekers with easy-apply, apply via link & email to the recruiter type of facilities. These enable the job seeker to apply for many jobs in quick time which resulted in getting many applications for a single job posting. The companies shortlist candidates by parsing their resumes to match with job-specific criteria. The companies find it difficult to parse & extract the candidate's information due to the presence of different resume structures. Only those candidates get shortlisted whose resume is correctly parsed & which satisfy the job-specific criteria. Further, The job seekers fail to understand why their resume not getting shortlisted. Also, job seekers find it hard to identify whether the resume has all keywords (i.e. skills, experience, qualification, etc.) & does it meet the job description criteria which could be due to the content or format of the resume. This project focuses on extracting candidate information from its resume which is in PDF format. A Hybrid technique was used content-based & layout-based techniques for resume parsing. The hybrid approach uses a blend of rule-based and segmentation-based techniques for effective resume parsing.
Meteor, Python 2.7, Anaconda, PyCharm IDE
pdfminer, bson, gridfs, datefinder, fuzzywuzzy, nltk, pymongo
Cloning the repository:
git clone https://github.com/jineshdhruv8/ResumeParser.gitTo install Meteor in OS X & Linux:
curl https://install.meteor.com/ | shTo install Meteor in Windows:
choco install meteorTo setup Meteor, go to "ResumeParser/resume-parser" directory and run all the below command sequentially:
meteor npm install --save babel-runtimemeteor npm install --save core-jsmeteor add sessionmeteor remove autopublishmeteorNow the App will be running at: http://localhost:3000/
Don't close the terminal and open new terminal to install python dependencies
After installing anaconda and Python 2.7, install all dependencies
conda install -c conda-forge pdfminer conda install -c conda-forge bson pip install datefinderpip install fuzzywuzzypip install -U nltkpython -m pip install pymongoNow goto /Code/ResumeParser directory and run below command to insert sample resume files to database
python insert_pdf.pyAfter this step, resume files will be inserted into meteor MongoDB database and unique identifier to access these files will be stored in keys.csv file. To parse the resumes, run the following command:
python parser.py- Prof. Christopher M. Homan (Rochester Institute of Technology)