Resume Parsing Across Multiple Job Domains Using a BERT-Based NER Model

Madhumita Srivastava, Paul Greaney

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study presents a resume information extraction system using named entity recognition (NER) techniques. By harnessing the power of BERT, a state-of-the-art transfer learning model, in conjunction with NER, we develop a model which accurately extracts relevant information from resumes. Our approach involves fine-tuning the pre-trained BERT base model on a customised NER resume dataset, which comprises a limited volume of annotated resume data from across four diverse job domains: information technology, human resources, consultancy, and engineering. To achieve this, we utilised the NLP capabilities of spaCy pipelines. Our results show that even with a constrained training dataset and minimal fine-tuning, transfer learning can be successfully leveraged to extract named entities from resumes, achieving respectable accuracy tailored to our specific application. Our findings underscore the pivotal role of data size and annotation quality in custom NER training. The model's generalisation and contextual comprehension heavily depend on these factors, reinforcing the need for carefully selected training data. This paper sheds light on the relationship between transfer learning, NER, and data quality in developing a sophisticated resume information extraction system.

Original languageEnglish
Title of host publication2023 31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350360219
DOIs
Publication statusPublished - 2023
Event31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023 - Letterkenny, Ireland
Duration: 7 Dec 20238 Dec 2023

Publication series

Name2023 31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023

Conference

Conference31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023
Country/TerritoryIreland
CityLetterkenny
Period7/12/238/12/23

Keywords

  • machine learning
  • named entity recognition
  • natural language processing

Fingerprint

Dive into the research topics of 'Resume Parsing Across Multiple Job Domains Using a BERT-Based NER Model'. Together they form a unique fingerprint.

Cite this