Lessons Learned from Developing and Deploying a Large-Scale Employer Name Normalization System for O
Qiaoling Liu (CareerBuilder LLC); Josh Chao (CareerBuilder LLC); Thomas Mahoney (CareerBuilder LLC); Chris Min (CareerBuilder LLC); Faizan Javed (CareerBuilder LLC); Alan Chern (Georgia Institute of Technology); Valentin Jijkoun (Textkernel BV)
Employer name normalization, or linking employer names in job postings or resumes to entities in an employer knowledge base (KB), is important for many downstream applications in the online recruitment domain. Key challenges for employer name normalization include handling employer names from both job postings and resumes, leveraging the corresponding location and URL context, and handling name variations and duplicates in the KB. In this paper, we describe the CompanyDepot system developed at CareerBuilder, which uses machine learning techniques to address these challenges. We discuss the main challenges and share our lessons learned in deployment, maintenance, and utilization of the system over the past two years. We also share several examples of how the system has been used in applications at CareerBuilder to deliver value to end customers.