Determining Column Numbers in Résumés with Clustering

Keskin Ş. R., Balı Y., Orman G. K., Daniş F. S., Turhan S. N.

18th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations, AIAI 2022, Hersonissos, Greece, 17-20 June 2022, vol. 647 IFIP, pp. 460-471 (Full Text Paper)

Publication Type: Conference Paper / Full Text Paper
Volume: 647 IFIP
DOI: 10.1007/978-3-031-08337-2_38
City of Publication: Hersonissos
Country of Publication: Greece
Pages: pp. 460-471
Keywords: Information Extraction, Resume Parse, DBSCAN, K-means
Affiliated with Galatasaray University: Yes

Abstract

In the recruitment process, manually reviewing résumés is time-consuming for recruiters. This review process can benefit from artificial-intelligence-aided systems that extract the actual meaning within résumés and structure their content. However, there is no standard for writing résumés, and the personalized structure of each received résumé makes this task highly challenging. This work tackles one part of the problem of structuring résumés. More specifically, we first focus on finding the number of columns in a résumé, since once the main parts of the résumé are separated, the subdivisions can easily be analysed. This study thus formalizes the problem of finding the columns of a résumé as a clustering problem. The experiments are performed on a data set of custom Turkish résumés with up to two columns, to which we apply two algorithms: K-means and density-based spatial clustering of applications with noise (DBSCAN). The experiments show that the optimal number of clusters relates strongly to the valid column number. Our method is not limited to résumés but can be applied to any unstructured textual data.
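
The idea of treating column detection as a clustering problem can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which features or parameters were used, so the choice of the left x-coordinate of each text line as the only feature, the function names `dbscan_1d` and `estimate_columns`, the values of `eps` and `min_pts`, and the sample coordinates are all assumptions made for illustration. The sketch runs a plain-Python DBSCAN over one-dimensional data and takes the number of resulting clusters as the estimated column count.

```python
def dbscan_1d(values, eps, min_pts):
    """Label 1-D points with DBSCAN; the label -1 marks noise."""
    n = len(values)
    # neighbours[i] lists every point within eps of point i (itself included).
    neighbours = [
        [j for j in range(n) if abs(values[i] - values[j]) <= eps]
        for i in range(n)
    ]
    labels = [None] * n
    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        if len(neighbours[i]) < min_pts:
            labels[i] = -1  # provisional noise; may later become a border point
            continue
        cluster += 1  # point i is a core point, so it starts a new cluster
        labels[i] = cluster
        queue = list(neighbours[i])
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border point
            if labels[j] is not None:
                continue  # already assigned; border points are not expanded
            labels[j] = cluster
            if len(neighbours[j]) >= min_pts:
                queue.extend(neighbours[j])  # j is also a core point: keep growing
    return labels


def estimate_columns(left_edges, eps=15.0, min_pts=3):
    """Estimate the column count as the number of non-noise clusters."""
    labels = dbscan_1d(left_edges, eps, min_pts)
    return len({label for label in labels if label != -1})


# Hypothetical left x-coordinates (in points) of text lines on a two-column page;
# the value 175 stands for a stray line, such as a centred heading.
left_edges = [40, 42, 41, 40, 43, 310, 312, 311, 309, 175]
print(estimate_columns(left_edges))
```

In this toy input the line starts fall into two dense groups, and the isolated value is treated as noise rather than as a third column. With real résumés, indented bullets or centred headings could form extra clusters, so `eps` and `min_pts` would need tuning against labelled examples.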