Adapting support vector optimisation algorithms to textual gender classification
dc.contributor.author | Gómez, Javier | |
dc.contributor.author | Alfaro, Cesar | |
dc.contributor.author | Ortega, Felipe | |
dc.contributor.author | Moguerza, Javier M. | |
dc.contributor.author | Algar, Maria Jesus | |
dc.contributor.author | Moreno, Raul | |
dc.date.accessioned | 2024-06-26T09:29:49Z | |
dc.date.available | 2024-06-26T09:29:49Z | |
dc.date.issued | 2024-04-13 | |
dc.description | Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature | es |
dc.description.abstract | In this paper, we focus on the problem of determining the gender of the person described in a biographical text. Since support vector machine classifiers are well suited for text classification tasks, we present a new stopping criterion for support vector optimisation algorithms tailored to this problem. This new approach exploits the geometric properties of the vector representation of such content. An experiment on a set of English and Spanish biographical articles retrieved from Wikipedia illustrates this approach and compares it to other machine learning classification algorithms. The proposed method allows real-time classification algorithm training. Moreover, these results confirm the advantage of leveraging additional gender information in strongly inflected languages, like Spanish, for this task | es |
dc.identifier.citation | Gomez, J., Alfaro, C., Ortega, F. et al. Adapting support vector optimisation algorithms to textual gender classification. TOP (2024). https://doi.org/10.1007/s11750-024-00671-1 | es |
dc.identifier.doi | 10.1007/s11750-024-00671-1 | es |
dc.identifier.issn | 1863-8279 (online) | |
dc.identifier.issn | 1134-5764 (print) | |
dc.identifier.uri | https://hdl.handle.net/10115/35140 | |
dc.language.iso | eng | es |
dc.publisher | Springer | es |
dc.rights | Atribución 4.0 Internacional | * |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Support vector machines | |
dc.subject | Machine learning | |
dc.subject | Nonlinear optimisation | |
dc.subject | Text mining | |
dc.subject | Gender identification | |
dc.title | Adapting support vector optimisation algorithms to textual gender classification | es |
dc.type | info:eu-repo/semantics/article | es |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- s11750-024-00671-1.pdf
- Tamaño:
- 2.18 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 2.67 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: