Ekaterina Garanina

Hi! I'm Ekaterina (Kate), an EM LCT Master's student currently studying at the University of Groningen. I'm passionate about NLP research and have experience in multiple NLP fields, with NLG being a current focus. In my free time, I enjoy hiking, reading, and drawing.

Profile photo
Languages Russian English French German Czech
Programming Python bash R Java
Machine learning numpy pandas scikit-learn tensorflow pytorch transformers
Databases SQL MongoDB Elasticsearch

Education

2021–2023

EM Master’s Program in Language and Communication Technologies

1st year: Charles University, Czech Republic

2nd year: University of Groningen, the Netherlands

Holder of Erasmus Mundus Joint Master's scholarship.

2014–2018

National Research University Higher School of Economics

BA in Fundamental and Applied Linguistics, minor in Data Analysis

Thesis: Automated Error Detection in English Examination Essays Written by Russian Students

GPA: 9.54

Work experience

2022–2023

Master's projects

  • Implemented a system for sexism detection and classification at SemEval-2023 (github, paper in review);
  • Developed a toolkit for table-to-text generation (github, paper in review);
  • Developed a Transformer-based model for semantic parsing (github);
  • Did research on length-based overfitting in Transformers (github).
2019–2022

NLP developer at LegalTech startup nlogic.ai

  • Implemented ML-based classification of legal documents and entity parsing in semi-structured texts;
  • Created complex retrieval-based systems for answering questions.
2018, 2019

Teaching at Summer and Winter schools for secondary school students at Moscow Institute of Physics and Technology

  • Taught NLP, curated a final project, organized a local hackathon.
2015–2018

Bachelor's projects

Publications and conference presentations

Before 2020, my surname was Gerasimenko.

Publications

  • Kasner, Z., Garanina, E., Plátek, O., & Dušek, O. (2023). TabGenie: A Toolkit for Table-to-Text Generation (arXiv:2302.14169). arXiv.
  • Daniel, M., von Waldenfels, R., Ter-Avanesova, A., Kazakova, P., Schurov, I., Gerasimenko, E., Ignatenko, D., Makhlina, E., Tsfasman, M., Verhees, S., Vinyar, A., Zhigulaskaja, V., Ovsjannikova, M., Say, S., & Dobrushina, N. (2019). Dialect loss in the Russian North: Modeling change across variables. Language Variation and Change, 31(3), 353–376.
  • Puzhaeva, S. Yu., Gerasimenko, E. A., Zakharova, E. S., & Rakhilina, E. V. (2018). Automatic Extraction of Formulaic Expressions from Russian Texts. Vestnik NSU. Series: Linguistics and Intercultural Communication, 16(2), 5–18.
  • Vinyar, A. I., & Gerasimenko, E. A. (2018). Non-syntactic restrictions on incorporation in Chukchi. Acta Linguistica Petropolitana, XIV(2), 78–110.

Conference presentations

  • Vinogradova, O., & Gerasimenko, E. Design of Test-Making Tools for the Learner Corpus. The 9th International Corpus Linguistics Conference. 24-28 July 2017, Birmingham.
  • Vinyar, A., & Gerasimenko, E. Sociolinguistic Variation of the Distribution of -to Postpositional Clitic Forms in one North Russian dialect. International Conference on Linguistic Variation in Europe (ICLaVE). 06-09 June 2017, Malaga.