Centro de Investigación en Computación

Olga Kolesnikova

Laboratorio de Procesamiento de Lenguaje Natural

Curriculum

Contacto

Nivel SNII: 2 (2022-2026 )
Email: kolesnikova@cic.ipn.mx
Extensión(es): 56544

Total de publicaciones 26

Título Descripción Fecha
Advanced machine learning techniques for social support detection on social media Heliyon 2025-05-01
Detection of Biased Phrases in the Wiki Neutrality Corpus for Fairer Digital Content Management Using Artificial Intelligence Big Data and Cognitive Computing 2025-07-21
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding 31st International Conference on Computational Linguistics, COLING 2025 2025-01-19
Explainable AI: XAI-guided context-aware data augmentation Expert Systems with Applications 2025-09-15
Harnessing Uncleaned Data for Stress Detection in Tamil and Telugu Code-Mixed Texts Computacion y Sistemas 2025-07-01
Hybrid Machine Learning and Deep Learning Approaches for Insult Detection in Roman Urdu Text AI 2025-02-01
Lexical Function Detection in Spanish Collocations Using Transformer Architecture Computacion y Sistemas 2025-04-01
Multi-Level Depression Severity Detection with Deep Transformers and Enhanced Machine Learning Techniques AI 2025-07-15
ORUD-Detect: A Comprehensive Approach to Offensive Language Detection in Roman Urdu Using Hybrid Machine Learning–Deep Learning Models with Embedding Techniques Information 2025-02-01
Performance Tradeoffs in Adaptive Hybrid Encryption and Decryption Techniques Security Analysis for Optimized Protection in IoT-Environ-mental Data Systems Contemporary Mathematics 2025-08-26
Reconocimiento de habla interna usando señales EEG: Estado del arte y metodología propuesta Research in Computing Science 2025-02-01

Título Descripción Fecha
Analyzing Emotional Trends from X Platform Using SenticNet: A Comparative Analysis with Cryptocurrency Price Cognitive Computation 2024-08-09
From Simple Detection to Quality-aware Prediction: Exploring Argument Complexity with Machine Learning Research in Computing Science 2024-12-01
Multi-Instrument Based N-Grams for Composer Classification Task Computación y Sistemas 2024-01-01
Psycholinguistic and emotion analysis of cryptocurrency discourse on X platform Scientific Reports 2024-12-01

Título Descripción Fecha
Ginger Disease Detection Using a Computer Vision Pre-trained Model Innovations in Machine and Deep Learning: Case Studies and Applications 2023-06-27
Low-Resource Neural Machine Translation Improvement Using Source-Side Monolingual Data Applied Sciences (Switzerland) 2023-01-01

Título Descripción Fecha
Improved Twitter Virality Prediction using Text and RNN-LSTM International Journal of Combinatorial Optimization Problems and Informatics 2021-09-01
Resolución de anáfora directa basada en conocimiento para pronombres definitivos Computación y Sistemas 2021-04-01

Título Descripción Fecha
Automatic Detection of Semantic Classes of Verb-Noun Collocations Computación y Sistemas 2020-06-01

Título Descripción Fecha
Measuring Non-compositionality of Verb-Noun Collocations using Lexical Functions and WordNet Hypernyms Lecture Notes in Artificial Intelligence 2015-12-10

Título Descripción Fecha
Modelo computacional del diálogo basado en reglas aplicado a un robot guía móvil Polibits 2014-12-01

Título Descripción Fecha
Multiword Expressions in NLP: General Survey and a Special Case of Verb-Noun Constructions Emerging Applications of Natural Language Processing: Concepts and New Research 2013-01-01
Semantic analysis of verbal collocations with lexical functions Springer 2013-01-01

Título Descripción Fecha
Semantic relations between collocations: A Spanish case study Revista Signos 2012-03-01
Supervised Learning Algorithms Evaluation on Recognizing Semantic Types of Spanish Verb-Noun Collocations Computación y Sistemas 2012-03-01

Total de congresos 45

Título Descripción Fecha
CIC-NLP at GenAI Detection Task 1: Advancing Multilingual Machine-Generated Text Detection 1st Workshop on GenAI Content Detection, GenAIDetect 2025 2025-01-19
CIC-NLP at GenAI Detection Task 1: Leveraging DistilBERT for Detecting Machine-Generated Text in English 1st Workshop on GenAI Content Detection, GenAIDetect 2025 2025-01-19
CULEMO: Cultural Lenses on Emotion - Benchmarking LLMs for Cross-Cultural Emotion Understanding 63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 2025-07-27
Pragmatic Generalization in LLMs: Insights from Fine-Tuning and Evaluating on Multilingual Sarcasm 24th Mexican International Conference on Artificial Intelligence, MICAI 2025 2025-11-03
Rewarding Sentiment Consistency: Reinforcement Learning for Multilingual Summarization 24th Mexican International Conference on Artificial Intelligence, MICAI 2025 2025-11-03
Sarcasm Detection in Roman Urdu Text: A Comprehensive Study Using Machine Learning and Large Language Model 24th Mexican International Conference on Artificial Intelligence, MICAI 2025 2025-11-03
Synthetic Data Generation for Purépecha Machine Translation Using Linguistically Augmented LLMs 24th Mexican International Conference on Artificial Intelligence, MICAI 2025 2025-11-03

Título Descripción Fecha
CEthio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI 6th International Conference on AI in Computational Linguistics, ACLing 2024 2024-09-21
Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI 6th International Conference on AI in Computational Linguistics, ACLing 2024 2024-09-21
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024 2024-05-20
EthioMT: Parallel Corpus for Low-resource Ethiopian Languages 5th Workshop on Resources for African Indigenous Languages, RAIL 2024 at LREC-COLING 2024 - Workshop Proceedings 2024-05-25
Habesha@DravidianLangTech 2024: Detecting Fake News Detection in Dravidian Languages using Deep Learning 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024 2024-01-01
Hope Speech in Social Media Texts using Transformer 6th Iberian Languages Evaluation Forum, IberLEF 2024 2024-09-24
HOPE2024@IberLEF: A Cross-Linguistic Exploration of Hope Speech Detection in Social Media 6th Iberian Languages Evaluation Forum, IberLEF 2024 2024-09-24
HOPE@IberLEF 2024: Beyond Binary Bounds—Classifying Hope in Online Discourse 6th Iberian Languages Evaluation Forum, IberLEF 2024 2024-09-24
IntelliLeksika at HOMO-MEX 2024: Detection of Homophobic Content in Spanish Lyrics with Machine Learning 6th Iberian Languages Evaluation Forum, IberLEF 2024 2024-09-24
Lexicon-based Language Relatedness Analysis 6th International Conference on AI in Computational Linguistics, ACLing 2024 2024-09-21
Lidoma@LT-EDI 2024:Tamil Hate Speech Detection in Migration Discourse 4th Workshop on Language Technology for Equality, Diversity, Inclusion, LT-EDI 2024 2024-03-22
NLP Progress in Indigenous Latin American Languages 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 2024-06-16
Pinealai at SemEval-2024 Task 1: Exploring Semantic Relatedness Prediction using Syntactic, TF-IDF, and Distance-Based Features 18th International Workshop on Semantic Evaluation, SemEval 2024, co-located with the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2024 2024-06-20
Social Media Hate and Offensive Speech Detection Using Machine Learning Method 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024 2024-03-22

Título Descripción Fecha
Analysis of Emotions in Speech Acts for Chatbots: An Overview and a Model Proposal 2023 IEEE Symposium Series on Computational Intelligence, SSCI 2023 2023-12-05
Bilingual Word-Level Language Identification for Omotic Languages 11th EAI International Conference on Advancement of Science and Technology, ICAST 2023 2023-08-25
Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models 3rd Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2023, co-located with ACL 2023 2023-07-14
Evaluating the Effectiveness of Hybrid Features in Fake News Detection on Social Media 5th International Conference on Information and Communication Technology for Development for Africa, ICT4DA 2023 2023-10-26
Habesha@DravidianLangTech: Abusive Comment Detection using Deep Learning Approach 3rd Workshop on Speech and Language Technologies for Dravidian Languages, DravidianLangTech 2023 2023-09-07
Habesha@DravidianLangTech: Utilizing Deep and Transfer Learning Approaches for Sentiment Analysis 3rd Workshop on Speech and Language Technologies for Dravidian Languages, DravidianLangTech 2023 2023-09-07
LIDOMA at HOMO-MEX2023@IberLEF: Hate Speech Detection Towards the Mexican Spanish-Speaking LGBT+ Population. The Importance of Preprocessing Before Using BERT-Based Models 2023 Iberian Languages Evaluation Forum, IberLEF 2023 2023-09-26
LIDOMA at HOPE2023@IberLEF: Hope Speech Detection Using Lexical Features and Convolutional Neural Networks 2023 Iberian Languages Evaluation Forum, IberLEF 2023 2023-09-26
LIDOMA@DravidianLangTech: Convolutional Neural Networks for Studying Correlation Between Lexical Features and Sentiment Polarity in Tamil and Tulu Languages 3rd Workshop on Speech and Language Technologies for Dravidian Languages, DravidianLangTech 2023 2023-09-07
Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities 4th Workshop on Resources for African Indigenous Languages, RAIL 2023 2023-05-06
Parallel Corpus for Indigenous Language Translation: Spanish-Mazatec and Spanish-Mixtec 3rd Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2023, co-located with ACL 2023 2023-07-14
Stock Market Performance Analytics Using XGBoost 22nd Mexican International Conference on Artificial Intelligence, MICAI 2023 2023-11-13
Transformer-Based Hate Speech Detection for Multi-Class and Multi-Label Classification 2023 Iberian Languages Evaluation Forum, IberLEF 2023 2023-09-26
Zavira at HOPE2023@IberLEF: Hope Speech Detection from Text using TF-IDF Features and Machine Learning Algorithms 2023 Iberian Languages Evaluation Forum, IberLEF 2023 2023-09-26

Título Descripción Fecha
CIC at CheckThat! 2022: Multi-class and Cross-lingual Fake News Detection 2022 Conference and Labs of the Evaluation Forum, CLEF 2022 2022-09-05
Detection of Aggressive and Violent Incidents from Social Media in Spanish using Pre-trained Language Model 2022 Iberian Languages Evaluation Forum, IberLEF 2022 2022-09-20
Improving Neural Machine Translation for Low Resource Languages Using Mixed Training: The Case of Ethiopian Languages 21st Mexican International Conference on Artificial Intelligence, MICAI 2022 2022-10-24
The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation 2022 International Conference on Information and Communication Technology for Development for Africa, ICT4DA 2022 2022-11-28
Urdu Named Entity Recognition with Attention Bi-LSTM-CRF Model 21st Mexican International Conference on Artificial Intelligence, MICAI 2022 2022-10-24

Título Descripción Fecha
Virality Prediction for News Tweets Using RoBERTa 20th Mexican International Conference on Artificial Intelligence, MICAI 2021 2021-10-25

Título Descripción Fecha
Lexical Function Identification Using Word Embeddings and Deep Learning 18th Mexican International Conference on Artificial Intelligence, MICAI 2019 2019-10-27

Título Descripción Fecha
Supervised Learning for Semantic Classification of Spanish Collocations Lecture Notes in Computer Science 2010-09-27
Supervised Machine Learning for Predicting the Meaning of Verb-Noun Combinations in Spanish Lecture Notes in Computer Science; 9th Mexican International Conference on Artificial Intelligence 2010-11-08

Título Descripción Fecha
Social Media Fake News Classification Using Machine Learning Algorithm 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024

Total de proyectos 17

Título Rol
Modelos inteligentes de lenguaje natural para recuperación de información a partir de datos textuales Responsable técnico
Aprendizaje profundo para datos tabulares. Colaborador
Optimización combinatoria inteligente con aplicación a entornos urbanos Colaborador

Título Rol
TRADUCCIÓN AUTOMÁTICA PARA LENGUAS CON BAJOS RECURSOS DIGITALES Responsable técnico

Título Rol
Traducción automática para lenguas con bajos recursos digitales Responsable técnico

Título Rol
Investigación y desarrollo de funciones de correlación en conjuntos involutivos Colaborador
Análisis computacional de similitud semántica en colocaciones y combinaciones libres de palabras Responsable técnico

Título Rol
Análisis de emociones, condiciones mentales, lenguaje persuasivo y lenguaje de odio en los textos en Internet y redes sociales Indefinido

Título Rol
Modelos del lenguaje, sentimientos y opiniones con métodos de aprendizaje automático y aprendizaje profundo Colaborador
Modelos del lenguaje, sentimientos y opiniones con métodos de aprendizaje automático y aprendizaje profundo Colaborador

Título Rol
Búsqueda automática de respuestas en textos basada en la similitud semántica y sintáctica Colaborador
Tratamiento computacional de afectividad en el texto y en la música Colaborador

Título Rol
Análisis sintáctico y semántico de textos aplicado a tareas de educación, derecho y redes sociales. Colaborador
Desarrollo de métodos de construcción de medidas de asociación para diferentes áreas de aplicación. Colaborador

Título Rol
Extracción de hechos y desambiguación en la detección de opiniones y polaridad en el texto Colaborador

Título Rol
Análisis de expresiones compuestas, afectividad y personalidad en los textos con los métodos de aprendizaje automático. Colaborador

Título Rol
Desambiguación y agrupación automática de los sentidos de las palabras para las aplicaciones en el procesamiento computacional de lenguaje natural. Colaborador

Total de alumnos graduados: 13


Alumno Tesis Programa Rol
Diana Anahí Ledesma Roque "Interpretabilidad del modelo BERT en el contexto de la similitud semántico" MCC Director 1
Mikhail Krasitskii Automatic Evaluation of Affect of Summarization on Sentiment Analysis MCC Director 1
Maksim Olimpiadi Automatic Humor Identification in Short Texts Using Maching Learning MCC Director 1
Noman Ashraf Context-Based Abusive Language Detection DCC Director 2
Cristina Alicia Díaz Jiménez Coreference Resolution Using Methods of Word Sense Disambiguation MCC Director 2
Amna Naseeb Detection of Conversational Implicatures using Machine Learning Methods MCIC Director 1
Angel Raul Maldonado Soriano Large Language Models and Sentiment Analysis for Recommender Systems MCC Director 1
Mesay Gemeda Yigezu Multilingual Hate Speech Detection for Low Resourced Languages DCC Director 1
Atnafu Lambebo Tonja Neural Machine Translation for Low Resource Languages DCC Director 1
Christian Efraín Maldonado Sifuentes Prediction of News Tweets Virality using Linguistic Features and Deep Learning DCC Director 2
Jesús Alexander Alvarado Gutiérrez Resolución de correferencia con aprendizaje profundo DCC Director 2
Arturo Hernández Miranda “Funciones léxicas en español utilizando embeddings” MCC Director 2
Jesús Alexander Alvarado Gutiérrez “Knowledge-rich Techniques for Anaphora Resolution” MCC Director 2

Total de tesis que dirige: 5

  • DCC: 3

  • MCC: 2

Alumno Tesis Programa Rol
Emmanuel Quetzalcóatl Castro Munguía Análisis de emociones en actos del habla para agentes conversacionales con características afectivas DCC Director 1
Moein Shahiki Tash Cryptocurrency Discourse Analysis Using Machine Learning DCC Director 1
Alan Rodrigo López López Herramienta de diálogo automatizada para apoyar el diagnóstico de enfermedades mentales MCC Director 1
Antonia Almudena López Gómez Hybrid Method for Measuring Semantic Similarity in Computational Linguistics MCC Director 1
Cecilia González Servín Traducción automática purépecha-español usando modelos grandes de lenguaje DCC Director 1