I hold a PhD in Computer Science. My main area of interests is computational linguistics (natural language processing; see an introductory book, an introductory text in Spanish and another introductory text). See a video presentation (mostly in Spanish) of our NLP Lab. See also some of my courses.
Research Professor and Head of the Natural Language Laboratory at the Centro de Investigación en Computación (CIC) of the Instituto Politécnico Nacional (IPN), Mexico. Member of the Mexican Academy of Sciences, founding member of the Mexican Academy of Computing, and National Researcher of Mexico (SNI level 3, the highest).
Collaborator of the L.N. Gumilyev Eurasian National University, Kazakhstan.
Senior Researcher of the International Laboratory for Intelligent Systems and Structural Analysis, National Research University Higher School of Economics (2015–2016); Invited Professor at the Institute for Modern Linguistic Research, Sholokhov Moscow State University for the Humanities (2013–2015), Russia.
Distinguished Visiting Professor at Chung-Ang University (2003–2004), Seoul, Korea.
My CV lists my publications, projects, and awards. I have h-index 53, with" more than 13,000 citations to my papers (I have checked manually first 2000 citations to my papers to be not self-citations)-->. According to Guide2Research, I am within top 4 researchers in Mexico (and top 3000 in the world) in computer science and electronics. In addition, According to Springer's rating, I am one of the most productive authors in Artificial Intelligence and Computational Linguistics, and one of the most productive Mexican computer scientists. Also, According to Google Scholar, I am within top 100 most-cited authors in Computational Linguistics. My Erdős number is 3 or 4 and my Dijkstra number is 5. My Microsoft Academic Search rating is 10 in NLP, 7 in DB, 4 in AI. See info about me in DBLP, DBLife, ResearchGate, LinkedIn, Wikipedia.
I am the founder and chair of the CICLing International Conference series. I have been Honorary Chair of ENC-2008, Program co-chair of some recent MICAI, CIC, CORE, and some other conferences. I am founding Editor-in-Chief of the International Journal of Computational Linguistics and Applications (IJCLA) and Editor-in-Chief of the journal POLIBITS. I am a member of the Board of Ex-Presidents of the Mexican Society of Artificial Intelligence (SMIA), of which I have been the President for the term 2013–2014.
I have been, or currently am, advisor of more than 30 PhD, more than 30 MSc, and some BSc students, who are citizens of 17 countries: . I supervise, or have supervised, theses in 7 countries: . My students have received important awards, because I taught them my key to success. All my students (including foreigners) receive very attractive scholarships (see details). Foreign students can apply for an even higher scholarship to study with us. Our PhD and MSc programs are certified as international-level quality programs.
To help students to better choose their research topic, plan their research, present their findings, and write papers, I prepared a presentation on Do-s and Don't-s in your PhD (and not only in PhD). There is a lot more to say about it. You may want to invite me for a talk on this topic. I also maintain a handbook for our Lab's students.
Tools or lexical resources (most of them, free to use) developed in our Laboratory or together with our collaborators over last decades include:
CORDIAM: diachronic and diatopic corpus of Spanish of the Americas, with search interface
CrossLexica dictionary web interface: a very large a very large dictionary of Russian collocations and semantic derivates
PerSent: Persian sentiment analysis and opinion mining lexicon: polarity for about 1500 Persian words.
Corpus for implicit aspect extraction and implicit aspect indicator extraction in opinion mining
EmoSenticNet: Augmented SenticNet / expanded WordNet Affect for opinion mining and sentiment analysis
Spanish verb-noun lexical functions corpus (semantics of Spanish collocations)
Open Fact Extraction gold standard datasets for Spanish and parallel Spanish-English
Russian corpus of coordinated pairs in Internet queries (association network); search term co-occurrences
CICWN: a Java WordNet API that allows Java applications to retrieve data from WordNet
CICWSD: a Java WSD API for Word Sense Disambiguation
EvolutionJ: a Java API for global numerical optimization (to appear soon)
Classifier: a tool for determining the main topic of a document and topical clustering of documents
AGME Spanish morphology: analyzer and dictionary of Spanish word forms with morphological information
RMorph Russian morphology: analyzer and dictionary of Russian word forms with morphological information
Parser: a simple syntactic parser with a grammar for Spanish
See also my other research projects.
One of our great rewards in academic life is travelling. I've visited 61 countries (shown in green):
Thank you for your interest!
I belong to
Life I lead
Last modified 23-abr.-2021
My name is A. Gelbukh, Alexander Gelbukh, Alexander F. Gelbukh, A. F. Gelbukh, Александр Гельбух, but people sometimes incorrectly spell it as A. F. Gel'bukh, A. F. Gel'bukh, Alexandre Guelboukh Kahn, Alexandre Felixovitch Guelboukh, Gelboukh, Guelbukh, Gelbuch, Guelbouch, Gelbouch, Guelbuch, Gelbuk, Guelbouk, Gelbouk, Guelbuk, Gelbuck, Guelbouck, Gelbouck, Guelbuck, Gelbuh, Guelbouh, Gelbouh, Guelbuh, Gelbuckh, Guelbuckh, Gelbouckh, Guelbouckh. Keywords in Spanish: procesamiento de lenguaje natural, procesamiento de texto, lingüística computacional, inteligencia artificial, Doctorado, Maestría. See also International Conference on Computational Linguistics and Intelligent Text Processing, Natural Language Processing, Human Language Technologies. See also my file list and site list.